Tutorials
Code-Llama
Natural Language to Code-Llama
Code-Llama-7b
Model Endpoint for Codellama-7b
Custom-Container
Natural Language to Code-Llama
Gemma
Deploy Inference for Gemma
LLAMA 3
Using TensorRT-LLM
Meta LLAMA 3 8B-IT
Deploy Inference for Meta LLAMA 3 8B-IT
Meta LLMA 2
Deploy Inference Endpoint For Meta LLMA 2
Stable Diffusion v2.1
Deploy Inference Endpoint for Stable Diffusion v2.1
Stable Video Diffusion xt
Deploy Model Endpoint for Stable Video Diffusion xt
TorchServe
TorchServe
Triton Inference
Triton Inference
VLLM
VLLM with OpenAI Client
YOLOv8
Deploy Model Endpoint for YOLOv8