vLLM TPU
Overview & TPU
Initializing search
GitHub
vLLM TPU
GitHub
Home
Getting Started
Getting Started
TPU Setup
Installation
Serving model
Basic Usage (I use HuggingFace model)
Basic Usage (I use HuggingFace model)
Overview & TPU
Recommended Models
Deploying model on GCE
Deploying model on GCE
v7x setup (Ironwood)
v6e setup (Trillium)
Code Examples
Code Examples
Multi Modal Inference
Offline LoRA
Advanced Usage (I need my custom model)
Advanced Usage (I need my custom model)
JAX
JAX
How it works
Model Development
Torch
Torch
How it works
Model Development
Your custom model on GCE
Recommended Features
See Also
See Also
Profiling
FAQ
Contributors Guide
Overview & TPU
Back to top