Skip to main content
LicensePRs Welcome

🎯 What You’ll Find Here

This cookbook aggregates battle-tested SGLang recipes covering:
  • Models: Mainstream LLMs and Vision-Language Models (VLMs)
  • Use Cases: Inference serving, deployment strategies, multimodal applications
  • Hardware: GPU and CPU configurations, optimization for different accelerators
  • Best Practices: Configuration templates, performance tuning, troubleshooting guides
Each recipe provides step-by-step instructions to help you quickly implement SGLang solutions for your specific requirements.

Guides

Autoregressive Models

Qwen

DeepSeek

Llama

GLM

OpenAI

Moonshotai

MiniMax

NVIDIA

Ernie

InternVL

InternLM

Jina AI

Mistral

Xiaomi

FlashLabs

Diffusion Models

FLUX

Qwen-Image

Wan

Z-Image

Benchmarks

Reference

πŸš€ Quick Start

  1. Browse the recipe index above to find your model
  2. Follow the step-by-step instructions in each guide
  3. Adapt configurations to your specific hardware and requirements
  4. Join our community to share feedback and improvements

🀝 Contributing

We believe the best documentation comes from practitioners. Whether you’ve optimized SGLang for a specific model, solved a tricky deployment challenge, or discovered performance improvements, we encourage you to contribute your recipes! Ways to contribute:
  • Add a new recipe for a model not yet covered
  • Improve existing recipes with additional tips or configurations
  • Report issues or suggest enhancements
  • Share your production deployment experiences
To contribute:
# Fork the repo and clone locally
git clone https://github.com/YOUR_USERNAME/sglang-cookbook.git
cd sglang-cookbook

# Create a new branch
git checkout -b add-my-recipe

# Add your recipe following the template in DeepSeek-V3.2
# Submit a PR!

πŸ› οΈ Local Development

Prerequisites

Setup and Run

Install the Mintlify CLI and start the development server:
# Install Mintlify CLI globally
npm i -g mint

# Start development server (hot reload enabled)
mint dev
The site will automatically open in your browser at http://localhost:3000.

πŸ“– Resources

πŸ“„ License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Let’s build this resource together! πŸš€ Star the repo and contribute your recipes to help the SGLang community grow.