Skip to content

Commit

Permalink
add line for lora adapter aware routing
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 committed Dec 4, 2024
1 parent f9c233d commit 1c9e069
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@ The easiest way to serve ML models in production. Supports LLMs, embeddings, and
✅️ OpenAI API Compatibility: Drop-in replacement for OpenAI
⚖️ Autoscaling: Scale from zero, autoscale based on load
🧠 Serve text generation models with vLLM or Ollama
🔌 Lora Adapter aware routing
💬 Speech to Text API with FasterWhisper
🧮 Embedding/Vector API with Infinity
🚀 Multi-platform: CPU, GPU, TPU
Expand Down

0 comments on commit 1c9e069

Please sign in to comment.