Skip to content

Commit

Permalink
Update README.md (#305)
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 authored Nov 8, 2024
1 parent 6f71d75 commit 766d1d8
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions docs/README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,14 @@
# KubeAI: Private Open AI on Kubernetes
# KubeAI: AI Inferencing Operator

Get inferencing running on Kubernetes: LLMs, Embeddings, Speech-to-Text.

✅️ Drop-in replacement for OpenAI with API compatibility
🧠 Serve top OSS models (LLMs, Whisper, etc.)
⚖️ Scale from zero, autoscale based on load
🧠 Serve text generation models (LLMs, VLMs, etc.)
💬 Speech to Text API
🧮 Embedding/Vector API
🚀 Multi-platform: CPU-only, GPU, TPU
💾 Model caching with shared filesystems (EFS, Filestore, etc.)
⚖️ Scale from zero, autoscale based on load
🛠️ Zero dependencies (does not depend on Istio, Knative, etc.)
💬 Chat UI included ([OpenWebUI](https://github.com/open-webui/open-webui))
🤖 Operates OSS model servers (vLLM, Ollama, FasterWhisper, Infinity)
Expand Down

0 comments on commit 766d1d8

Please sign in to comment.