Highlights
- Pro
AI/ML
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Python bindings for the Transformer models implemented in C/C++ using GGML library.
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Faster Whisper transcription with CTranslate2
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…
Tools for merging pretrained large language models.
Drag & drop UI to build your customized LLM flow
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
🚀 PR-Agent (Qodo Merge open-source): An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Supercharge Your LLM Application Evaluations 🚀
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
OpenUI let's you describe UI using your imagination, then see it rendered live.
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper