Skip to content
View gphorvath's full-sized avatar

Highlights

  • Pro

Block or report gphorvath

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI/ML

35 repositories

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,292 5,260 Updated Jan 22, 2025

Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 20,998 1,477 Updated Jan 21, 2025

Python bindings for the Transformer models implemented in C/C++ using GGML library.

C 1,828 140 Updated Jan 28, 2024

Chat with your documents offline using AI.

Python 713 103 Updated Sep 29, 2023

Python bindings for llama.cpp

Python 8,452 1,015 Updated Jan 20, 2025

Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.

Go 108,867 8,717 Updated Jan 21, 2025

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,710 309 Updated Jan 8, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,060 5,543 Updated Jan 22, 2025

Faster Whisper transcription with CTranslate2

Python 13,599 1,146 Updated Jan 1, 2025

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 7,990 755 Updated Jan 2, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 170,718 44,879 Updated Jan 21, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,349 2,214 Updated Jan 15, 2025

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,241 309 Updated Jan 19, 2025

Set of generative ai services in python

Python 7 2 Updated Jun 26, 2023

Tools for merging pretrained large language models.

Python 5,132 479 Updated Jan 6, 2025

Drag & drop UI to build your customized LLM flow

TypeScript 34,174 17,683 Updated Jan 21, 2025

Sweep: open-source AI-powered Software Developer for small features and bug fixes.

Jupyter Notebook 7,480 438 Updated Oct 23, 2024

🚀 PR-Agent (Qodo Merge open-source): An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Python 6,528 650 Updated Jan 21, 2025

The Memory layer for your AI apps

Python 24,066 2,227 Updated Jan 21, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

JavaScript 30,615 3,071 Updated Jan 22, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 7,935 803 Updated Jan 21, 2025

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,063 148 Updated Sep 3, 2024

OpenUI let's you describe UI using your imagination, then see it rendered live.

TypeScript 19,725 1,830 Updated Oct 21, 2024

An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Python 1,573 177 Updated Sep 9, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 64,280 6,883 Updated Jan 21, 2025

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

TypeScript 6,656 725 Updated Jan 15, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,188 1,073 Updated Jan 21, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,722 164 Updated Jan 21, 2025

🙌 OpenHands: Code Less, Make More

Python 44,204 4,899 Updated Jan 21, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 26,617 2,072 Updated Jan 21, 2025