Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal domains, for both inference and training.
A feature-rich command-line audio/video downloader
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
Robust Speech Recognition via Large-Scale Weak Supervision
💫 Toolkit to help you get started with Spec-Driven Development
A high-throughput and memory-efficient inference and serving engine for LLMs
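Provides a practical interaction interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons & function plugins, project analysis & self-explanation for Python, C++ and other codebases, PDF/LaTeX paper translation & summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen (通义千问), deepseekcoder, iFlytek Spark (讯飞星火), ERNIE Bot (文心一言), llama2, rwkv, claude2, m…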
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Unified web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Interact with your documents using the power of GPT, 100% privately, no data leaks
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
AI agents running research on single-GPU nanochat training automatically
A collection of design patterns/idioms in Python
aider is AI pair programming in your terminal
🎨 Diagram as Code for prototyping cloud system architectures
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Instant voice cloning by MIT and MyShell. Audio foundation model.
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
Easily train a good VC model with voice data <= 10 mins!
A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic
State-of-the-art 2D and 3D Face Analysis Project
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
SGLang is a high-performance serving framework for large language models and multimodal models.




