Stars
Stable Diffusion web UI
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
real time face swap and one-click video deepfake with only a single image
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
CowAgent是基于大模型的超级AI助理,能主动思考和任务规划、访问操作系统和外部资源、创造和执行Skills、拥有长期记忆并不断成长,比OpenClaw更轻量和便捷。同时支持微信、飞书、钉钉、企微、QQ、公众号、网页等接入,可选择OpenAI/Claude/Gemini/DeepSeek/ Qwen/GLM/Kimi/LinkAI,能处理文本、语音、图片和文件,可快速搭建个人AI助理和企…
A generative speech model for daily dialogue.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
open-source agentic AI data assistant for the next generation of AI + Data products.
🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天记录创造…
Train your AI self, amplify you, bridge the world
The official GitHub page for the survey paper "A Survey of Large Language Models".
Open-source framework for conversational voice AI agents
AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusion,comfyUI直接AI生图。Generate short videos with one click using A…
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
ReMe: Memory Management Kit for Agents - Remember Me, Refine Me.
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
AI-powered Xiaohongshu/Rednote content creation and publishing tool with PyQt desktop UI, FastAPI service, login-state reuse, preview publish, and automated browser workflows.
English pronunciation correction teacher built with gemini
An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)