Stars
mini retro game console, raspberry pi zero w + waveshare gamepi15, optimized for framerate
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, e…
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Model Context Protocol Servers
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Solve Visual Understanding with Reinforced VLMs
Janus-Series: Unified Multimodal Understanding and Generation Models
Python tool for converting files and office documents to Markdown.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A local-ready LLM-generated and LLM-driven virtual pet with thoughts and feelings. 100% Javascript.
Production-ready platform for agentic workflow development.
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
[TPAMI 2026] ConsistentID : Portrait Generation with Multimodal Fine-Grained Identity Preserving
GPT4V-level open-source multi-modal model based on Llama3-8B
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
