Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A feature-rich command-line audio/video downloader
💫 Toolkit to help you get started with Spec-Driven Development
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
No fortress, purely open ground. OpenManus is Coming.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
We write your reusable computer vision tools. 💜
An open-source RAG-based tool for chatting with your documents.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Toolkit for linearizing PDFs for LLM datasets/training
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Translate the video from one language to another and embed dubbing & subtitles.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Train your AI self, amplify you, bridge the world
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
