Stars
Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
An asynchronous streaming data management module for efficient post-training.
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Official implementation of Text2VectorSQL: Towards a Unified Interface for Vector Search and SQL Queries
OpenDCAI / MyScaleDB
Forked from OriginHubAI/MyScaleDBAI Database for unified, scalable SQL + vector data management, search and analytics
AI Database for unified, scalable SQL + vector management, search and analytics
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Pioneering Automated GUI Interaction with Native Agents
Fully open reproduction of DeepSeek-R1
Python tool for converting files and office documents to Markdown.
RAG that intelligently adapts to your use case, data, and queries
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…
A generative world for general-purpose robotics & embodied AI learning.
Scalable data pre processing and curation toolkit for LLMs
AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Co-create PowerPoint slide decks with AI
AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale


