Stars
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
An open-source alternative to Claude Cowork built for teams, powered by opencode
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
This API provides programmatic access to the AlphaGenome model developed by Google DeepMind.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
[ICCV 2025] LayerAnimate: Layer-specific Control for Animation
A Conversational Speech Generation Model
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Genome modeling and design across all domains of life
Wan: Open and Advanced Large-Scale Video Generative Models
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
SkyReels V1: The first and most advanced open-source human-centric video foundation model
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
A feature-rich command-line audio/video downloader
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Janus-Series: Unified Multimodal Understanding and Generation Models
Get started quickly with Next.js, Postgres, Stripe, and shadcn/ui.
The trust-minimized, zero-knowledge bridging protocol, designed for censorship resistance, extremely high security, and usage in decentralized finance.