Lists (26)
Sort Name ascending (A-Z)
🗿 3D
🔌 APIs
🔊 Audio
📋 Awesome Lists
📚 C++
👀 Computer Vision
💽 Data
👷 Dev Ops
☁️ Gaussian Splatting
🖼️ Gen-AI
📺 Graphics
📱 GUI
🕹️ Interactive ML
🔤 LLM
🧠 Machine Learning
📱 Mobile
💡 P-Comp
⚛ Physics & Math
📈 TouchDesigner
🎮 Unity
🕹️ Unreal
🪛 Utilities
Apps, helpers, tools📹 Video
👗 VTON
Virtual Try-On🌐 Web
🪟 Windows
- All languages
- AutoIt
- C
- C#
- C++
- CMake
- CSS
- Cython
- Dart
- Dockerfile
- Elixir
- GDScript
- GLSL
- Go
- HLSL
- HTML
- Haxe
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- MATLAB
- Makefile
- Markdown
- Mathematica
- Objective-C
- PHP
- PowerShell
- Processing
- PureScript
- Python
- QML
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- ShaderLab
- Shell
- Svelte
- Swift
- TypeScript
- VHDL
- Visual Basic .NET
- Vue
- Zig
Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
Clone a voice in 5 seconds to generate arbitrary speech in real-time
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
We write your reusable computer vision tools. 💜
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
OpenMMLab Detection Toolbox and Benchmark
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
Generative Models by Stability AI
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.







