Skip to content
View mnjm's full-sized avatar
🗿
🗿

Block or report mnjm

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

66 stars written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,431 11,911 Updated Dec 15, 2025

Animation engine for explanatory math videos

Python 85,444 7,177 Updated Mar 14, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 75,778 8,481 Updated Mar 21, 2026

Deep Learning for humans

Python 63,949 19,740 Updated Mar 21, 2026

Unified web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 57,482 4,841 Updated Mar 22, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,366 9,427 Updated Nov 12, 2025

The best ChatGPT that $100 can buy.

Python 49,912 6,537 Updated Mar 17, 2026

Official inference framework for 1-bit LLMs

Python 36,304 3,128 Updated Mar 10, 2026

Official inference repo for FLUX.1 models

Python 25,335 1,867 Updated Jul 31, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,705 5,869 Updated Aug 14, 2024

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,931 2,063 Updated Jan 13, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,641 1,417 Updated Feb 8, 2026

Resume builder for academics and engineers

Python 16,066 1,147 Updated Mar 21, 2026

LLM Council works together to answer your hardest questions

Python 16,019 3,205 Updated Nov 22, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,779 2,216 Updated Jul 24, 2024

End-to-End Object Detection with Transformers

Python 15,172 2,662 Updated Mar 12, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,770 1,788 Updated Mar 17, 2026

An open source implementation of CLIP.

Python 13,544 1,260 Updated Mar 12, 2026

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 13,252 1,413 Updated Mar 22, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,966 1,581 Updated Feb 27, 2026

Access large language models from the command-line

Python 11,400 773 Updated Mar 17, 2026

🐍 Geometric Computer Vision Library for Spatial AI

Python 11,125 1,168 Updated Mar 22, 2026

Refine high-quality datasets and visual AI models

Python 10,490 728 Updated Mar 21, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,385 1,024 Updated Jul 1, 2024

tiny vision language model

Python 9,443 742 Updated Nov 14, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,924 605 Updated May 3, 2024

[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 5,944 725 Updated Mar 22, 2026

This tool has been deprecated. Use Agentic Document Extraction instead.

Python 5,275 600 Updated Jan 29, 2026

🐢 Open-Source Evaluation & Testing library for LLM Agents

Python 5,190 418 Updated Mar 20, 2026

A PyTorch native platform for training generative AI models

Python 5,171 754 Updated Mar 22, 2026
Next