Skip to content
View gavin1332's full-sized avatar

Block or report gavin1332

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 49,051 6,826 Updated Mar 21, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 329,400 63,985 Updated Mar 22, 2026

A 10000+ hours dataset for Chinese speech recognition

Shell 594 53 Updated Jan 9, 2026

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 128,209 18,117 Updated Mar 22, 2026

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 5,467 766 Updated Sep 26, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,432 844 Updated Mar 10, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,865 4,941 Updated Mar 22, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,429 11,911 Updated Dec 15, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,503 1,119 Updated Mar 20, 2026

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,810 135 Updated Mar 13, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,977 1,939 Updated Jan 9, 2026

A generative speech model for daily dialogue.

Python 38,960 4,228 Updated Jan 18, 2026

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 5,560 348 Updated Sep 12, 2025

LLM training in simple, raw C/CUDA

Cuda 29,228 3,440 Updated Jun 26, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,782 748 Updated Mar 21, 2026

LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。

TypeScript 323 29 Updated Jun 19, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,105 410 Updated Mar 21, 2026

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,436 2,112 Updated May 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 68,869 8,392 Updated Mar 21, 2026

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,538 512 Updated Mar 22, 2026

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,742 483 Updated Jan 8, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,962 329 Updated Jul 31, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,826 82 Updated Jul 27, 2025

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 4,174 416 Updated Mar 19, 2026

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 79,541 15,154 Updated May 10, 2024

中文公开聊天语料库

Python 4,174 783 Updated Apr 23, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,430 207 Updated Feb 7, 2026

Awesome LLM compression research papers and tools.

1,794 119 Updated Feb 23, 2026

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,943 235 Updated Sep 6, 2023

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,219 3,645 Updated Jul 4, 2024
Next