Skip to content
View muyihao's full-sized avatar

Block or report muyihao

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 524 230 Updated Mar 23, 2026

大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"

Jupyter Notebook 1,782 124 Updated Mar 23, 2026

Train transformer language models with reinforcement learning.

Python 17,761 2,582 Updated Mar 23, 2026

Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.

Python 851 110 Updated Mar 23, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 7,960 847 Updated Mar 23, 2026
Python 2 Updated Mar 3, 2026

Deep Learning how-to's using Lance file format

Jupyter Notebook 23 6 Updated Jun 9, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,200 6,672 Updated Sep 30, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 332,159 64,721 Updated Mar 23, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,963 621 Updated Mar 23, 2026

An I/O benchmark for deep Learning applications

Python 104 56 Updated Mar 18, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,172 2,209 Updated Mar 23, 2026

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 318 13 Updated Mar 23, 2026

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,797 371 Updated Jul 25, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,777 516 Updated Mar 13, 2026

Nano vLLM

Python 12,391 1,772 Updated Nov 3, 2025

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

Python 1,251 174 Updated Mar 23, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 1 Updated Jan 30, 2026

A toolkit to run Ray applications on Kubernetes

Go 1 Updated Dec 21, 2025

A Cloud Native Batch System (Project under CNCF)

Go 5,398 1,308 Updated Mar 19, 2026

Production-Grade Container Scheduling and Management

Go 121,333 42,719 Updated Mar 23, 2026

A Gym for Agentic LLMs

Python 467 30 Updated Jan 21, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,929 372 Updated Dec 7, 2024

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,111 347 Updated Mar 23, 2026

Browse Lance tables from your local machine in a simple web UI. No database to set up. Mount a folder and go.

Python 24 5 Updated Mar 16, 2026

a static analytical model for LLM distributed training

Python 125 16 Updated Jan 8, 2026

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 2,925 770 Updated Mar 23, 2026

Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration

Go 5,341 1,082 Updated Mar 22, 2026
C++ 532 43 Updated Feb 10, 2026
Next