Stars
Training library for Megatron-based models with bidirectional Hugging Face conversion capability
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
Train transformer language models with reinforcement learning.
Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
Deep Learning how-to's using Lance file format
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
An I/O benchmark for deep Learning applications
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
A collection of RBIR projects and posts for anyone interested in joining this journey.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
muyihao / ray
Forked from ray-project/rayRay is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
muyihao / kuberay
Forked from ray-project/kuberayA toolkit to run Ray applications on Kubernetes
A Cloud Native Batch System (Project under CNCF)
Production-Grade Container Scheduling and Management
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Browse Lance tables from your local machine in a simple web UI. No database to set up. Mount a folder and go.
a static analytical model for LLM distributed training
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
