Pinned Loading
- accelerate
accelerate PublicForked from huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Python
- DeepSpeed
DeepSpeed PublicForked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python
- Megatron-Bridge
Megatron-Bridge PublicForked from NVIDIA-NeMo/Megatron-Bridge
Training library for Megatron-based models
Python
- Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
- mergekit
mergekit PublicForked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


