🚀 dlcalc

command-line tools for deep learning optimization

📋 Overview

dlcalc is a collection of tools for deep learning practitioners, providing calculators and tools for:

🧮 Performance Modeling - Estimate training throughput, memory usage, and MFU
🌐 Topology Analysis - Analyze and optimize network topology for distributed training
📊 Metrics Conversion - Convert between different performance metrics
🔍 Checkpoint Analysis - Inspect and summarize model checkpoints

🔧 Installation

Via pip (recommended)

pip install dlcalc

or

From source

git clone https://github.com/jfc4050/dlcalc cd dlcalc pip install -e .

After this you should have access to the command line tools described below. Some people may need to add --user to their pip install command for them to properly go under $PATH.

🛠 Tools

📐 Performance Modeling

3D Training Calculator (`3dtrn`)

Calculator for estimating performance characteristics of ND parallel transformer training:

3dtrn examples/llama3_70b.yaml

We recommend to use this with profilers like NVIDIA Nsight Systems or PyTorch Profiler to give theoretical grounding to your performance profiling.

🌐 Topology Optimization

Tool	Command	Purpose
Visualizer	`topoviz`	Generate network topology graphs from Kubernetes clusters
Evaluator	`topoeval`	Analyze topology optimality for DP rings
Scheduler	`topoassign`	Compute topology-aware rank assignments

# Visualize cluster topology topoviz -h # Evaluate training job topology topoeval -h # Generate optimal rank assignments topoassign -h

📊 Metrics & KPIs

Samples/Sec → MFU Converter (`sps2mfu`)

Convert training throughput to Model FLOPs Utilization (MFU):

sps2mfu --samples-per-sec 100 --seqlen 2048 --model-size 70b \ --n-accelerators 512 --tflops-per-accelerator 312

Samples/Sec → Tokens/Day Converter (`sps2tpd`)

Calculate daily token throughput:

sps2tpd --samples-per-sec 100 --seqlen 2048

🔍 Utilities

Checkpoint Summarizer (`ckpt-summarize`)

Analyze PyTorch checkpoint contents:

ckpt-summarize model.pt

🧑‍💻 Development

Setup Development Environment

# Install with development dependencies pip install -e .[dev] # Install pre-commit hooks pre-commit install

Run Quality Checks

# Run all checks (formatting, linting, type checking, tests) bash checks

Testing

# Run full test suite pytest tests/ -v # Run with coverage pytest tests/ --cov=dlcalc --cov-report=term-missing

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📮 Support

Made with ❤️ for the deep learning community

Name		Name	Last commit message	Last commit date
Latest commit History 250 Commits
.github/workflows		.github/workflows
.vscode		.vscode
dlcalc		dlcalc
examples		examples
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
checks		checks
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 dlcalc

📋 Overview

🔧 Installation

Via pip (recommended)

From source

🛠 Tools

📐 Performance Modeling

3D Training Calculator (`3dtrn`)

🌐 Topology Optimization

📊 Metrics & KPIs

Samples/Sec → MFU Converter (`sps2mfu`)

Samples/Sec → Tokens/Day Converter (`sps2tpd`)

🔍 Utilities

Checkpoint Summarizer (`ckpt-summarize`)

🧑‍💻 Development

Setup Development Environment

Run Quality Checks

Testing

🤝 Contributing

📄 License

📮 Support

About

Uh oh!

Releases 13

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚀 dlcalc

📋 Overview

🔧 Installation

Via pip (recommended)

From source

🛠 Tools

📐 Performance Modeling

3D Training Calculator (3dtrn)

🌐 Topology Optimization

📊 Metrics & KPIs

Samples/Sec → MFU Converter (sps2mfu)

Samples/Sec → Tokens/Day Converter (sps2tpd)

🔍 Utilities

Checkpoint Summarizer (ckpt-summarize)

🧑‍💻 Development

Setup Development Environment

Run Quality Checks

Testing

🤝 Contributing

📄 License

📮 Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 13

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

3D Training Calculator (`3dtrn`)

Samples/Sec → MFU Converter (`sps2mfu`)

Samples/Sec → Tokens/Day Converter (`sps2tpd`)

Checkpoint Summarizer (`ckpt-summarize`)

Packages