Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
View all resources
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
GitHub Stars
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
Appleyc
/
DeepSpeed
Public
forked from
deepspeedai/DeepSpeed
Notifications
You must be signed in to change notification settings
Fork
0
Star
0
Code
Pull requests
0
Actions
Projects
Security
0
Insights
Additional navigation options
Code
Pull requests
Actions
Projects
Security
Insights
Commits
Branch selector
master
User selector
All users
All time
Commit History
Commits on Nov 23, 2021
Enable AVX256 on AMD CPU (#1360)
Show description for a637cc2
fumihwh
and
jeffra
authored
a637cc2
Copy full SHA for a637cc2
Removing `ImportError` from tutel import try/except (#1583)
Show description for 1bc13fe
alexandremuzio
and
jeffra
authored
1bc13fe
Copy full SHA for 1bc13fe
Replace brute force and add log (#1560)
Show description for e2b39de
chunyang-wen
and
jeffra
authored
e2b39de
Copy full SHA for e2b39de
Add documentation for TensorBoard logging (#1577)
Show description for e1b4aa8
manuelciosici
and
jeffra
authored
e1b4aa8
Copy full SHA for e1b4aa8
remove debug prints (#1585)
stas00
authored
bcf2bdd
Copy full SHA for bcf2bdd
Commits on Nov 20, 2021
bump to 0.5.8
jeffra
committed
8220674
Copy full SHA for 8220674
Commits on Nov 19, 2021
Several fixes for our read-the-docs build (#1579)
jeffra
authored
a8a17f2
Copy full SHA for a8a17f2
Enables ZeRO-3 inference (#1514)
jeffra
authored
2332cb3
Copy full SHA for 2332cb3
Commits on Nov 18, 2021
[CI] transformers@master has been fixed (#1573)
stas00
authored
74baf5b
Copy full SHA for 74baf5b
switch bin files to use python3 instead of python (#1185)
Show description for 236890d
jeffra
and
tjruwase
authored
236890d
Copy full SHA for 236890d
Render docs for pipe.ProcessTopology (#1505)
Show description for fafc827
3 people
authored
fafc827
Copy full SHA for fafc827
Remove hard tensorboardX requirement (#1571)
jeffra
authored
a90497e
Copy full SHA for a90497e
Commits on Nov 17, 2021
[launcher/runner] respect CUDA_VISIBLE_DEVICES for a single node (#960)
Show description for e3c2d7b
3 people
authored
e3c2d7b
Copy full SHA for e3c2d7b
[autotuning] guard tabulate package import (#1569)
Show description for 938449e
jeffra
authored
938449e
Copy full SHA for 938449e
Enforce nccl/rccl alignment of start location of each shard (#1564)
Show description for 4a0b103
amathews-amd
and
tjruwase
authored
4a0b103
Copy full SHA for 4a0b103
bump DSE commit
jeffra
committed
4625add
Copy full SHA for 4625add
set hf hash (#1568)
jeffra
authored
da7bff4
Copy full SHA for da7bff4
Commits on Nov 16, 2021
Fix partial recovery of sparse_tensor_module_names and dynamically check if gradient data is sparse (#1562)
Show description for 4bf4ab7
3 people
authored
4bf4ab7
Copy full SHA for 4bf4ab7
Add autotuning news post (#1565)
cli99
authored
bda3d0e
Copy full SHA for bda3d0e
Commits on Nov 15, 2021
[build] support cuda-11.5 (#1558)
stas00
authored
fa8d6c0
Copy full SHA for fa8d6c0
Commits on Nov 13, 2021
Update offload parameter names (#1536)
Show description for 7567c76
tjruwase
and
jeffra
authored
7567c76
Copy full SHA for 7567c76
Autotuning (#1554)
Show description for 9caa74e
4 people
authored
9caa74e
Copy full SHA for 9caa74e
Add documentation for bfloat16 (git commit 648f7bfa5009484b822064d0c28d377da6dd71a0) (#1516)
Show description for b7cc7c8
manuelciosici
and
jeffra
authored
b7cc7c8
Copy full SHA for b7cc7c8
Commits on Nov 12, 2021
Fix zinf none swapper (#1550)
tjruwase
authored
488105e
Copy full SHA for 488105e
Add warmup_type arguments in WarmupLR and WarmupDecayLR (#1530)
Show description for 76847f4
3 people
authored
76847f4
Copy full SHA for 76847f4
Fix sparse attention for small block-sizes (#1545)
Show description for 3ed7730
RezaYazdaniAminabadi
and
jeffra
authored
3ed7730
Copy full SHA for 3ed7730
Tensor-Parallelism general support (#1512)
Show description for 9ce00a2
3 people
authored
9ce00a2
Copy full SHA for 9ce00a2
Commits on Nov 11, 2021
backward compatibility (#1549)
conglongli
authored
b16dd94
Copy full SHA for b16dd94
bump to 0.5.7
jeffra
committed
fa9d3e8
Copy full SHA for fa9d3e8
Fix 1bit extra issue (#1542)
jeffra
authored
2665c8b
Copy full SHA for 2665c8b
Use cuda tensors for allgather (#1548)
tjruwase
authored
bd3ebdd
Copy full SHA for bd3ebdd
Commits on Nov 9, 2021
CPU-Adam: Fix compile Issue (#1537)
Show description for af443f6
RezaYazdaniAminabadi
authored
af443f6
Copy full SHA for af443f6
Modify inference engine (#1520)
Show description for f012200
4 people
authored
f012200
Copy full SHA for f012200
Commits on Nov 8, 2021
[unit tests] allow unique port for tests
jeffra
committed
0af15b9
Copy full SHA for 0af15b9
fstr for multnode_runner (#1532)
chunyang-wen
authored
93c7183
Copy full SHA for 93c7183
Pagination
Previous
Next
You can’t perform that action at this time.