Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
View all resources
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
GitHub Stars
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
2018luyi
/
DeepSpeed
Public
forked from
deepspeedai/DeepSpeed
Notifications
You must be signed in to change notification settings
Fork
0
Star
0
Code
Pull requests
0
Actions
Projects
Security
0
Insights
Additional navigation options
Code
Pull requests
Actions
Projects
Security
Insights
Commits
Breadcrumbs
History for
DeepSpeed
deepspeed
on
master
User selector
All users
All time
Commit History
Commits on Feb 27, 2021
fixed typo (#802)
vfdev-5
authored
db987cf
Copy full SHA for db987cf
Commits on Feb 26, 2021
document the requirement to call for all ranks (#801)
stas00
authored
7eb083c
Copy full SHA for 7eb083c
Delete out2 (#798)
vfdev-5
authored
62396b7
Copy full SHA for 62396b7
Commits on Feb 24, 2021
Fix the bias-add and add the layer-norm-eps parameter (#791)
Show description for e2dfcad
RezaYazdaniAminabadi
authored
e2dfcad
Copy full SHA for e2dfcad
Fixing the module-inject Api (#786)
RezaYazdaniAminabadi
authored
48065c0
Copy full SHA for 48065c0
Commits on Feb 19, 2021
Update engine.py (#767)
jeffra
authored
29fa4b2
Copy full SHA for 29fa4b2
Commits on Feb 17, 2021
[dist] set args.local_rank to LOCAL_RANK (#764)
jeffra
authored
68e138b
Copy full SHA for 68e138b
Fix NameError: name 'dist' is not defined (#763)
tma15
authored
8067efa
Copy full SHA for 8067efa
Commits on Feb 16, 2021
Checks for None tensors and skip them when splitting the buckets in zero stage 2. (#728)
Show description for 7cab55c
cli99
authored
7cab55c
Copy full SHA for 7cab55c
Commits on Feb 12, 2021
Activation checkpointing for non-tensor arguments and return values (#741)
Show description for ec8b1cb
tjruwase
authored
ec8b1cb
Copy full SHA for ec8b1cb
Replace timer print rank 0 with logging (#732)
Show description for 6fb1610
Sean Naren
and
jeffra
authored
6fb1610
Copy full SHA for 6fb1610
Commits on Feb 11, 2021
fix spelling mistake (#749)
sdtblck
authored
1b8ca8e
Copy full SHA for 1b8ca8e
Only initialize distributed if required (#734)
Show description for 59eed17
Sean Naren
and
jeffra
authored
59eed17
Copy full SHA for 59eed17
Add flops profiler tutorial (#682)
Show description for e2dfe0d
cli99
authored
e2dfe0d
Copy full SHA for e2dfe0d
Commits on Feb 8, 2021
Improve starred expressions (#696)
Show description for b08aa6f
joneyolfson
and
cli99
authored
b08aa6f
Copy full SHA for b08aa6f
Commits on Feb 4, 2021
[launcher] look ma, no more zombies (#714)
Show description for 4f1d827
stas00
and
jeffra
authored
4f1d827
Copy full SHA for 4f1d827
Commits on Feb 1, 2021
local rank of -1 means not set (#720)
jeffra
authored
45c33ee
Copy full SHA for 45c33ee
properly set engine.local_rank if it's set to -1
jeffra
committed
3cecbc1
Copy full SHA for 3cecbc1
Commits on Jan 29, 2021
set_batch_fn and remove old sanity check (#712)
Shaden Smith
authored
5e522ef
Copy full SHA for 5e522ef
Dist testing backend fixes, etc. (#708)
jeffra
authored
2e2dd86
Copy full SHA for 2e2dd86
Commits on Jan 25, 2021
Add optional timeout parameter to deepspeed.init_distributed (#637)
Show description for 852c524
sdtblck
and
jeffra
authored
852c524
Copy full SHA for 852c524
Commits on Jan 20, 2021
Fix ZeRO 2 + Pipelining (#677)
Show description for 34c83a5
leogao2
authored
34c83a5
Copy full SHA for 34c83a5
Commits on Jan 15, 2021
skip empty lines in hostfile (#669)
jeffra
authored
6217a6c
Copy full SHA for 6217a6c
Support optimizer AdamW type (#670)
tjruwase
authored
865104b
Copy full SHA for 865104b
Commits on Jan 14, 2021
Validate consistent ckpt tags across ranks (#667)
jeffra
authored
f032e56
Copy full SHA for f032e56
Commits on Jan 13, 2021
squash latest flops profiling changes (#1) (#664)
Show description for e2fbe4d
cli99
and
jeffra
authored
e2fbe4d
Copy full SHA for e2fbe4d
Commits on Jan 12, 2021
Handle actvitation checkpointing args that are None or non-tensors (#660)
Show description for adcfd26
Shaden Smith
authored
adcfd26
Copy full SHA for adcfd26
Commits on Jan 8, 2021
LR scheduler unit tests (#429)
Show description for da5563a
tjruwase
and
jeffra
authored
da5563a
Copy full SHA for da5563a
Remove a very verbose print statement. (#649)
Show description for af212f6
awan-10
authored
af212f6
Copy full SHA for af212f6
add additional validation checks in elastic config (#646)
jeffra
authored
bc046dc
Copy full SHA for bc046dc
Commits on Jan 7, 2021
Add deepspeed.init_distributed to RTD page (#645)
Show description for 4e2dc4e
jeffra
and
tjruwase
authored
4e2dc4e
Copy full SHA for 4e2dc4e
Commits on Jan 6, 2021
Module replacement support (#586)
Show description for 44bd538
3 people
authored
44bd538
Copy full SHA for 44bd538
Commits on Jan 5, 2021
Fix docstring format (#640)
tjruwase
authored
5ab1279
Copy full SHA for 5ab1279
change dist to torch.distributed to fix bug in assert. (#638)
awan-10
authored
d38ad6a
Copy full SHA for d38ad6a
Allow DeepSpeed models to be initialized with optimizer=None (#469)
Show description for a9a83a6
gcooper-isi
and
Shaden Smith
authored
a9a83a6
Copy full SHA for a9a83a6
Pagination
Previous
Next
You can’t perform that action at this time.