Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
View all resources
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
GitHub Stars
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
yzs981130
/
DeepSpeed
Public
forked from
deepspeedai/DeepSpeed
Notifications
You must be signed in to change notification settings
Fork
0
Star
0
Code
Pull requests
0
Actions
Projects
Security
0
Insights
Additional navigation options
Code
Pull requests
Actions
Projects
Security
Insights
Commits
Branch selector
master
User selector
All users
All time
Commit History
Commits on Oct 7, 2022
Change type to tuple in replace_wo_policy isinstance check (#2387)
Show description for 46a886c
4 people
authored
46a886c
Copy full SHA for 46a886c
pin transformers version for unit tests (#2402)
mrwyattii
authored
6f3dec6
Copy full SHA for 6f3dec6
Commits on Oct 5, 2022
allow building with latest CUDA (11.8), it is backwards compatible (#2390)
Thomas-MMJ
authored
f5a8348
Copy full SHA for f5a8348
Fix the MLP output tensor's shape (#2380)
arashb
authored
0a2ae2e
Copy full SHA for 0a2ae2e
Commits on Oct 4, 2022
Refactor remaining distributed tests (#2216)
Show description for ff42743
mrwyattii
and
tjruwase
authored
ff42743
Copy full SHA for ff42743
Commits on Sep 29, 2022
fix an exception when recursively casting dicts to fp16 (#2370)
mjksmith
authored
b609a29
Copy full SHA for b609a29
Commits on Sep 27, 2022
Capture error message during sweep tests (#2351)
Show description for eed4032
3 people
authored
eed4032
Copy full SHA for eed4032
Refactor fused_bias_residual kernels for better readability (#2356)
Show description for e14d40e
arashb
and
tjruwase
authored
e14d40e
Copy full SHA for e14d40e
Extend residual_add kernel tests to conver pre_attn_norm (#2354)
Show description for 79692af
arashb
and
jeffra
authored
79692af
Copy full SHA for 79692af
Add missing pytest fixture scope (#2353)
Show description for b450da4
3 people
authored
b450da4
Copy full SHA for b450da4
fix cuda invalid config error in dequant kernel (#2362)
Show description for 3486afb
GuanhuaWang
authored
3486afb
Copy full SHA for 3486afb
Commits on Sep 26, 2022
Update issue templates
jeffra
committed
8e8c866
Copy full SHA for 8e8c866
Updated issue templates (#2363)
jeffra
authored
70e883a
Copy full SHA for 70e883a
Refactor gptj_residual_add kernels for better readability (#2358)
Show description for 9df604b
arashb
and
RezaYazdaniAminabadi
authored
9df604b
Copy full SHA for 9df604b
download cifar to blob storage (#2342)
Show description for 6ef16de
mrwyattii
and
tjruwase
authored
6ef16de
Copy full SHA for 6ef16de
docs(mixture-of-experts-inference): fix typo in tuto (#2345)
Show description for 2b1b0d2
jqueguiner
and
tjruwase
authored
2b1b0d2
Copy full SHA for 2b1b0d2
Add Onebit Optimzers in __init__ (#2340)
Show description for f210256
3 people
authored
f210256
Copy full SHA for f210256
Kernel Data Conversion Utility (#2327)
Show description for 9aa7b63
cmikeh2
authored
9aa7b63
Copy full SHA for 9aa7b63
Commits on Sep 23, 2022
Inference profiling updates/fixes (#2348) (#2349)
Show description for 9932643
3 people
authored
9932643
Copy full SHA for 9932643
fix zero docs (#2350)
jeffra
authored
76de924
Copy full SHA for 76de924
Extend scratch buffer for long prompts (#2212)
Show description for 3d097bb
4 people
authored
3d097bb
Copy full SHA for 3d097bb
Commits on Sep 22, 2022
increase min pre-commit versions (#2346)
jeffra
authored
b76e0f4
Copy full SHA for b76e0f4
mem access for quantize kernel (#2331)
Show description for 954e0c6
3 people
authored
954e0c6
Copy full SHA for 954e0c6
Commits on Sep 21, 2022
Refactor residual add kernels (#2333)
Show description for 48c5220
arashb
and
awan-10
authored
48c5220
Copy full SHA for 48c5220
MOE matmult with memaccess (#2336)
Show description for 12e1cb8
samadejacobs
authored
12e1cb8
Copy full SHA for 12e1cb8
Commits on Sep 19, 2022
MOE residual matmult unit test (#2323)
Show description for 80b10d0
3 people
authored
80b10d0
Copy full SHA for 80b10d0
bump to 0.7.4
jeffra
committed
0f0a7a5
Copy full SHA for 0f0a7a5
Commits on Sep 16, 2022
Add more options to inference benchmark (#2325)
mrwyattii
authored
1592381
Copy full SHA for 1592381
Commits on Sep 14, 2022
only override forward if using cuda-graph (#2291)
jeffra
authored
cf638be
Copy full SHA for cf638be
add quant unit test (#2315)
Show description for 95d1151
GuanhuaWang
and
awan-10
authored
95d1151
Copy full SHA for 95d1151
refactor to use mem_access (#2317)
mrwyattii
authored
c199eda
Copy full SHA for c199eda
Commits on Sep 13, 2022
ZeRO-Inference blog - Update README (#2322)
tjruwase
authored
060078a
Copy full SHA for 060078a
ZeRO-Inference blog - wrap up (#2321)
tjruwase
authored
f5230be
Copy full SHA for f5230be
ZeRO-Inference blog (#2271)
Show description for 276eec7
3 people
authored
276eec7
Copy full SHA for 276eec7
Commits on Sep 12, 2022
Upgrade P40 tests to torch 1.8 (#2316)
Show description for 18ee381
mrwyattii
and
jeffra
authored
18ee381
Copy full SHA for 18ee381
Pagination
Previous
Next
You can’t perform that action at this time.