bitsandbytes-foundation/bitsandbytes
Issues
is:issue state:open
CUDA SETUP ERROR: Missing dependency: libnvJitLink.so.13 - Google Colab
Status: Open.
#1905 · tanvircr7 opened on Mar 25, 2026
Params4bit.__getattr__ breaks torch.compile - use @property instead
Status: Open.
#1904 · kbabiuchx opened on Mar 23, 2026
Question: intentional FP16-only path for int8_vectorwise_quant / LLM.int8 activation quant? (BF16 support + removing casts)
Status: Open.
Feature · #1868 · sanghyunna opened on Feb 16, 2026
Default LLM.int8() mixed-precision decomposition causes 17-147% energy overhead across consumer and datacenter GPUs
Status: Open.
#1867 · hongping-zh opened on Feb 15, 2026
gemv_4bit silently produces wrong results when weight is quantized in (in_features, out_features) layout
Status: Open.
#1862 · TimDettmers opened on Feb 14, 2026
[Feature Gap] CUDA compared to other backends like XPU/CPU
Intel
Status: Open.
#1852 · jiqing-feng opened on Jan 30, 2026
[Performance/Energy] 4-bit NF4 shows significant energy efficiency penalty on Blackwell (RTX 5090) for small models
Status: Open.
#1851 · hongping-zh opened on Jan 29, 2026
Failed to quant MoE models with fused expert weights in transformers v5
Hugging Face Integration
An issue or PR that is related to the interaction between bitsandbytes and HF libraries.
Status: Open.
#1849 · ITcarrot opened on Jan 25, 2026
70B 4-bit LLM decode bottlenecked by HIP kernel (kgemm_4bit_inference_naive) efficiency - 49% vs 91% memory bandwidth on ROCm/gfx1151
ROCm
Status: Open.
#1842 · BellaDoggie opened on Jan 19, 2026
Support quantizing tensors when numel() > INT_MAX
CUDA
Issues and PRs related to the CUDA backend, excluding installation/support help.
Status: Open.
Feature · #1785 · matthewdouglas opened on Oct 22, 2025 · v0.50.0
Reduce CUDA build matrix
Build
CUDA
Issues and PRs related to the CUDA backend, excluding installation/support help.
Status: Open.
Task · #1778 · matthewdouglas opened on Oct 3, 2025 · v0.50.0
Can't get llm_int8_skip_modules to work: 'Parameter' object has no attribute 'SCB'
Bug
Something isn't working
Status: Open.
#1634 · redbrain opened on May 12, 2025