huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 32.6k
Star 158k

Code
Issues 1.1k
Pull requests 1.2k
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: huggingface/transformers

Labels 137 Milestones 0

New pull request New

1,219 Open 24,361 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix: implement Mxfp4Dequantize.reverse_op for save_pretrained support

#44983 opened Mar 25, 2026 by Hyungkeun-Park-Nota

Loading…

Trainer: set skip_logits for loss-only eval when liger enabled

#44981 opened Mar 25, 2026 by AkshajKashyap

Loading…

4 of 6 tasks

bug-fix: do not assume torch.cuda is available when setting up norm values, even if flash linear attention is available

#44980 opened Mar 24, 2026 by kallewoof

Loading…

2 of 6 tasks

Module Fusion API

#44979 opened Mar 24, 2026 by michaelbenayoun

Loading…

fix: handle absent sys.modules entry in modeling_utils

#44978 opened Mar 24, 2026 by cjkindel

Loading…

2 of 6 tasks

Refactor core_model_loading to support FSDP shard-on-read loading

#44974 opened Mar 24, 2026 by 3outeille

Loading…

Fix max_seqlen type in vision attention for torch.compile + FA2

#44973 opened Mar 24, 2026 by andylizf

Loading…

Fix CPU 16 bytes alignment issue using equivalent fallback

#44970 opened Mar 24, 2026 by IlyasMoutawwakil

Loading…

6 tasks

Fix FA kernel launch needs correct cuda device ctx in multi-gpu env

#44967 opened Mar 24, 2026 by Qubitium

Loading…

2 of 6 tasks

try

#44965 opened Mar 24, 2026 by ydshieh

Loading…

6 tasks

fixed import error with PILImageResampling

#44958 opened Mar 23, 2026 by josh-kean

Loading…

3 tasks done

[WIP] Add HyperCLOVAX model

#44956 opened Mar 23, 2026 by bigshanedogg • Draft

3 of 6 tasks

[docs] pipeline cleanup

#44954 opened Mar 23, 2026 by stevhliu

Loading…

Fix: Add correct return behaviour when output_hidden_states=True for CLIP and SIGLIP vision models

#44952 opened Mar 23, 2026 by Jess-Co-Del

Loading…

2 tasks done

feat: Add router_logits override to enable Routing Replay for MoE models

#44951 opened Mar 23, 2026 by hemantmm

Loading…

1 task done

[Cache] Native mamba & hybrid cache

#44950 opened Mar 23, 2026 by Cyrilvallez

Loading…

Fix: NotebookProgressCallback crash when evaluating with the Trainer

#44949 opened Mar 23, 2026 by Charly21r

Loading…

3 tasks done

Add doc page for capturing outputs

#44947 opened Mar 23, 2026 by zucchini-nlp

Loading…

Add inference time layer fusion optimisations via PreTrainedModel.from_pretrained(fuse_layers=True)

#44942 opened Mar 23, 2026 by hmellor • Draft

fix tie_weights skipping logic is not tied to model thread scope

#44940 opened Mar 23, 2026 by Qubitium

Loading…

2 of 6 tasks

[MOE] MoE routing capture and replay support

#44925 opened Mar 22, 2026 by kashif

Loading…

5 tasks

fix: avoid unconditional model_info call in _patch_mistral_regex

#44923 opened Mar 22, 2026 by prakhar-agarwal • Draft

fix: skip clean_up_tokenization for BPE tokenizers in PreTrainedTokenizerFast

#44915 opened Mar 21, 2026 by maxsloef-goodfire

Loading…

3 of 5 tasks

Remove unnecessary expand_as in get_placeholder_mask across VLMs

#44907 opened Mar 21, 2026 by syncdoth

Loading…

7 tasks done

fix(models): Fix Perceiver interpolate_pos_encoding interpolating to the source size

#44899 opened Mar 20, 2026 by harshaljanjani

Loading…

3 of 5 tasks

Previous 1 2 3 4 5 … 48 49 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!