PaddlePaddle / PaddleNLP Public

Notifications You must be signed in to change notification settings
Fork 3.1k
Star 12.9k

Code
Issues 128
Pull requests 433
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: PaddlePaddle/PaddleNLP

Labels 58 Milestones 0

New pull request New

433 Open 6,849 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Normalize gates on expert dim before calculating seq_aux_loss

#11160 opened Nov 3, 2025 by lshpku

Loading…

support sharding stage3 for deepseekv3 model contributor

#11149 opened Oct 23, 2025 by AlAuAu

Loading…

debug use

#11148 opened Oct 22, 2025 by XieYunshen

Loading…

2 tasks

[Auto-Paralllel] fix intermediate api use

#11124 opened Sep 28, 2025 by Xing-lil

Loading…

2 tasks

【FlexCheckpoint】fix_the_optimizer_init contributor

#11123 opened Sep 27, 2025 by zty-king

Loading…

2 tasks

Feat: support chatglm v2 faster infer in p800

#11118 opened Sep 25, 2025 by mingMelody

Loading…

hack offload optimizer减少一次master weight的offload&reload

#11111 opened Sep 23, 2025 by Wennie396

Loading…

update using_post_norm_recompute stale

#11093 opened Sep 16, 2025 by chen2016013

Loading…

2 tasks

add keti7 scripts stale

#11086 opened Sep 11, 2025 by FeixLiu

Loading…

2 tasks

Support uc save for deepseek stale

#11078 opened Sep 6, 2025 by DesmonDay

Loading…

实现HuggingFace Cache和缩专家加载 stale

#11042 opened Sep 2, 2025 by lshpku

Loading…

clip expert grad stale

#11035 opened Sep 1, 2025 by zhangbo9674

Loading…

2 tasks

Add support for DRAG contributor stale

#11021 opened Aug 27, 2025 by Kinandra

Loading…

opt reader and gather stale

#11016 opened Aug 26, 2025 by phlrain

Loading…

2 tasks

add script for training gpt3 on XPU machine using flagcx as comm backend contributor stale

#11014 opened Aug 26, 2025 by mikethegoblin

Loading…

2 tasks

Optimie moe and dense overlap stale

#11013 opened Aug 26, 2025 by phlrain

Loading…

2 tasks

[NOT MERGE]Pr adapt flex checkpoint contributor stale

#10996 opened Aug 25, 2025 by zty-king

Loading…

2 tasks

[BUG]: fix the bug in PretrainedModel.recompute_disable() contributor stale

#10988 opened Aug 21, 2025 by hongjx175

Loading…

2 tasks

Make pp_stream wait on attn_backward_dx stale

#10984 opened Aug 21, 2025 by lshpku

Loading…

recompute support offload tensor stale

#10981 opened Aug 21, 2025 by blacksheep-Aristotle

Loading…

2 tasks

moe_layer support fine_grained_forward stale

#10980 opened Aug 21, 2025 by blacksheep-Aristotle

Loading…

2 tasks

update expert parallel init logic stale

#10966 opened Aug 18, 2025 by blacksheep-Aristotle

Loading…

2 tasks

optimize mtp speed stale

#10965 opened Aug 18, 2025 by phlrain

Loading…

2 tasks

best N1C8 performance stale

#10964 opened Aug 17, 2025 by chen2016013

Loading…

2 tasks

Fix model load bug stale

#10962 opened Aug 16, 2025 by phlrain

Loading…

2 tasks

Previous 1 2 3 4 5 … 17 18 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!