Skip to content

Pull requests: PaddlePaddle/PaddleNLP

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

【FlexCheckpoint】add config memory_growth_threshold for save hf
#11183 by xingmingyyj was merged Nov 28, 2025 Loading…
2 tasks
【FlexCheckpoint】support save hf
#11180 by xingmingyyj was merged Nov 26, 2025 Loading…
2 tasks
Fix EMA bug when load different strategies
#11177 by sneaxiy was merged Nov 18, 2025 Loading…
2 tasks
fix int64 to_int for performance improvement
#11175 by zhengshengning was merged Nov 14, 2025 Loading…
2 tasks
[Cherry-pick]Add Non ZCC EMA Callback
#11174 by sneaxiy was merged Nov 13, 2025 Loading…
2 tasks
Add Non ZCC EMA Callback
#11173 by sneaxiy was merged Nov 13, 2025 Loading…
2 tasks
big tensor: tokens_unzip_gather
#11172 by zhengshengning was merged Nov 12, 2025 Loading…
2 tasks done
fix eb5 big tensor bug
#11169 by wanghuancoder was merged Nov 11, 2025 Loading…
2 tasks
fix opt chunk offload
#11167 by Wennie396 was merged Nov 12, 2025 Loading…
2 tasks
Test CI
#11166 by DrownFish19 was closed Nov 11, 2025 Loading…
2 tasks
Fix ZCC ema GPU alloc bug
#11165 by sneaxiy was merged Nov 7, 2025 Loading…
2 tasks
TARE最新提交 contributor
#11164 by PatriciaPulec was closed Nov 5, 2025 Loading…
2 tasks
Update moe model save for moe_sharding
#11162 by DrownFish19 was merged Nov 12, 2025 Loading…
Fix the non-convergence in DSV3 post-pretrain
#11161 by zhangbo9674 was merged Nov 4, 2025 Loading…
2 tasks
Update run_finetune.py
#11159 by a31413510 was merged Nov 3, 2025 Loading…
2 tasks
Update moe model save for moe_sharding
#11155 by DrownFish19 was merged Nov 12, 2025 Loading…
TARE method added contributor
#11151 by PatriciaPulec was merged Nov 5, 2025 Loading…
2 tasks
paddleformers PR check
#11150 by swgu98 was merged Oct 23, 2025 Loading…
2 tasks
support sharding stage3 for deepseekv3 model contributor
#11149 by AlAuAu was closed Nov 27, 2025 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.