- Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: PaddlePaddle/PaddleNLP
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Disable check for
llama_align_dy2st_fthenb_and_vpp_auto_bs2_fp32_DP1-MP1-PP4 and llm_gpt_dygraph_auto_bs8_fp32_DP2-MP2 #11182 opened Nov 28, 2025 by zrr1999 Loading…
1 task done
Normalize gates on expert dim before calculating seq_aux_loss
#11160 opened Nov 3, 2025 by lshpku Loading…
[Auto-Paralllel] fix intermediate api use stale
#11124 opened Sep 28, 2025 by Xing-lil Loading…
2 tasks
【FlexCheckpoint】fix_the_optimizer_init contributor
#11123 opened Sep 27, 2025 by zty-king Loading…
2 tasks
Feat: support chatglm v2 faster infer in p800 stale
#11118 opened Sep 25, 2025 by mingMelody Loading…
hack offload optimizer减少一次master weight的offload&reload
#11111 opened Sep 23, 2025 by Wennie396 Loading…
add script for training gpt3 on XPU machine using flagcx as comm backend contributor stale
#11014 opened Aug 26, 2025 by mikethegoblin Loading…
2 tasks
[NOT MERGE]Pr adapt flex checkpoint contributor stale
#10996 opened Aug 25, 2025 by zty-king Loading…
2 tasks
[BUG]: fix the bug in PretrainedModel.recompute_disable() contributor stale
#10988 opened Aug 21, 2025 by hongjx175 Loading…
2 tasks
recompute support offload tensor stale
#10981 opened Aug 21, 2025 by blacksheep-Aristotle Loading…
2 tasks
moe_layer support fine_grained_forward stale
#10980 opened Aug 21, 2025 by blacksheep-Aristotle Loading…
2 tasks
update expert parallel init logic stale
#10966 opened Aug 18, 2025 by blacksheep-Aristotle Loading…
2 tasks
Previous Next
ProTip! Filter pull requests by the default branch with base:develop.