- Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: PaddlePaddle/PaddleNLP
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
【FlexCheckpoint】add config memory_growth_threshold for save hf
#11183 by xingmingyyj was merged Nov 28, 2025 Loading…
2 tasks
Disable check for
llama_align_dy2st_fthenb_and_vpp_auto_bs2_fp32_DP1-MP1-PP4 and llm_gpt_dygraph_auto_bs8_fp32_DP2-MP2 #11182 by zrr1999 was closed Dec 3, 2025 Loading…
1 task done
Fix EMA bug when load different strategies
#11177 by sneaxiy was merged Nov 18, 2025 Loading…
2 tasks
fix int64 to_int for performance improvement
#11175 by zhengshengning was merged Nov 14, 2025 Loading…
2 tasks
big tensor: tokens_unzip_gather
#11172 by zhengshengning was merged Nov 12, 2025 Loading…
2 tasks done
[Embedding] Expand training and shortgpt_prune code to support more model Beijing Innovation Consortium contributor
#11168 by Li-Z-Q was merged Nov 12, 2025 Loading…
Fix the non-convergence in DSV3 post-pretrain
#11161 by zhangbo9674 was merged Nov 4, 2025 Loading…
2 tasks
BugFix: qwen pp cannot send bool when using fuse loss contributor
#11157 by Jason233333 was merged Nov 7, 2025 Loading…
BugFix: qwen pp cannot send bool when using fuse loss contributor
#11156 by Jason233333 was merged Nov 7, 2025 Loading…
LoRA: add lora target modules support for qwen2 and qwen3 dense model… contributor
#11153 by aiyinyuedejustin was merged Oct 27, 2025 Loading…
LoRA: add lora target modules support for qwen2 and qwen3 dense model… contributor
#11152 by aiyinyuedejustin was merged Oct 27, 2025 Loading…
support sharding stage3 for deepseekv3 model contributor
#11149 by AlAuAu was closed Nov 27, 2025 Loading…
Previous Next
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.