Pull requests: PaddlePaddle/PaddleNLP

Pull requests list

[WIP] support deepseek-v3
#9769 opened Jan 10, 2025 by yuanlehome Draft
2 tasks
[LLM] Add fused attention in Qwen2MoE
#9767 opened Jan 10, 2025 by DrownFish19
2 tasks
Update README.md
#9766 opened Jan 10, 2025 by ZHUI
2 tasks
fix loraga merge
#9765 opened Jan 9, 2025 by greycooker
2 tasks
[MoE] fix expert parallel
#9760 opened Jan 9, 2025 by DesmonDay
Fix ernie ci auto trainer error
#9758 opened Jan 8, 2025 by blacksheep-Aristotle
【LLM】solve dynamic to static problem contributor
#9755 opened Jan 8, 2025 by Fantasy-02
2 tasks
Fix requirements.txt
#9754 opened Jan 8, 2025 by DrownFish19
2 tasks done
Update ci_unit.sh
#9733 opened Jan 2, 2025 by ZHUI
[llm]add adam
#9732 opened Jan 2, 2025 by lugimzzz
[New Features]Add lorapro
#9729 opened Jan 2, 2025 by greycooker
Auto sft
#9728 opened Jan 2, 2025 by blacksheep-Aristotle
fix auto tokenizer
#9726 opened Jan 2, 2025 by lyuwenyu
[Embedding] update embedding document
#9724 opened Jan 2, 2025 by DesmonDay
support HF tokenizer and stop_seqs
#9723 opened Jan 2, 2025 by ming1753
[LLM Benchmark]optimize runtime
#9722 opened Dec 31, 2024 by Liujie0926
add enable_offload_queue to PipelineParallel
#9708 opened Dec 27, 2024 by GuoxiaWang
mergekit gpu 1226
#9702 opened Dec 26, 2024 by Mangodadada
Unified amp strategy in auto_trainner
#9696 opened Dec 25, 2024 by From00
Add Llama2 and Qwen2.5 Pretrain Configurations
#9694 opened Dec 25, 2024 by sneaxiy