Skip to content

Pull requests: huggingface/open-r1

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Pin training dependencies
#393 opened Feb 22, 2025 by lewtun Loading…
Update prompt template and sampling parameters for evaluation
#392 opened Feb 22, 2025 by lewtun Loading…
1 task
WIP "Faster" grpo trainer
#371 opened Feb 19, 2025 by edbeeching Draft
Fix dataset url
#347 opened Feb 17, 2025 by Zzhiter Loading…
Fix reasoning_steps_reward function
#335 opened Feb 16, 2025 by rocke2020 Loading…
fix bug, solutions not found
#334 opened Feb 15, 2025 by hellen9527 Loading…
Update sglang README.md
#330 opened Feb 15, 2025 by yh-yao Loading…
Update grpo.py
#325 opened Feb 14, 2025 by tpoisonooo Loading…
add text similarity for more common accuracy reward
#322 opened Feb 14, 2025 by sungatetop Loading…
fix: sft fix
#307 opened Feb 13, 2025 by pointerhacker Loading…
Fix eval max length
#297 opened Feb 12, 2025 by Some-random Loading…
[rewards] use dense rep penalty
#296 opened Feb 12, 2025 by kashif Loading…
Update README.md
#291 opened Feb 12, 2025 by tpoisonooo Loading…
Performance improvements of reward calculation
#286 opened Feb 11, 2025 by saidineshpola Loading…
fix: easier environment setup; pin trl, transformers
#199 opened Feb 6, 2025 by ctjlewis Loading…
2
6
[Feat] Adding minimal training for multimodal model
#136 opened Jan 31, 2025 by kcz358 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.