Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 4

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

159 Open 1,228 Closed

🏋 GRPO ⚡ PEFT

#2698 opened Jan 30, 2025 by gagan3012

5 tasks done

⚡accelerate ⚡ PEFT 🏋 PPO

#2696 opened Jan 30, 2025 by daehuikim

5 tasks done

🐛 bug 🚀 deepspeed 🏋 GRPO

#2688 opened Jan 30, 2025 by abacaj

5 tasks done

🐛 bug 🏋 GRPO

#2686 opened Jan 29, 2025 by shirinyamani

🏋 GRPO ❓ question 🏋 Reward

#2685 opened Jan 29, 2025 by shirinyamani

✨ enhancement 🏋 GRPO ⚡ PEFT

#2684 opened Jan 29, 2025 by howardzhou

🏋 GRPO ❓ question

#2681 opened Jan 29, 2025 by macheng6

✨ enhancement 🏋 GRPO

#2680 opened Jan 29, 2025 by Palmik

🐛 bug ⏳ needs more info 🏋 Reward

#2674 opened Jan 28, 2025 by Tarak200

🐛 bug 🏋 GRPO 🏋 Online DPO

#2671 opened Jan 28, 2025 by benjamin-marie

5 tasks done

whiten_rewards parameter in RLOO config is not used. 🐛 bug 🏋 RLOO

#2665 opened Jan 26, 2025 by velezbeltran

5 tasks done

🐛 bug 🏋 DPO

#2660 opened Jan 25, 2025 by baichuanzhou

🐛 bug ⏳ needs more info 🏋 PPO 🏋 RLOO

#2657 opened Jan 25, 2025 by Superskyyy

5 tasks done

📚 documentation 👶 good first issue 🏋 SFT

#2649 opened Jan 24, 2025 by ParagEkbote

5 tasks done

How to stop SFTTrainer from auto tokenizing my messages ? ❓ question 🏋 SFT

#2642 opened Jan 24, 2025 by MohamedAliRashad

🐛 bug 🏋 DPO 🏋 DPPO 🏋 GKD 🏋 GRPO 🏋 Iterative SFT 🏋 KTO 🏋 Online DPO 🏋 ORPO 🏋 PPO 🏋 PRM 🏋 Reward 🏋 RLOO 🏋 SFT 🏋 XPO

#2617 opened Jan 23, 2025 by qgallouedec

13 of 18 tasks

ProTip! What’s not been updated in a month: updated:<2024-12-30.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list