[GRPO] add cosine reward #206
LGTM, will probably need to be moved around if #144 is merged before this one.
Yes, will wait until that is merged and then move it.
Sorry, it looks like there are some linting errors. I'll get the fix in within ~2 hours.
#144 is ready to ship.
I'm fixing the merge conflicts.
97d5670 to 2e73e71
* add cosine reward
* fix merge
* fix typo
* fix check
Adds the cosine reward from the paper: https://arxiv.org/pdf/2502.03373
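
For readers unfamiliar with the idea, here is a rough sketch of a length-aware cosine reward as I understand it from the paper. It is not the code in this PR: the function name, parameter names, and default values are all assumptions chosen for illustration, and the sketch leaves out any separate penalty for hitting the length limit. The reward follows a cosine schedule over the completion length, so correct answers score higher when short and wrong answers are penalized less when long.

```python
import math


def cosine_scaled_reward(
    is_correct: bool,
    gen_len: int,
    max_len: int = 1000,              # hypothetical default
    min_value_wrong: float = -1.0,    # hypothetical default
    max_value_wrong: float = -0.5,    # hypothetical default
    min_value_correct: float = 0.5,   # hypothetical default
    max_value_correct: float = 1.0,   # hypothetical default
) -> float:
    """Illustrative cosine-scaled reward; not the PR's implementation."""
    # Fraction of the length budget used, clamped to [0, 1].
    progress = min(max(gen_len, 0), max_len) / max_len
    cosine = math.cos(progress * math.pi)  # 1 at length 0, -1 at max_len

    if is_correct:
        # Correct: best reward at length 0, decaying towards max_len.
        at_zero, at_max = max_value_correct, min_value_correct
    else:
        # Wrong: harshest penalty at length 0, milder towards max_len.
        at_zero, at_max = min_value_wrong, max_value_wrong

    # Cosine interpolation between the two endpoint values.
    return at_max + 0.5 * (at_zero - at_max) * (1.0 + cosine)


if __name__ == "__main__":
    for length in (0, 500, 1000):
        print(length,
              cosine_scaled_reward(True, length),
              cosine_scaled_reward(False, length))
```

The endpoints are kept as explicit parameters so the reward range for correct and wrong completions can be tuned independently.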