Language specific code format reward #377

Merged (1 commit) on Feb 21, 2025

Conversation

zeenolife
Contributor

While looking at the code generation logs, I noticed that a completion sometimes contains valid code that the code_reward function does not pick up.

This reward should replace format_reward for code generation.
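
For readers skimming the thread, here is a minimal sketch of what a language-specific code format reward could look like. It is not the merged implementation: the factory name `make_code_format_reward`, the `<think>`/`<answer>` tag layout, and the completion structure (a list of message lists with a `content` key) are all assumptions made for illustration. The idea is to reward a completion only when its answer contains a fenced block tagged with the target language, so that a downstream code reward can reliably extract it.

```python
import re


def make_code_format_reward(language: str = "python"):
    """Build a reward function that checks for a language-tagged code fence.

    Hypothetical helper: the tag layout below is an assumed convention,
    not something confirmed by this thread.
    """
    pattern = re.compile(
        # Reasoning block, then an answer block that must contain
        # a fence opened with the requested language tag.
        rf"^<think>\n.*?\n</think>\n<answer>\n.*?```{re.escape(language)}\n.*?```.*?\n</answer>$",
        re.DOTALL,
    )

    def code_format_reward(completions, **kwargs):
        # Assumed shape: each completion is a list of chat messages,
        # and the first message holds the generated text.
        contents = [completion[0]["content"] for completion in completions]
        return [1.0 if pattern.match(text) else 0.0 for text in contents]

    return code_format_reward
```

With this sketch, a completion whose fence is untagged (or tagged with another language) scores 0.0 even if the code itself is valid, which makes the format requirement explicit to the model during training.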

@zeenolife
Contributor Author

@lewtun please

@zeenolife
Contributor Author

Or @kashif perhaps?

Member

@lewtun lewtun left a comment

Good catch on the code formatting @zeenolife ! This should certainly help the models generate code more consistently - LGTM

@lewtun lewtun merged commit 8322b31 into huggingface:main Feb 21, 2025
1 check passed
2 participants