🍃 GRPO - Do not load reference model when beta == 0 (#2806) #7541
Annotations
1 error
Test with pytest
Process completed with exit code 2.
|
Loading