Skip to content

Refactor training configs and unify Slurm for training SFT & GRPO (#231) #370

Refactor training configs and unify Slurm for training SFT & GRPO (#231)

Refactor training configs and unify Slurm for training SFT & GRPO (#231) #370