Skip to content
@policy-gradient

policy-gradient

Popular repositories Loading

  1. GRPO-Zero GRPO-Zero Public

    Implementing DeepSeek R1's GRPO algorithm from scratch

    Python 1.2k 38

Repositories

Showing 1 of 1 repositories
  • GRPO-Zero Public

    Implementing DeepSeek R1's GRPO algorithm from scratch

    policy-gradient/GRPO-Zero’s past year of commit activity
    Python 1,212 Apache-2.0 38 4 0 Updated Apr 18, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…