-
University of Konstanz, ETH Zurich
- Konstanz, Germany
Highlights
- Pro
Pinned Loading
-
rlhfblender
rlhfblender PublicRLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback
-
multi-type-feedback
multi-type-feedback PublicImplementation for the paper: Reward Learning from Multiple Feedback Types (ICLR2025)
Jupyter Notebook 1
-
-
rlworkbench
rlworkbench PublicRL Workbench - A comprehensive, interactive framework for RL experimentation
JavaScript
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.