official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
reward inverse-reinforcement-learning large-language-models rlhf reward-models largelanguagemodels reward-modeling llm-aligment llmalignment
-
Updated
Apr 2, 2025 - Python