llmalignment

Here is 1 public repository matching this topic...

holarissun / RewardModelingBeyondBradleyTerry

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

reward inverse-reinforcement-learning large-language-models rlhf reward-models largelanguagemodels reward-modeling llm-aligment llmalignment

Updated Apr 2, 2025
Python

Improve this page

Add a description, image, and links to the llmalignment topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llmalignment topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llmalignment

Here is 1 public repository matching this topic...

holarissun / RewardModelingBeyondBradleyTerry

Improve this page

Add this topic to your repo