RLHFlow

university
Activity Feed

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

RLHFlow 's collections 12

RLHFLow Reward Models
Reward models trained by RLHFlow codebase (https://github.com/RLHFlow/RLHF-Reward-Modeling/)
Standard-format-preference-dataset
We collect the open-source datasets and process them into the standard format.
RM-Bradley-Terry
We train the reward model as the maximum likelihood estimation of the Bradley-Terry model.
Online RLHF
Datasets, code, and models for online RLHF (i.e., iterative DPO)
SFT Models
We train a series of SFT models on the high-quality SFT dataset of RLHFlow for research purpose.
Standard-format-preference-dataset
We collect the open-source datasets and process them into the standard format.
RM-Bradley-Terry
We train the reward model as the maximum likelihood estimation of the Bradley-Terry model.
Online RLHF
Datasets, code, and models for online RLHF (i.e., iterative DPO)
RLHFLow Reward Models
Reward models trained by RLHFlow codebase (https://github.com/RLHFlow/RLHF-Reward-Modeling/)
SFT Models
We train a series of SFT models on the high-quality SFT dataset of RLHFlow for research purpose.