Interconnects

Interconnects

Share this post

Interconnects
Interconnects
RLHF roundup: Getting good at PPO, sketching RLHF’s impact, RewardBench retrospective, and a reward model competition

RLHF roundup: Getting good at PPO, sketching…

Nathan Lambert
Jun 26, 2024
17

Share this post

Interconnects
Interconnects
RLHF roundup: Getting good at PPO, sketching RLHF’s impact, RewardBench retrospective, and a reward model competition
1

Things to be aware of if you work on language model fine-tuning.

Read →
Comments
User's avatar
© 2025 Interconnects AI, LLC
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share