Subscribe
Sign in
RLHF roundup: Getting good at PPO, sketching…
Nathan Lambert
Jun 26, 2024
17
1
Things to be aware of if you work on language model fine-tuning.
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
RLHF roundup: Getting good at PPO, sketching…
Things to be aware of if you work on language model fine-tuning.