Subscribe
Sign in
RLHF Roundup: Trying to get good at PPO…
Nathan Lambert
Jun 26, 2024
Things to be aware of if you work on language model fine-tuning.
Listen →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
RLHF Roundup: Trying to get good at PPO…
Things to be aware of if you work on language model fine-tuning.