Subscribe
Sign in
RL backlog: OpenAI's many RLs, clarifying…
Nathan Lambert
Apr 5
52
10
Notes I forgot to publish. Closing some loose ends in the reasoning model discussions.
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
RL backlog: OpenAI's many RLs, clarifying…
Notes I forgot to publish. Closing some loose ends in the reasoning model discussions.