Interconnects

Interconnects

Share this post

Interconnects
Interconnects
The implicit dynamics of optimizing costs vs. rewards vs. preferences
Copy link
Facebook
Email
Notes
More

The implicit dynamics of optimizing costs vs…

Nathan Lambert
Mar 27, 2023
15

Share this post

Interconnects
Interconnects
The implicit dynamics of optimizing costs vs. rewards vs. preferences
Copy link
Facebook
Email
Notes
More

With the emergence of reinforcement learning from human feedback, we've been applying old techniques with a new guiding function (🤫 RLHF).

Read →
Comments
User's avatar
© 2025 Interconnects AI, LLC
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More