Reasoning & Inference Compute
OpenAI's o3: Over-optimization is back and weirder than ever
Tools, true rewards, and a new direction for language models.
Apr 19 • Nathan Lambert
Interconnects
RL backlog: OpenAI's many RLs, clarifying distillation, and latent reasoning
Notes I forgot to publish. Closing some loose ends in the reasoning model discussions.
Apr 5 • Nathan Lambert
Recent reasoning research: GRPO tweaks, base model RL, and data curation
The papers I endorse as worth reading among a cresting wave of reasoning research.
Mar 31 • Nathan Lambert
Gemini 2.5 Pro and Google's second chance with AI
The end of a busy spring of model improvements and what's next for the presumed leader in AI abilities.
Mar 26 • Nathan Lambert
Where inference-time scaling pushes the market for AI companies
Fundamentals emerging downstream from the RL reasoning models.
Mar 5 • Nathan Lambert
Claude 3.7 thonks and what's next for inference-time scaling
The latest reasoning model and what it says about the direction of inference time compute and RL training.
Feb 24 • Nathan Lambert
An unexpected RL Renaissance
New talk! Forecasting the Alpaca moment for reasoning models and why the new style of RL training is a far bigger deal than the emergence of RLHF.
Feb 13 • Nathan Lambert
Why reasoning models will generalize
People underestimate the long-term potential of “reasoning.”
Jan 28 • Nathan Lambert
DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs
Yes, ring the true o1 replication bells for DeepSeek R1 🔔🔔🔔. Where we go next.
Jan 21 • Nathan Lambert
Quick recap on the state of reasoning
My talk at the NeurIPS Latent Space live event.
Jan 2 • Nathan Lambert
OpenAI's o3: The grand finale of AI in 2024
A step change as influential as the release of GPT-4. Reasoning language models are the current big thing.
Dec 20, 2024 • Nathan Lambert
OpenAI's o1 using "search" was a PSYOP
How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought
Dec 4, 2024 • Nathan Lambert