Reasoning & Inference Compute

New Talk: Building Olmo 3 Think

Re-recording my NeurIPS talks in one mega-take.

Dec 10, 2025 • Nathan Lambert

1:02:21

How to scale RL

The most covetable research.

Oct 20, 2025 • Nathan Lambert

Thinking, Searching, and Acting

A reflection on reasoning models.

Sep 22, 2025 • Nathan Lambert

Crafting a good (reasoning) model

A recent talk I gave on model training, reasoning, and the next frontier.

Jun 18, 2025 • Nathan Lambert

30:25

The rise of reasoning machines

And a debate that doesn't warrant repeating.

Jun 12, 2025 • Nathan Lambert

What comes next with reinforcement learning

Scaling RL, sparse rewards, continual learning, and the progress wall when pretraining really stops.

Jun 9, 2025 • Nathan Lambert

A taxonomy for next-generation reasoning models

Where we've been and where we're going with RLVR.

Jun 4, 2025 • Nathan Lambert

Reinforcement learning with random rewards actually works with Qwen 2.5

Making sense of research casting doubt on the potential of RLVR and where I'm optimistic for the next phase of scaling.

May 27, 2025 • Nathan Lambert

OpenAI's o3: Over-optimization is back and weirder than ever

Tools, true rewards, and a new direction for language models.

Apr 19, 2025 • Nathan Lambert

RL backlog: OpenAI's many RLs, clarifying distillation, and latent reasoning

Notes I forgot to publish. Closing some loose ends in the reasoning model discussions.

Apr 5, 2025 • Nathan Lambert

Recent reasoning research: GRPO tweaks, base model RL, and data curation

The papers I endorse as worth reading among a cresting wave of reasoning research.

Mar 31, 2025 • Nathan Lambert

Gemini 2.5 Pro and Google's second chance with AI

The end of a busy spring of model improvements and what's next for the presumed leader in AI abilities.

Mar 26, 2025 • Nathan Lambert
