Subscribe
Sign in
Home
Podcast
Navigation
($) Discord
Archive
About
Latest
Top
Discussions
The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data
Emergency special: The information we need to understand what Q* is was right in front of us, but the memes are more fun than reality.
Nov 24, 2023
•
Nathan Lambert
102
Share this post
Interconnects
The Q* hypothesis: Tree-of-thoughts reasoning, process reward models, and supercharging synthetic data
Copy link
Facebook
Email
Notes
More
5
Behind the curtain: what it feels like to work in AI right now (April 2023)
Fear, FOMO, and the scientific exodus driven by ChatGPT
Apr 5, 2023
•
Nathan Lambert
98
Share this post
Interconnects
Behind the curtain: what it feels like to work in AI right now (April 2023)
Copy link
Facebook
Email
Notes
More
21
DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs
Yes, ring the true o1 replication bells for DeepSeek R1 🔔🔔🔔. Where we go next.
Jan 21
•
Nathan Lambert
232
Share this post
Interconnects
DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs
Copy link
Facebook
Email
Notes
More
2
DeepSeek V3 and the actual cost of training frontier AI models
The $5M figure for the last training run should not be your basis for how much frontier AI models cost.
Jan 9
•
Nathan Lambert
125
Share this post
Interconnects
DeepSeek V3 and the actual cost of training frontier AI models
Copy link
Facebook
Email
Notes
More
8
OpenAI's o3: Over-optimization is back and weirder than ever
Tools, true rewards, and a new direction for language models.
Apr 19
•
Nathan Lambert
120
Share this post
Interconnects
OpenAI's o3: Over-optimization is back and weirder than ever
Copy link
Facebook
Email
Notes
More
GPT-4.5: "Not a frontier model"?
OpenAI's latest model raises more questions than answers, but no, the AI bubble isn't popping quite yet.
Feb 28
•
Nathan Lambert
84
Share this post
Interconnects
GPT-4.5: "Not a frontier model"?
Copy link
Facebook
Email
Notes
More
Reverse engineering OpenAI’s o1
What productionizing test-time compute shows us about the future of AI. Exploration has landed in language model training.
Sep 16, 2024
•
Nathan Lambert
133
Share this post
Interconnects
Reverse engineering OpenAI’s o1
Copy link
Facebook
Email
Notes
More
2
Qwen 3: The new open standard
A wonderful release, base models, reasoners, model size scales, and all before LlamaCon.
Apr 28
•
Nathan Lambert
81
Share this post
Interconnects
Qwen 3: The new open standard
Copy link
Facebook
Email
Notes
More
Why reasoning models will generalize
People underestimate the long-term potential of “reasoning.”
Jan 28
•
Nathan Lambert
203
Share this post
Interconnects
Why reasoning models will generalize
Copy link
Facebook
Email
Notes
More
7
Llama 4: Did Meta just push the panic button?
One of the weirdest releases of the year and understanding the future of the Llama endeavor. For the time being, we have some more amazing open weight…
Apr 7
•
Nathan Lambert
93
Share this post
Interconnects
Llama 4: Did Meta just push the panic button?
Copy link
Facebook
Email
Notes
More
Sycophancy and the art of the model
GPT-4o-simp, LMArena backlash, and people refusing to understand how messy and crucial RLHF is.
May 4
•
Nathan Lambert
53
Share this post
Interconnects
Sycophancy and the art of the model
Copy link
Facebook
Email
Notes
More
1
Llama 2: an incredible open LLM
Meta is continuing to deliver high-quality research artifacts and not backing down from pressure against open source.
Jul 18, 2023
•
Nathan Lambert
60
Share this post
Interconnects
Llama 2: an incredible open LLM
Copy link
Facebook
Email
Notes
More
7
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts