(Voiceover) Tülu 3: The next era in open post-training

Interconnects

We give you open-source, frontier-model post-training.

Nov 21, 2024

Original post:

Tülu 3: The next era in open post-training

Post-training, the craft of eliciting powerful behaviors from a raw pretrained language model, has gone through many seasons and moods since the release of ChatGPT. In the era of Alpaca, Vicuna, Koala, and Dolly, a limited number of human datapoints with extended synthetic data in the style of

Chapters

00:00 History

05:44 Technical details sneak peek

Figures

Fig 1, results: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/tulu3-img/results.webp

Fig 2, overview: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/tulu3-img/overview.webp

Fig 3, preferences: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/tulu3-img/preferences.webp

Fig 4, RLVR: https://huggingface.co/datasets/natolambert/interconnects-figures/resolve/main/tulu3-img/rlvr.webp
