Discussion about this post

User's avatar
Joshua Saxe's avatar

Great post as always. It feels like the monolithic nature of the main final product -- a single model -- might be a temporary phenomenon in AI systems and accounts for a lot of the sensitivity of outcomes to exact org structure and even the character of a handful of leaders. I'm curious if you've thought about what a world in which we could have less monolithic artifacts (for example, imagine, say, MoE like model subcomponents that can be worked on in parallel, with router/adapter layers trained separately to mix them into multiple larger systems), and how this might derisk and stabilize modeling efforts.

Expand full comment
3 more comments...

No posts