Discussion about this post

Paul Triolo

Nice summary. This is only one potential response to the training challenge in China. It should be viewed alongside distributed training and heterogeneous hardware platforms as part of a set of responses to US controls and the focus on scaling. Even without controls, Chinese firms should be taking a different road, playing a different game. See my latest SS on distributed computing and other innovations in China's AI sector...

huhvgf6554

Hmm, I don't know if it can really be called a real DeepSeek moment. Trading off that much precision might kneecap an entire generation of models. And software-led AI growth is just not realistic. We can see from GPT-5, and from everyone coalescing and plateauing around the same level, that there are limits to using existing hardware to train new models. Hardware must grow commensurately.
