|
| |
|
| Project SyndicateOpinion: DeepSeek took Big Tech by surprise. What else are America’s AI leaders missing?China’s AI advancements could represent a 21st-century ‘Sputnik moment’ for the U.S. |
By
Daron Acemoglu
Last Updated: Feb. 10, 2025, 10:38 a.m. ET First Published: Feb. 10, 2025, 7:46 a.m. ET
Photo: Getty Images
Referenced Symbols
DeepSeek exposed ‘groupthink’ among the U.S.’s artificial-intelligence players — and blindness to alternative, cheaper, more promising approaches.
The release of DeepSeek-R1 on Jan. 20 triggered a massive drop in chip company Nvidia Corp.’s NVDA
+3.14%
share price and sharp declines in various other tech-sector valuations. Some declared this a “ Sputnik moment” in the Sino-American race for supremacy in artificial intelligence. While America’s AI industry arguably needed shaking up, the episode raised some difficult questions.
The U.S. tech industry’s investments in AI have been massive, with Goldman Sachs estimating that “mega tech firms, corporations, and utilities are set to spend around $1 trillion on capital expenditures in the coming years to support AI.” Yet for a long time, many observers, including me, have questioned the direction of AI investment and development in the United States. With all the leading companies following essentially the same playbook (though Meta Platforms META
+0.12%
has differentiated itself slightly with a partly open-source model), the industry seems to have put all its eggs in the same basket. Without exception, U.S. tech companies are obsessed with scale. Citing yet-to-be-proven “ scaling laws,” they assume that feeding ever more data and computing power into their models is the key to unlocking ever-greater capabilities. Some even assert that “ scale is all you need.” Before Jan. 20, U.S. companies were unwilling to consider alternatives to foundation models pretrained on massive data sets to predict the next word in a sequence. Given their priorities, they focused almost exclusively on diffusion models and chatbots aimed at performing human (or humanlike) tasks. Though DeepSeek’s approach is broadly the same, it appears to have relied more heavily on reinforcement learning, mixture-of-experts methods (using many smaller, more efficient models), distillation and refined chain-of-thought reasoning. This strategy reportedly allowed it to produce a competitive model at a fraction of the cost.
Although there is some dispute about whether DeepSeek has told us the whole story, this episode has exposed “groupthink” within the AI industry in the U.S. Its blindness to alternative, cheaper, more promising approaches, combined with hype, is precisely what Simon Johnson and I predicted in “Power and Progress,” which we wrote just before the generative-AI era began. The question is whether the U.S. AI industry has other, even more dangerous blind spots. For example, are the leading U.S. tech companies missing an opportunity to take their models in a more “pro-human direction”? I suspect that the answer is yes, but only time will tell.
One company’s achievement is not conclusive evidence that China can beat more open societies at innovation.
Then there is the question of whether China is leapfrogging the U.S. If so, does this mean that authoritarian, top-down structures (what James A. Robinson and I have called “extractive institutions”) can match or even outperform bottom-up arrangements in driving innovation? My bias is to think that top-down control hampers innovation, as Robinson and I argued in ” Why Nations Fail.” While DeepSeek’s success appears to challenge this claim, it is far from conclusive proof that innovation under extractive institutions can be as powerful or as durable as under inclusive institutions. After all, DeepSeek is building on years of advances in the U.S. (and some in Europe). All of its basic methods were pioneered in the U.S. Mixture-of-experts models and reinforcement learning were developed in academic research institutions decades ago, and it was U.S. tech companies that introduced transformer models, chain-of-thought reasoning and distillation. What DeepSeek has done is demonstrate success in engineering — combining the same methods more effectively than U.S. companies did. It remains to be seen whether Chinese firms and research institutions can take the next step of coming up with game-changing techniques, products and approaches of their own.
Moreover, DeepSeek seems to be unlike most other Chinese AI firms, which generally produce technologies for the government or with government funding. If the company (which was spun out of a hedge fund) was operating under the radar, would its creativity and dynamism continue now that it is under the spotlight? Whatever happens, one company’s achievement cannot be taken as conclusive evidence that China can beat more open societies at innovation. Another question concerns geopolitics. Does the DeepSeek saga mean that U.S. export controls and other measures to hold back Chinese AI research failed? The answer here is also unclear. While DeepSeek trained its latest models (V3 and R1) on older, less powerful chips, it may still need the most powerful chips to achieve further advances and to scale up.
America’s zero-sum approach was unworkable and ill advised.
Read: DeepSeek is giving Big Tech the drubbing it deserves. Will it be a wake-up call? Nonetheless, it is clear that America’s zero-sum approach was unworkable and ill-advised. Such a strategy makes sense only if you believe we are heading toward artificial general intelligence, or AGI (models that can match humans on any cognitive task), and that whoever gets to AGI first will have a huge geopolitical advantage. By clinging to these assumptions — neither of which is necessarily warranted — we have prevented fruitful collaboration with China in many areas. For example, if one country produces models that increase human productivity or help us regulate energy better, such innovation would benefit both countries, especially if it is widely used. Like its American cousins, DeepSeek does aspire to develop AGI, and creating a model that is significantly cheaper to train could be a game changer. But bringing down development costs with known methods will not miraculously get us to AGI in the next few years. Whether near-term AGI is achievable remains an open question (and whether it is desirable is even more debatable). Even if we do not yet know all the details about how DeepSeek developed its models or what its apparent achievement means for the future of the AI industry, one thing seems clear: that a Chinese upstart has punctured the tech industry’s obsession with scale and may have even shaken it out of its complacency. Daron Acemoglu, a 2024 Nobel laureate in economics and Institute Professor of Economics at MIT, is a co-author (with Simon Johnson) of Power and Progress: Our Thousand-Year Struggle Over Technology and Prosperity (PublicAffairs, 2023). This commentary was published with the permission of Project Syndicate — A Sputnik Moment for AI? Read on: How DeepSeek could spur Big Tech to double down on AI dominance Why DeepSeek will bring AI to more companies
Partner Center
Most Popular
Buy Nvidia’s stock ahead of earnings? This analyst makes the case. Super Micro’s stock heads for longest winning streak in months, despite many unknowns Tesla’s stock falls again, and this analyst says why it’s an Elon Musk problem Why 2025 could become the 'year of the bond' ‘It is hard to watch them buy a Mercedes’: My father left $600K to my stepmother. Her family put the money in a trust. Is my inheritance lost?
What to Read Next
403 ERRORThe request could not be satisfied.The Amazon CloudFront distribution is configured to block access from your country. We can't connect to the server for this app or website at this time. There might be too much traffic or a configuration error. Try again later, or contact the app or website owner. If you provide content to customers through CloudFront, you can find steps to troubleshoot and help prevent this error by reviewing the CloudFront documentation. Generated by cloudfront (CloudFront) Request ID: 1aO-ZX4ggMKzc5Lzw5VjTJOuoEG_GbB1rddgp9tRiBj6f0t4ASKMOQ==
Show Conversation (0)
Advertisement
Advertisement
Back To Top
Copyright © 2025 MarketWatch, Inc. All rights reserved.
Terms of Use UPDATED Privacy Notice Cookie Notice
Intraday Data provided by FACTSET and subject to terms of use. Historical and current end-of-day data provided by FACTSET. All quotes are in local exchange time. Real-time last sale data for U.S. stock quotes reflect trades reported through Nasdaq only. Intraday data delayed at least 15 minutes or per exchange requirements.
 |