TLexDR
DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters
Listen on YouTube Share on X Share on Bluesky Link copied!
Core Takeaways
DeepSeek's R1 model is 27 times cheaper than OpenAI's o1 model, costing $2 per million tokens.
Why it matters This cost difference positions DeepSeek as a highly competitive alternative in the AI model market.
NVIDIA's H20 chip, despite having lower FLOPS than the H100, performs better on reasoning tasks due to higher memory bandwidth. ▶ 1:23:45
Why it matters Memory bandwidth is crucial for reasoning tasks, highlighting the importance of architecture beyond raw compute power.
China's AI development is bolstered by a trillion RMB subsidy, but US export controls limit their access to cutting-edge GPUs. ▶ 1:23:45
Why it matters These controls aim to maintain US technological dominance but could incentivize China to develop independent capabilities.
OpenAI's joint venture with Oracle involves a $100 billion infrastructure investment, but funding remains uncertain. ▶ 1:23:45
Why it matters The financial uncertainty could impact OpenAI's ability to scale its infrastructure and maintain its competitive edge.
DeepSeek's mixture of experts model activates only 37 billion of its 600 billion parameters, reducing compute costs. ▶ 1:23:45
Why it matters Efficient parameter activation allows for significant compute savings, making large models more accessible.

Detailed Insights

DeepSeek's Competitive Edge
+
DeepSeek's R1 model is significantly cheaper than OpenAI's o1 model.
The mixture of experts model reduces compute costs by activating only a subset of parameters.
DeepSeek's open weights and permissive licensing offer flexibility in AI development.
NVIDIA's Chip Architecture
+
NVIDIA's H20 chip excels in reasoning tasks due to its memory bandwidth.
The H20's architecture highlights the importance of memory over sheer FLOPS for specific tasks.
US-China AI Geopolitics
+
China's AI development is supported by significant subsidies but hampered by US export controls.
These controls could push China to develop its own technological capabilities independently.
OpenAI's Infrastructure Investment
+
OpenAI's joint venture with Oracle involves a massive infrastructure investment.
Funding for this venture is uncertain, impacting OpenAI's future scalability.

How the conversation moved

The episode begins with a discussion on DeepSeek's innovative AI models, particularly focusing on the cost efficiency and technical advancements of the DeepSeek R1 model compared to OpenAI's offerings. Nathan Lambert outlines how the mixture of experts model allows DeepSeek to reduce compute costs significantly by activating only a subset of parameters during training and inference. This approach not only makes DeepSeek's models more accessible but also positions them as a competitive alternative in the AI landscape dominated by OpenAI.

Dylan Patel then shifts the conversation to NVIDIA's H20 chip, emphasizing its superior performance in reasoning tasks despite having lower FLOPS than the H100. The H20's architecture, which prioritizes memory bandwidth over sheer computational power, exemplifies how specific design choices can enhance performance for particular tasks. This insight challenges the conventional focus on FLOPS as the primary metric for evaluating AI hardware, suggesting that memory architecture can be equally, if not more, important.

The conversation takes a geopolitical turn as the speakers discuss the implications of US export controls on China's AI capabilities. Patel argues that while these controls aim to maintain US technological dominance, they could inadvertently encourage China to develop independent technological capabilities. This tension highlights the complex interplay between technological advancement and geopolitical strategy, with significant implications for global AI development.

The episode concludes with an exploration of OpenAI's financial challenges, particularly in relation to its joint venture with Oracle for AI infrastructure. Despite the ambitious $100 billion investment plan, funding remains uncertain, raising questions about OpenAI's ability to scale its operations and maintain its competitive edge. This financial uncertainty underscores the broader challenges faced by AI companies in balancing innovation with the practicalities of funding and infrastructure development.

Surprising moments

Dylan Patel
Dylan Patel claims that US export controls could ultimately benefit China's AI development by pushing them towards self-sufficiency.
Nathan Lambert
Nathan Lambert highlights that NVIDIA's H20 chip outperforms the H100 in reasoning tasks due to its memory bandwidth, challenging the focus on FLOPS.

Topics Covered

DeepSeek's Competitive Edge NVIDIA's Chip Architecture US-China AI Geopolitics OpenAI's Infrastructure Investment

Memorable Quotes

"DeepSeek is doing fantastic work for disseminating understanding of AI." — Nathan Lambert
"The open weights are you have your fate of data in your own hands, and that is something that is deeply connected to the soul of open source." — Nathan Lambert
"The H20 is actually better for certain tasks. And that certain task is reasoning." — Nathan Lambert
"The money does not exist." — Dylan Patel

Still open

Unresolved by the end of the conversation

  • Nathan Lambert questioned whether the cost of deploying AGI capabilities at scale would delay their widespread implementation.

Jargon glossary

mixture of experts
An AI model architecture where only a subset of parameters are activated during training, reducing compute costs.
open weights
Model weights that are available for public download, often with varying licenses affecting their use.
export controls
Regulations that limit the export of certain technologies to maintain national security and technological advantage.

References & Resources

Llama 3 by Meta paper
Common Crawl by Common Crawl other
The Bitter Lesson by Richard Sutton article
GPT-4 by OpenAI other
Dario Amodei's blog post on export controls by Dario Amodei article

For the specialist

What a senior practitioner would find new

  • DeepSeek's mixture of experts model activates only 37 billion of its 600 billion parameters, demonstrating a significant reduction in compute costs compared to traditional models.
  • NVIDIA's H20 chip, despite lower FLOPS, outperforms the H100 in reasoning tasks due to its superior memory bandwidth, highlighting the importance of architecture in AI performance.

Ask this episode Premium

Ask any question about this episode — get an answer grounded in the transcript.

Available with Premium. $9.99/month, cancel anytime.

Upgrade to chat

Related episodes

Other Lex conversations that overlap with this one.

Keyu Jin: China's Economy, Tariffs, Trade, Trump, Communism & Capitalism

▶ 1h 49m

Cite this episode

For papers, blog posts, anywhere.

Copied!

AI-generated summary · last refreshed 2026-05-29 06:26:41 · how we make these

Quotes are matched verbatim against the source transcript; references are checked to resolve to real URLs. Even so, AI can misread structure or attribute claims imperfectly. If you spot an error, please let us know.

Report an inaccuracy →