TLexDR
State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | Lex Fridman Podcast #490
Listen on YouTube Share on X Share on Bluesky Link copied!
Core Takeaways
DeepSeek's R1 model set a new benchmark in 2025 by achieving state-of-the-art performance at reduced compute costs.
Why it matters DeepSeek's efficiency could disrupt the AI landscape by lowering barriers to entry for high-performance models.
Chinese AI firms like Z.ai are gaining ground with open-weight models, challenging DeepSeek's dominance. ▶ 2:00
Why it matters The rise of Chinese open-weight models could shift global AI leadership and innovation hubs.
GPT-5.2's context score improvement from 30% to 70% highlights significant algorithmic advances. ▶ 12:00
Why it matters GPT-5.2's advancements could redefine the capabilities and applications of language models.
Reinforcement Learning with Verifiable Rewards (RLVR) can dramatically improve model accuracy in just 50 steps. ▶ 1:14:00
Why it matters RLVR's efficiency in improving accuracy could make it a cornerstone of future AI training methodologies.
The AI ecosystem in China is rapidly advancing, with multiple open-weight models emerging in 2026. ▶ 2:00
Why it matters China's AI advancements could challenge US dominance, prompting strategic shifts in AI policy and investment.

Detailed Insights

AI Competition
+
DeepSeek's R1 model set a new industry standard in 2025.
Chinese firms like Z.ai are challenging DeepSeek with open models.
US models outperform Chinese models in outputs, affecting market dynamics.
Model Performance and Usage
+
GPT-5.2's context score improved from 30% to 70%.
Open models are known for weights, not usability.
Claude Code offers an engaging interface for projects.
Reinforcement Learning and AI Training
+
RLVR can improve model accuracy significantly in 50 steps.
Scaling laws predict relationships between compute and accuracy.
RLHF scaling does not show linear performance improvements.
Chinese AI Ecosystem
+
China's AI advancements challenge US dominance.
Multiple Chinese open-weight models emerged in 2026.
Chinese models use larger Mixture of Experts architectures.

How the conversation moved

The episode begins with Lex Fridman framing the discussion around the current state and future trajectory of AI, focusing on the competitive landscape between US and Chinese AI firms. Sebastian Raschka and Nathan Lambert dive into the dynamics of AI models and companies, highlighting the impact of DeepSeek's R1 model, which set a new benchmark in 2025. They discuss the evolving dynamics between US and Chinese firms, noting that Chinese companies like Z.ai are gaining ground with competitive open-weight models, challenging DeepSeek's dominance.

Raschka argues that no single company will have exclusive access to technology due to frequent job changes among researchers. Lambert adds that the hype around Anthropic's Claude Opus 4.5 has overshadowed Google's Gemini 3, despite Gemini being a strong model. They explore the advancements in AI models, with GPT-5.2's context score improvement from 30% to 70% indicating significant algorithmic advancements. The conversation also touches on the role of open-weight models, with Chinese firms releasing models that are gaining popularity due to their unrestricted licenses.

Despite the comprehensive exploration of AI advancements, there is little pushback from Lex on the guests' claims. The conversation lacks explicit challenges to the optimistic view of AI's trajectory, such as the potential risks of open-weight models or the sustainability of current AI development practices. However, Raschka does question the notion that AI could replace the joy of debugging, arguing that the struggle is part of the learning process, which could be diminished by over-reliance on AI tools.

The discussion concludes with an examination of the broader implications of AI advancements, including the economic impact and the potential for AI to drive significant GDP growth over time. Lambert and Raschka emphasize the importance of open models in the US to compete with China's rapidly advancing AI ecosystem. They also touch on the historical impact of GPUs on deep learning and future technological advancements, suggesting that the future of technology may not solely be defined by AI but by the broader concept of computing and its evolution.

Surprising moments

Sebastian Raschka
Raschka argues that the joy of debugging is part of the learning process, which AI might diminish.
Nathan Lambert
Lambert disagrees with the AI 2027 report's timeline for achieving a superhuman coder, expressing skepticism.
Nathan Lambert
Lambert pushes back against the idea of a universal AI model, stating that specialized models are more realistic.

Topics Covered

AI Competition Model Performance and Usage Reinforcement Learning and AI Training Chinese AI Ecosystem

Memorable Quotes

"I don’t think nowadays, in 2026, that there will be any company having access to a technology that no other company has access to." — Sebastian Raschka
"DeepSeek is definitely winning the hearts of the people who work on open weight models because they share these as open models." — Sebastian Raschka
"Honestly, building an LLM from scratch is a lot of fun and a lot to learn." — Sebastian Raschka
"The idea of scaling laws came when people figured out that that was a very predictable relationship." — Nathan Lambert
"I think we’ll slowly push it out as AI solves more compelling tasks—like the likes of Claude Opus 4.5 making Claude Code just work for things." — Nathan Lambert
"The only fair way to evaluate an LLM is to have a new benchmark that is after the cutoff date when the model was deployed." — Sebastian Raschka
"I think the trick with the book is basically to understand how the LLM works." — Sebastian Raschka
"I think there will be some other big multi-billion dollar acquisitions, like Perplexity." — Nathan Lambert
"The US should be building the best models so that the best research happens in the US and those US companies take the value from being the home of where AI research is happening." — Nathan Lambert

Still open

Unresolved by the end of the conversation

  • Raschka questioned how future developers will learn if they rely too heavily on AI, suggesting it could hinder their development into experts.
  • Lambert expressed skepticism about the feasibility of fully automating programming in the near future.

Jargon glossary

open weight models
AI models whose parameters are publicly available for research and development.
Mixture of Experts (MoE)
A neural network architecture that uses multiple expert networks to improve performance.
Reinforcement Learning with Verifiable Rewards (RLVR)
A method where models learn from their outputs and improve accuracy through a generate-grade loop.

References & Resources

Thank you for listening ❤ Check out our sponsors
https://lexfridman.com/ai-sota-2026-transcript
https://lexfridman.com/wordpress/wp-content/uploads/2026/01/ai_models_2025.png
*Feedback* - give feedback to Lex
*AMA* - submit questions, videos or call-in
*Hiring* - join our team
*Other* - other ways to get in touch
Nathan's X
Nathan's Blog
Nathan's Website
Nathan's YouTube
Nathan's GitHub
Nathan's Book
Sebastian's X
Sebastian's Blog
Sebastian's Website
Sebastian's YouTube
Sebastian's GitHub
Build a Large Language Model (From Scratch)
Build a Reasoning Model (From Scratch)
Go to
Go to
Go to
Go to
Go to
Go to
Go to
Go to
- Podcast Website
- Apple Podcasts
- Spotify
- RSS
- Podcast Playlist
- Clips Channel
- X
- X
- Instagram
- TikTok
- LinkedIn
- Facebook
- Patreon
- Telegram
- Reddit
Attention Is All You Need by Vaswani et al. paper
AI2027 report by Unknown other

For the specialist

What a senior practitioner would find new

  • RLVR's generate-grade loop links training and inference-time scaling, offering a new paradigm in AI training.
  • Chinese open-weight models' unrestricted licenses make them more appealing for global research and development.
  • GPT-5.2's context score leap from 30% to 70% indicates a significant leap in algorithmic performance.

Ask this episode Premium

Ask any question about this episode — get an answer grounded in the transcript.

Available with Premium. $9.99/month, cancel anytime.

Upgrade to chat

Cite this episode

For papers, blog posts, anywhere.

Copied!

AI-generated summary · last refreshed 2026-05-31 17:04:54 · how we make these

Quotes are matched verbatim against the source transcript; references are checked to resolve to real URLs. Even so, AI can misread structure or attribute claims imperfectly. If you spot an error, please let us know.

Report an inaccuracy →