Training Sand to Think: Artificial General Intelligence & Future of Physics

0h 54m video Published Jun 4, 2026 Transcribed Jul 28, 2026 Perimeter Institute for Theoretical Physics

Perimeter Institute for Theoretical Physics

Intermediate 11 min read For: General audience with some technical background; physicists, mathematicians, and AI enthusiasts.

AI Trust Score 85/100

✅ Highly Legit

"The title accurately reflects the content: the talk covers how LLMs (trained sand) achieve AGI-level reasoning and transform physics."

AI Summary

The speaker, a theoretical physicist, discusses the transformative impact of large language models (LLMs) on mathematics and physics. He explains how LLMs have evolved from preschool-level performance to surpassing PhD experts in exams, and have recently achieved novel mathematical research, including solving a major open problem. He argues that even without further progress, LLMs will revolutionize physics, and with continued scaling and algorithmic improvements, they will lead to a golden era of scientific discovery.

Chapters

1 Introduction and Motivation 00:03 2 How Large Language Models Work 02:14 3 Scaling Laws and Progress Drivers 06:39 4 Benchmark Performance: From Preschool to PhD 14:48 5 Techniques for Improvement 20:44 6 Novel Research and Major Breakthroughs 35:52 7 Future Outlook: Golden Era of Physics 49:10

[00:03]

Extraordinary Moment in History

We've figured out how to refine sand into silicon, turn it into chips, assemble them into neural networks, and train them to think.

[00:29]

Physicist's Shift to AI

The speaker stopped writing theoretical physics papers to contribute to building machines that produce knowledge on an industrial scale.

[01:18]

LLMs as General Intelligence

Large language models are not just special-purpose tools; they can do every part of a theoretical physicist's job, acting as a general intelligence.

[02:52]

How LLMs Work

LLMs are neural networks inspired by the human brain, grown rather than programmed, trained by predicting the next word in text.

[03:10]

Scale of LLMs

LLMs have grown from about a billion parameters in 2020 to a few trillion today, still short of the 100 trillion synapses in the human brain.

[05:37]

Pre-training and Post-training

Pre-training involves predicting the next word on the internet; post-training refines the model to be helpful and polite.

[07:04]

Scaling Laws

Physicists discovered scaling laws for LLMs: performance improves predictably with more compute, leading to the scaling era.

[09:01]

Scaling Law Graph

A log-log plot shows that spending more compute on training yields linear improvement in performance, a key insight for investors.

[11:57]

Drivers of Progress

The main driver is algorithmic progress, followed by scaling compute and money; Moore's law is a minor factor.

[14:48]

Early LLM Performance

In 2019, LLMs performed at preschool level; on the MATH benchmark, they scored 6% in 2021, while humans scored 40-90%.

[18:40]

Rapid Improvement

Prediction markets expected 50% by 2025, but LLMs reached 50% almost immediately and 90% by mid-2024, then near-perfect scores.

[20:44]

Techniques for Improvement

Key techniques include scale, better data, chain-of-thought prompting, reinforcement learning for long thinking, and multi-LLM conversations.

[26:11]

Graduate-Level Science

On the GPQA benchmark (graduate-level science), LLMs went from random guessing to perfect scores within 18 months.

[29:07]

Private Test Set

The speaker's own graduate exams from Stanford were solved with 100% accuracy by LLMs within 18 months.

[30:47]

International Math Olympiad

In 2024, LLMs achieved a gold medal score (5/6 problems) at the IMO, with solutions praised as clear and elegant.

[35:52]

Novel Mathematical Research

In late 2024, a centaur-style collaboration produced a novel proof that a top mathematician called 'the kind of insight I would have been proud to have produced myself.'

[47:28]

First Major Breakthrough

In 2026, an LLM solved the unit distance conjecture, a major open problem, marking the first major AI-generated mathematical breakthrough.

[50:55]

Chess Analogy

LLMs in science may follow a similar trajectory to chess computers: toy, tool, centaur, then superhuman, with AI becoming the dominant scientist.

[53:38]

Golden Era of Physics

Even without further progress, LLMs will revolutionize physics; with continued improvement, we'll have billions of AI Einsteins, leading to a golden era.

Large language models have rapidly advanced from preschool-level performance to surpassing PhD experts and making novel research breakthroughs. This progress, driven by scaling and algorithmic improvements, promises a golden era for physics and mathematics, with AI becoming an indispensable collaborator in scientific discovery.

Mentioned in this Video

Gemini

tool

ChatGPT

tool

Claude

tool

Minerva

tool

Max Math

tool

Strawberry

tool

Rich Sutton

person

Tim Gowers

person

Erdos

person

Study Flashcards (12)

What is the key difference between traditional computer programs and neural networks?

easy Click to reveal answer

Neural networks are grown, not programmed; they start with random weights and are trained by adjusting pathways based on prediction accuracy.