Understanding LLMs Like Physicists: Observation, Hypothesis, Experimentation, and Prediction // TRAIN BRAIN

A Google TechTalk, presented by Tianyu Guo, 2025-02-20
Google Algorithms Seminar
ABSTRACT: Methodologies from physics have recently inspired new research paradigms for the scientific understanding of LLMs. In physics, knowledge often emerges through four stages: observing nature, forming hypotheses, conducting controlled experiments, and making real-world predictions. Here, I present two independent mechanisms discovered in LLMs by following this methodology.
Dormant Heads: LLMs deactivate certain attention heads when they are irrelevant to the current task. A given head may serve a specific function, and when faced with an unrelated prompt, it becomes dormant, concentrating all attention on the first token.
Random Guessing in Two-Hop Reasoning: Pretrained LLMs resort to random guesses when distractions are present in two-hop reasoning. A well-designed supervised fine-tuning dataset can solve this issue.
I will discuss how these mechanisms emerge through observations, how hypotheses are formed, how we design and analyze controlled experiments, and how these mechanisms are validated in LLMs.
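As a rough illustration of the dormant-head description in the abstract, the sketch below scores how much of a head's attention lands on the first token. It is not the speaker's method: the function names `first_token_mass` and `is_dormant`, the 0.9 threshold, and the toy attention matrices are all assumptions made for this example. It assumes a causal attention matrix in which each row holds one query position's weights and sums to 1.

```python
from typing import Sequence


def first_token_mass(attn: Sequence[Sequence[float]]) -> float:
    """Mean attention weight placed on position 0.

    attn[i][j] is the weight query position i gives to key position j.
    Query 0 is skipped: under a causal mask it can only attend to itself,
    so it would count as "all attention on the first token" trivially.
    """
    if len(attn) < 2:
        raise ValueError("need at least two query positions")
    rows = attn[1:]
    return sum(row[0] for row in rows) / len(rows)


def is_dormant(attn: Sequence[Sequence[float]], threshold: float = 0.9) -> bool:
    """Flag a head as dormant when most attention sits on the first token.

    The threshold is a hypothetical choice for this sketch, not a value
    taken from the talk.
    """
    return first_token_mass(attn) >= threshold


# Toy head that spreads attention over the context (made-up numbers).
active_head = [
    [1.0, 0.0, 0.0],
    [0.4, 0.6, 0.0],
    [0.2, 0.3, 0.5],
]

# Toy head that puts nearly all attention on the first token (made-up numbers).
dormant_head = [
    [1.0, 0.0, 0.0],
    [0.98, 0.02, 0.0],
    [0.96, 0.02, 0.02],
]

if __name__ == "__main__":
    print(is_dormant(active_head))   # False
    print(is_dormant(dormant_head))  # True
```

In practice the matrices would come from a model's per-head attention outputs for a given prompt, and the score could be compared across prompts that are relevant or irrelevant to a head's presumed function.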
ABOUT THE SPEAKER: Tianyu Guo is a third-year PhD student in the UC Berkeley Statistics Department, advised by Song Mei and Michael I. Jordan. His research focuses on the interpretability of large language models and on causal inference.
