Artificial Intelligence

Mastering the Game of Go with Deep Neural Networks and Tree Search

Silver et al. · 2016 · Nature

AlphaGo combined deep networks and Monte Carlo tree search to defeat world-class Go players.

Research objective

Build an artificial agent capable of expert-level Go play, long considered out of reach for AI.

Methodology

Combined supervised policy networks trained on human games, a value network for board evaluation, and Monte Carlo tree search. Later self-play refined the policy via reinforcement learning.

Key findings

Defeated European champion Fan Hui 5-0 and world champion Lee Sedol 4-1.
Demonstrated that combining learned heuristics with search can outperform purely human strategy.
Move 37 in game 2 vs. Lee Sedol shocked the Go community as a novel, beautiful move.

Strengths

Hybrid neural + search approach captured both intuition and deliberation.
Generalized to AlphaZero (chess, shogi) and AlphaFold (protein folding).

Limitations

Required massive compute and game-specific engineering.
Narrow expertise - no transfer to other domains.

Practical implications

Renewed interest in combining deep learning with explicit search.
Established DeepMind as a frontier AGI research lab.

Read the original paper

Related entities

Scientist · demis-hassabis Atlas · machine-intelligence

Related research

Attention Is All You Need

Introduced the Transformer, replacing recurrence with self-attention and reshaping modern AI.

Read summary

Deep Residual Learning for Image Recognition

Residual connections enabled training of very deep networks, winning ImageNet 2015.

Read summary