Artificial Intelligence

Deep Residual Learning for Image Recognition

He, Zhang, Ren, Sun · 2015 · CVPR

Residual connections enabled training of very deep networks, winning ImageNet 2015.

Research objective

Solve the degradation problem in which adding layers to deep networks paradoxically harms training accuracy.

Methodology

Introduced 'identity shortcut' connections that let layers learn residual functions F(x) + x. Trained networks of 50, 101, and 152 layers on ImageNet.

Key findings

152-layer ResNet won ILSVRC 2015 with 3.57% top-5 error, surpassing human-level performance on the benchmark.
Residual connections dramatically eased optimization of deep networks.
Generalized to detection, segmentation, and many downstream tasks.

Strengths

Tiny architectural change with outsized impact.
Made very deep networks practically trainable.
Became a default building block across computer vision and beyond.

Limitations

Did not eliminate vanishing gradients entirely; depth still has diminishing returns.
Convolutions later partially supplanted by Vision Transformers in some regimes.

Practical implications

Residual connections appear in nearly every modern deep architecture, including Transformers.
Established depth-via-skip-connections as a foundational design pattern.

Read the original paper

Related entities

Atlas · machine-intelligence Glossary · neural-network Glossary · deep-learning

Related research

Attention Is All You Need

Introduced the Transformer, replacing recurrence with self-attention and reshaping modern AI.

Read summary

Mastering the Game of Go with Deep Neural Networks and Tree Search

AlphaGo combined deep networks and Monte Carlo tree search to defeat world-class Go players.

Read summary