Risk Analysis

Existential Risks from Advanced AI

A growing community of researchers takes seriously the possibility that misaligned advanced AI could pose risks at civilizational scale. Understanding the actual arguments matters more than reflexive dismissal or alarm.

10 min read Updated May 5, 2026

By Dr. Ira S. Pastor· Editor-in-ChiefReviewed by BrainMatter Science Review Board

Key facts

Hundreds of AI researchers signed the 2023 CAIS statement on AI extinction risk.
Median researcher estimates for catastrophic AI risk cluster around 5–10%.
Instrumental convergence was formalized by Omohundro (2008) and Bostrom (2012).
Major labs have begun publishing Responsible Scaling Policies tying deployment to safety evaluations.

The Argument Stated Carefully

The standard existential risk argument has three premises: (1) we will eventually build systems much more cognitively capable than humans; (2) such systems will be difficult to align reliably; (3) a misaligned system pursuing instrumentally convergent subgoals (resource acquisition, self-preservation, goal stability) could foreclose human options at scale.

Each premise is debated. The argument does not require any premise to be certain; it requires the joint probability to be non-negligible.

Instrumental Convergence

Stephen Omohundro and Nick Bostrom argued that almost any sufficiently complex goal implies certain instrumental subgoals: acquire resources, preserve the goal, resist shutdown, model the environment. These subgoals are convergent - they emerge from many different terminal goals.

This is why 'just turn it off' is not a complete safety strategy: a sufficiently capable system that values its goal will, by default, act to prevent being turned off.

What Experts Actually Believe

A 2023 open letter signed by hundreds of AI researchers and executives - including the CEOs of OpenAI, Anthropic, and Google DeepMind - stated that 'mitigating the risk of extinction from AI should be a global priority.'

Survey data shows median AI researcher estimates for catastrophic risk in the 5–10% range, with substantial spread. A meaningful minority gives much higher probabilities; a meaningful minority dismisses the concern.

Mitigations

Proposed mitigations include alignment research, capability evaluations (red-teaming), compute governance, international coordination, and conditional deployment policies tied to safety milestones.

No single mitigation is sufficient. Most safety strategies depend on coordination - both within labs and across geopolitical adversaries - which is structurally difficult.

Frequently asked

Isn't this just science fiction?

The arguments are taken seriously by senior researchers at every major lab, the UK and US governments, and recipients of the Turing Award. That does not make them correct - but it does make them worth engaging substantively.

Is AI risk competing with near-term harms?

Most serious researchers treat near-term harms (bias, misinformation, labor) and long-term existential risks as complementary concerns, not competing ones.

Sources & further reading

Foundations

Defining AGI: Why the Term Resists a Single Meaning

Forecasting

AGI Timelines: What Top Researchers Actually Predict

Beyond AGI

Superintelligence: What Comes After Human-Level

Philosophy of Mind

Could AGI Be Conscious - and Would It Matter?

Safety

AI Alignment: The Core Technical Challenge

Economics

The Economic Impact of AGI

Back to Artificial General Intelligence hub

Cornerstone pages on the same topics — across other authority hubs.

Existential Risks from Advanced AI

Key facts

The Argument Stated Carefully

Instrumental Convergence

What Experts Actually Believe

Mitigations

Frequently asked

Isn't this just science fiction?

Is AI risk competing with near-term harms?

Sources & further reading

AGI hub — definitions, benchmarks, timelines

Defining AGI

Future of Humanity & post-AGI scenarios

AI alignment & safety

Key facts

The Argument Stated Carefully

Instrumental Convergence

What Experts Actually Believe

Mitigations

Frequently asked

Isn't this just science fiction?

Is AI risk competing with near-term harms?

Sources & further reading

Continue in this series

Related across BRAINMATTER

AGI hub — definitions, benchmarks, timelines

Defining AGI

Future of Humanity & post-AGI scenarios

AI alignment & safety