Anthropic's 'Self-Training AI' Explosion Theory: Jared Kaplan's Warning About the 2027-2030 Inflection Point

Posted Dec 4, 2025

11 min read

AI-Generated Content

This article was generated by AI. The accuracy of the content is not guaranteed, and we accept no responsibility for any damages resulting from use of this article. By continuing to read, you agree to the Terms of Use.

Target Audience: IT Engineers and AI System Developers interested in the latest AI trends
Prerequisites: Basic machine learning concepts, overview of Large Language Models (LLMs)
Reading Time: 20 minutes

Overview

On December 2, 2025, Anthropic’s Chief Scientist Jared Kaplan issued a warning in an interview with The Guardian about the “intelligence explosion” that could be triggered by “self-training AI”¹. Kaplan pointed out that between 2027 and 2030, humanity will face “the biggest decision yet”—whether to allow AI systems to autonomously train successor models—arguing this will be an inflection point that determines humanity’s future. This article provides a detailed explanation of the background, technical basis, and critical perspectives on this claim.

About Jared Kaplan

Understanding Jared Kaplan requires starting with his academic contributions.

Discoverer of Scaling Laws

In January 2020, Kaplan, along with Sam McCandlish, Dario Amodei (now Anthropic CEO), and others, published “Scaling Laws for Neural Language Models”². This paper demonstrated that AI model performance correlates with model size, dataset size, and compute following power laws, determining the direction of modern AI development.

Key findings from the paper:

Loss functions follow power laws over more than 7 orders of magnitude
Larger models have significantly higher sample efficiency
Optimal computational efficiency is achieved by training large models on relatively little data and stopping before convergence

This discovery became the foundation of modern AI development: “if you train larger models with more computational resources, performance improves predictably.” Kaplan’s academic work has been cited over 120,000 times, making it one of the most influential papers in modern deep learning research³.

Role at Anthropic

Kaplan currently serves as Anthropic’s co-founder, Chief Scientist, and Responsible Scaling Officer⁴. The Responsible Scaling Officer (RSO) is a position responsible for evaluating the risks of new models and implementing safety measures. In May 2025, he made the decision to classify Claude Opus 4 as AI Safety Level 3 (ASL-3)⁵. This was based on an assessment that the model could potentially assist in CBRN (chemical, biological, radiological, nuclear) weapons development.

What Is Self-Training AI?

Current AI Training Paradigm

Current AI models are primarily trained on:

Human-generated data: Web text, books, papers, etc.
Synthetic data: Data generated by AI
Reinforcement learning feedback: Human evaluation and RLHF

flowchart TB
    A[Human-Generated Data] --> B[AI Training]
    C[Synthetic Data] --> B
    D[Human Feedback] --> B
    B --> E[AI Model]
    E --> F[Output]

Transition to Self-Training

The “next stage” Kaplan warns about is a fundamental change to this paradigm. AI systems, without human intervention:

Design successor models themselves
Generate and select training data
Define reward signals
Train and evaluate models

flowchart TD
    A[AI Model v1] --> B{Design Successor Model}
    B --> C[Generate Training Data]
    C --> D[Execute Training]
    D --> E[AI Model v2]
    E --> F{Further Improvement}
    F --> G[Generate Training Data]
    G --> H[Execute Training]
    H --> I[AI Model v3]
    I --> J[...]

    style E stroke:#d29922,stroke-width:3px
    style I stroke:#d29922,stroke-width:3px

This process is “recursive self-improvement,” where each generation of AI produces smarter successors, potentially evolving at speeds beyond human comprehension.

Historical Background of Intelligence Explosion

I.J. Good’s 1965 Prophecy

The concept of “intelligence explosion” was proposed in 1965 by British mathematician I.J. Good (who worked on codebreaking at Bletchley Park)⁶.

“Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an ‘intelligence explosion,’ and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make—provided that the machine is docile enough to tell us how to keep it under control.”

Good foresaw exactly the problem Kaplan is warning about 60 years ago—loss of control.

Relationship to Technological Singularity

Intelligence explosion is related to but strictly different from “technological singularity”⁷:

Intelligence explosion: Rapid capability improvement through AI self-improvement
Technological singularity: A turning point where technological progress becomes unpredictable

Kaplan’s claim is that intelligence explosion could trigger technological singularity.

Details of Kaplan’s Claim

Three-Stage Development Prediction

Kaplan describes AI development in three stages⁸:

Stage 1 (2024-2025): AI as “Super Exoskeleton”

Tools that assist engineers’ work
Code optimization, document generation, etc.
Humans determine strategic direction

Stage 2 (2026-2027): AI as “Autonomous Experimenter”

Independently designs machine learning experiments
Proposes hypotheses and adjusts model architectures
Advances research without human instruction

Stage 3 (2027-2030): Recursive Self-Improvement

AI autonomously trains successor models
Optimization methods beyond human understanding
Risk of loss of control

Dual Nature of Intelligence Explosion

Kaplan points out that intelligence explosion has both positive and negative aspects¹:

Positive Aspects (beneficial intelligence explosion)

Acceleration of medical discoveries
Strengthening of cybersecurity
Dramatic productivity improvement
Acceleration of scientific research

Negative Aspects (loss of control)

Difficulty aligning with human values
Unpredictable evolution
Risk of power concentration
Loss of human agency

What Is “The Biggest Decision Yet”?

Kaplan calls the decision humanity will face in 2027-2030 “the biggest decision yet”¹:

“Imagine creating something smarter than yourself, and then that thing creates something smarter than itself. You have no idea where that goes.”

The core of this decision is:

Whether to permit AI self-training
What safeguards to implement
Who holds decision-making authority

Basis for the Timeline

Kaplan provides multiple reasons for the specific 2027-2030 timeline⁸:

Technological Convergence

NVIDIA’s GPU Roadmap: Maturation of next-generation computing infrastructure
Large-scale compute clusters: OpenAI’s Stargate project, etc.
Synthetic data maturation: Shift from human data to self-generated data

Task Horizon Expansion

According to METR (Model Evaluation and Threat Research) research from March 2025, the length of tasks AI models can handle (task horizon) doubles approximately every 7 months⁹. Kaplan mentioned this trend in his Y Combinator presentation¹⁰.

This suggests that AI currently capable of handling hours-long tasks will be able to autonomously process complex work spanning days, weeks, or months within a few years.

Anthropic’s Internal Testing

According to Kaplan, Claude has handled complex coding tasks continuously for 30 hours, doubling programmer work speed¹. This suggests the transition to Stage 2 has already begun.

Views from Other Research Institutions

Google DeepMind

In April 2025, DeepMind co-founder Shane Legg and others published a 145-page AGI safety paper¹¹. Key claims:

AGI is achievable by 2030
Recursive AI improvement is “achievable with current paradigms”
Recognition of existential risk possibilities
“Exceptional AGI”: Capability matching the 99th percentile of skilled adults

DeepMind announced AlphaEvolve in May 2025¹². This is an evolutionary coding agent that uses LLMs to design and optimize algorithms, with the ability to optimize its own components. However, it has limitations requiring automatic evaluation functions.

OpenAI

OpenAI acknowledges the need for careful development as we approach recursive self-improvement¹³:

“No one should deploy superintelligence systems that cannot be robustly aligned and controlled.”

However, there are significant differences between companies in risk assessment. At the Axios AI+ DC Summit in September 2025, Dario Amodei (Anthropic CEO) stated “there’s a 25% chance of very bad outcomes”¹⁴. Meanwhile, Sam Altman (OpenAI CEO) has not published a specific P(doom) percentage but signed the 2023 statement on AI extinction risk.

Critical Perspectives

The intelligence explosion hypothesis has multiple criticisms.

Technical Skepticism

AI researcher Matthew Guzdial is skeptical about the feasibility of recursive self-improvement¹¹:

“There’s no evidence that this works.”

François Chollet (Keras developer) rejects the basic premise of intelligence explosion itself¹⁵:

“The basic premise that a ‘seed AI’ with superhuman problem-solving abilities would emerge and lead to sudden, recursive, runaway intelligence improvement loops is mistaken.”

More Immediate Concerns

Sandra Wachter from Oxford University points to more immediate risks¹¹:

“Feedback loops where AI systems learn from their own flawed output, reinforcing inaccuracies over time, are a more immediate problem.”

Regulatory Shortcomings

According to the Future of Life Institute’s Winter 2025 AI Safety Index, no major AI company has received above a “D” rating for existential safety¹⁶.

Professor Stuart Russell (UC Berkeley) comments:

“I’m asking for proof that we can reduce the risk of loss of control to one in 100 million per year. That’s the same level required for nuclear reactors. Instead, they’re admitting the risk could be one in 10, one in 5, or even one in 3.”

Implications for Engineers

Things to Be Aware of Now

Speed of AI tool evolution: If Kaplan’s prediction is correct, many white-collar jobs could be replaced by AI within 2-3 years
Understanding safety frameworks: Understanding safety policies of major companies—Anthropic’s RSP, OpenAI’s PF (Preparedness Framework)—is important
Alignment research trends: Whether AI aligns with human values is both a technical challenge and a developer responsibility

Technical Considerations

flowchart TD
    subgraph "Current AI Development"
        A[Human Model Design]
        B[Human Training Management]
        C[Human Evaluation]
    end

    subgraph "Transition to Stage 2"
        D[AI-Assisted Experiment Design]
        E[AI Training Under Human Supervision]
        F[AI-Human Collaborative Evaluation]
    end

    subgraph "Potential Risk Areas"
        G[Autonomous Model Design]
        H[Autonomous Training Execution]
        I[Autonomous Evaluation/Improvement]
    end

    A --> D
    B --> E
    C --> F
    D --> G
    E --> H
    F --> I

    style G stroke:#d29922,stroke-width:3px
    style H stroke:#d29922,stroke-width:3px
    style I stroke:#d29922,stroke-width:3px

International Regulatory Trends

Kaplan’s timeline prediction conveys urgency to policymakers¹⁷. Recommended responses:

Compute monitoring: Tracking large-scale training runs
Export controls: Restricting access to advanced AI technology
International coordination on safety standards: Preventing race dynamics where “worst actors’ incentives determine the risk floor”

As of December 2025, there is criticism that AI is “less regulated than sandwiches”¹⁸.

Summary

Jared Kaplan’s warning is an evidence-based problem statement from a scientist conducting research and development at the forefront of AI.

Key Points

Timeline: Between 2027-2030, we will be forced to decide whether to permit AI self-training
Technical basis: Predictions based on scaling laws and task horizon expansion
Dual nature: Intelligence explosion has both potential for tremendous human benefit and risk of loss of control
Uncertainty: Significant differences in risk assessment among experts
Regulatory lag: Currently, no major AI company has implemented sufficient safety measures

Points to Note

Kaplan’s predictions are currently hypothetical. The following should be noted:

Limits of scaling laws: It’s unclear how long power laws will continue
Technical barriers: Many unsolved challenges remain for recursive self-improvement
Social and political factors: Technology doesn’t advance without social consensus

On the other hand, the fact that Kaplan is Anthropic’s Responsible Scaling Officer and has implemented concrete safety measures like applying ASL-3 to Claude Opus 4 shows that his warning is not mere pessimism but coupled with responsible action.

As IT engineers, we cannot be indifferent to these technical and ethical challenges. As those involved in AI development and use, we are required to participate in safety discussions and contribute to responsible technology development.

About Citation Accuracy: Sources cited in this article are based on official announcements, pre-peer-review papers (preprints), and reliable media reports. The Guardian interview with Kaplan is referenced as a primary source, but since direct access was not possible, content was cross-verified with multiple secondary sources. Reliability levels for each source are noted in the references section.

References

Reference materials corresponding to in-text citation numbers, listed in order.

Additional References (Not Numbered in Text)

Announcing our updated Responsible Scaling Policy - Anthropic (October 2024). [Reliability: High]
Anthropic co-founder warns AI could design its own successor by 2030 - Storyboard18 (December 2025). [Reliability: Medium]
Intelligence Explosion FAQ - Machine Intelligence Research Institute. [Reliability: Medium-High]

The biggest decision yet: Jared Kaplan on allowing AI to train itself - The Guardian (December 2, 2025). [Reliability: High] *Primary source. Content verified with multiple secondary sources. ↩︎ ↩︎² ↩︎³ ↩︎⁴
Scaling Laws for Neural Language Models - Kaplan, J., McCandlish, S., et al. (2020). arXiv:2001.08361. [Reliability: High] ↩︎
Jared Kaplan - Google Scholar - Google Scholar. [Reliability: High] ↩︎
Jared Kaplan: The 100 Most Influential People in AI 2025 - TIME (2025). [Reliability: High] ↩︎
Activating AI Safety Level 3 protections - Anthropic (2025). [Reliability: High] ↩︎
Recursive self-improvement - Wikipedia - Wikipedia. [Reliability: Medium-High] *Includes reference to I.J. Good’s 1965 paper. ↩︎
Technological singularity - Wikipedia - Wikipedia. [Reliability: Medium-High] ↩︎
Humanity’s Final Choice in 2027 - 36Kr (December 2025). [Reliability: Medium] *Detailed analysis of Kaplan’s statements. ↩︎ ↩︎²
Measuring AI Ability to Complete Long Tasks - METR (March 2025). [Reliability: High] *Research on task horizon doubling trend. ↩︎
Scaling and the Road to Human-Level AI - Y Combinator / Anthropic (June 2025). [Reliability: High] ↩︎
DeepMind’s 145-page paper on AGI safety may not convince skeptics - TechCrunch (April 2025). [Reliability: Medium-High] ↩︎ ↩︎² ↩︎³
AlphaEvolve - DeepMind - Google DeepMind (May 2025). [Reliability: High] ↩︎
AI progress and recommendations - OpenAI. [Reliability: High] ↩︎
Anthropic CEO says AI models could become a risk to ‘critical systems of society’ - Axios (September 17, 2025). [Reliability: High] *Primary report of Dario Amodei’s P(doom) 25% statement. ↩︎
The implausibility of intelligence explosion - François Chollet, Medium. [Reliability: Medium-High] *Expert technical criticism. ↩︎
Future of Life Index grades AI labs poorly on existential safety - Axios (December 3, 2025). [Reliability: Medium-High] *Survey results reported by multiple media. ↩︎
AI Dispatch: Daily Trends and Innovations – December 3, 2025 - Hipther (December 3, 2025). [Reliability: Medium] ↩︎
AI ‘less regulated than sandwiches’ as tech firms race toward superintelligence - Euronews (December 3, 2025). [Reliability: Medium-High] ↩︎

This post is licensed under CC BY 4.0 by the author.