Phase 129: AGI Safety and Alignment

Phase 129 of the AI Encyclopedia — AGI Safety and Alignment. Topics 2561–2580.

Part of the AI Encyclopedia · Phase 129 of 130 · Topics 2561–2580

This phase covers AGI Safety and Alignment. Below are the 20 concepts grouped under this phase — each is a future article in the Insightful AI World encyclopedia.

2561 AGI Safety

2562 AGI Alignment

2563 Outer Alignment for AGI

2564 Inner Alignment for AGI

2565 Goal Misgeneralization

2566 Deceptive Alignment

2567 Instrumental Convergence

2568 Power-seeking Behavior

2569 Corrigibility

2570 Shutdown Problem

2571 Scalable Oversight

2572 Iterated Amplification

2573 Debate for Alignment

2574 Constitutional Alignment

2575 Interpretability for AGI

2576 Robustness for AGI

2577 Capability Control

2578 Deployment Governance

2579 Frontier Model Safety

2580 Existential Risk from AI

← Phase 128

All phases

Phase 130 →