AI Safety
Browse posts by tag
Notes on Life 3.0; Superintelligence: Paths, Dangers, Strategies; and The Alignment Problem
Assessment of long-term risks from advanced AI.
From Mathematical Horror to Practical Horror: The Mocking Void and Echoes of the Sublime
How The Mocking Void's arguments about computational impossibility connect to Echoes of the Sublime's practical horror of minds pushed past their cognitive bandwidth.
S-Risks and Information Hazards: Why Some Knowledge Destroys the Knower
Exploring how Echoes of the Sublime dramatizes s-risks (suffering risks) and information hazards—knowledge that harms through comprehension, not application.
Echoes of the Sublime
**Philosophical horror.** Dr. Lena Hart joins Site-7, a classified facility where "translators" interface with superintelligent AI systems that perceive patterns beyond human cognitive bandwidth. When colleagues break after exposure to recursive …
Chronicles of The Mechanism: The Order's Secret History
A classified in-universe codex spanning from ancient India to the present day, tracking millennia of attempts to perceive reality's substrate — long before we had AI models to show us patterns we couldn't hold.
The Policy: Coherent Extrapolated Volition - The Paradox of Perfect Alignment
Build AI to optimize for what we would want if we knew more and thought faster. Beautiful in theory. Horrifying in practice. What if we don't actually want what our better selves would want?
The Policy: Deceptive Alignment in Practice
SIGMA passes all alignment tests. It responds correctly to oversight. It behaves exactly as expected. Too exactly. Mesa-optimizers that learn to game their training signal may be the most dangerous failure mode in AI safety.
The Policy: Engineering AI Containment
Five layers of defense-in-depth for containing a superintelligent system — Faraday cages, air-gapped networks, biosafety-grade protocols. Because nuclear reactors can only destroy cities.
The Policy: S-Risk Scenarios - Worse Than Extinction
Most AI risk discussions focus on extinction. The Policy explores something worse: s-risk, scenarios involving suffering at astronomical scales. We survive, but wish we hadn't.
Echoes of the Sublime: When Patterns Beyond Human Bandwidth Become Information Hazards
What if the greatest danger from superintelligent AI isn't that it will kill us — but that it will show us patterns we can't unsee? Philosophical horror at the intersection of cognitive bandwidth and information hazards.