December 17, 2025
Value Alignment
Browse posts by tag
November 4, 2025
The Policy: Coherent Extrapolated Volition - The Paradox of Perfect Alignment
Build AI to optimize for what we would want if we knew more and thought faster. Beautiful in theory. Horrifying in practice. What if we don't actually want what our better selves would want?