Discussion & Related
DreamLog: Logic Programming That Dreams to Improve Itself
A logic programming system that alternates between wake and sleep phases—using LLMs for knowledge generation during wake, and compression-based learning during sleep.
October 8, 2025 · 3 min read
The Policy: Q-Learning vs Policy Learning
SIGMA uses Q-learning rather than direct policy learning. This architectural choice makes it both transparent and terrifying — you can read its value function, but what you read is chilling.
November 4, 2025 · 9 min read