Reverse-Process Synthetic Data Generation for Math Reasoning
Training LLMs on mathematical reasoning by inverting easy-to-solve problems: generate derivatives, reverse them into integration exercises with full step-by-step solutions.
What if reasoning traces could learn their own usefulness? A simple RL framing for trace memory, and why one reward signal is enough.
The classical AI curriculum teaches rational agents as utility maximizers. The progression from search to RL to LLMs is really about one thing: finding representations that make decision-making tractable.
Why the simplest forms of learning are incomputable, and what that means for the intelligence we can build.
Modern graduate ML text with causal inference, decision making, and ML foundations. Accessible free textbook with strong conceptual framing.
SIGMA uses Q-learning rather than direct policy learning. This architectural choice makes it both transparent and terrifying. You can read its value function, but what you read is chilling.
A logic programming system that alternates between wake and sleep phases, using LLMs for knowledge generation during wake and compression-based learning during sleep.
Learning fuzzy membership functions and inference rules automatically through gradient descent on soft circuits, instead of hand-crafting them.
Three approaches to computing derivatives: forward-mode AD, reverse-mode AD, and finite differences, each with different trade-offs for numerical computing and machine learning.
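One of those trade-offs can be seen in a few lines: finite differences are easy to write but fight a tug-of-war between truncation error (shrinks with the step size h) and floating-point round-off (grows as h shrinks). A minimal numerical sketch, not code from the post:

```python
import math

def central_diff(f, x, h):
    """Central finite difference: O(h^2) truncation error,
    but round-off error grows roughly as machine_eps / h."""
    return (f(x + h) - f(x - h)) / (2 * h)

exact = math.cos(1.0)  # d/dx sin(x) at x = 1
for h in (1e-1, 1e-5, 1e-12):
    approx = central_diff(math.sin, 1.0, h)
    print(f"h={h:.0e}  error={abs(approx - exact):.2e}")
```

The error bottoms out around h ≈ 1e-5 and then rises again as h shrinks further, which is exactly the failure mode automatic differentiation avoids: AD computes derivatives to machine precision with no step size to tune.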
Science is search through hypothesis space. Intelligence prunes; testing provides signal. Synthetic worlds could accelerate the loop.
Applying Monte Carlo Tree Search to large language model reasoning, with a formal specification of the algorithm.
Using GMM clustering to improve retrieval in topically diverse knowledge bases
What if LLMs could remember their own successful reasoning? A simple experiment in trace retrieval, and why 'latent' is the right word.
Solomonoff induction, MDL, speed priors, and neural networks are all special cases of one Bayesian framework with four knobs.
Gradient descent in Euclidean space ignores the geometry of probability distributions. Natural gradient descent uses the Fisher information metric instead. Fisher Flow makes this continuous.
A tiny autodiff library for understanding how backpropagation actually works.
The AI course this semester keeps hammering one idea: intelligence is utility maximization under uncertainty. A* search, reinforcement learning, Bayesian networks, MDPs. One principle connects all of it.
Abstractions let us reason about complex systems despite our cognitive limits. But some systems resist compression entirely.
How the limited capacity of human working memory acts as regularization, shaping our reasoning and possibly preventing cognitive overfitting.
Reverse-mode automatic differentiation is just the chain rule applied systematically. I built one in C++20 to understand what PyTorch and JAX are actually doing.
I finally tried ChatGPT after weeks of ignoring it. My reaction was not surprise. It was recognition. The Solomonoff connection, language models as compression, prediction as intelligence. The pieces were all there.
Dual numbers extend the reals with an infinitesimal epsilon where epsilon^2 = 0. Evaluate f(x + epsilon) and you get f(x) + f'(x)*epsilon. The derivative falls out of the algebra.
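The dual-number trick can be sketched in a few lines of Python (a hypothetical minimal implementation for illustration, not the post's actual code): carry the eps-coefficient alongside the real part, and the product rule emerges from eps^2 = 0.

```python
class Dual:
    """A dual number a + b*eps, with eps^2 = 0."""
    def __init__(self, real, eps=0.0):
        self.real, self.eps = real, eps

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.real + other.real, self.eps + other.eps)

    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # (a + b*eps)(c + d*eps) = ac + (ad + bc)*eps, since eps^2 = 0
        return Dual(self.real * other.real,
                    self.real * other.eps + self.eps * other.real)

    __rmul__ = __mul__

def f(x):
    return 3 * x * x + 2 * x   # f'(x) = 6x + 2

y = f(Dual(4.0, 1.0))          # seed the eps-coefficient with 1 at x = 4
print(y.real, y.eps)           # 56.0 26.0  -> f(4) and f'(4)
```

Seeding the eps-coefficient with 1.0 makes the algebra track d/dx automatically: no symbolic manipulation, no step size, just overloaded arithmetic. This is forward-mode AD in its simplest form.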
The problem of predicting what comes next, from compression to language models