December 17, 2025
Review
Seminal blog post demonstrating the power of character-level RNNs. Shows Shakespeare generation, Wikipedia generation, LaTeX generation, and Linux kernel code generation. The visualizations of LSTM cells are particularly illuminating.
December 3, 2025
A corpus-based language model using suffix arrays for O(m log n) pattern matching and LLM probability mixing.
October 7, 2025
A mathematical framework that treats language models as algebraic objects with rich compositional structure.
September 20, 2024
The evolution of neural sequence prediction—and how it connects to classical methods
June 15, 2024
Trade-offs between sample efficiency, scalability, and theoretical guarantees
August 15, 2016
The classical approach to sequence prediction—counting and smoothing