December 17, 2025
Review
Seminal blog post demonstrating the power of character-level RNNs. Shows Shakespeare generation, Wikipedia generation, LaTeX generation, and Linux kernel code generation. The visualizations of LSTM cells are particularly illuminating.
December 3, 2025
A corpus-based language model using suffix arrays for O(m log n) pattern matching. The corpus is the model.
October 7, 2025
A mathematical framework that treats language models as algebraic objects with compositional structure.
September 20, 2024
The evolution of neural sequence prediction, and how it connects to classical methods
June 15, 2024
The bias-data trade-off in sequential prediction: when to use CTW, n-grams, or neural language models.
August 15, 2016
The classical approach to sequence prediction: counting and smoothing