December 3, 2025

Infinigram: Corpus-Based Language Modeling via Suffix Arrays with LLM Probability Mixing