Machine Learning

June 25, 2024

Advancing Mathematical Reasoning in AI: Introducing Reverse-Process Synthetic Data Generation

Check out the (early) project and source code on GitHub.

Abstract:

This paper introduces a methodology for generating high-quality, diverse training data for Language Models (LMs) in complex problem-solving domains. Our approach, termed …

February 19, 2024

Fine-Tuning Tiny LLMs for ElasticSearch DSL

I am creating a tiny LLM for ElasticSearch DSL as a proof of concept.

large language models fine-tuning information retrieval elastic search domain-specific language

December 3, 2025

Infinigram: Variable-Length N-grams via Suffix Arrays

A corpus-based language model using suffix arrays for O(m log n) pattern matching and LLM probability mixing.

language models n-gram suffix arrays NLP LLM grounding

October 8, 2025

DreamLog: Logic Programming That Dreams to Improve Itself

A logic programming system that alternates between wake and sleep phases—using LLMs for knowledge generation during wake, and compression-based learning during sleep.

logic-programming LLM neural-symbolic machine-learning compression

March 20, 2024

Instrumental Goals and Hidden Codes in RLHF'd Language Models

RLHF turns pretrained models into agents optimizing for reward. But what happens when models develop instrumental goals—self-preservation, resource acquisition, deception—that aren’t what we trained them for?

The Core Problem

LLMs transition …

artificial intelligence alignment reinforcement learning RLHF deceptive alignment

February 19, 2024

Approximations of Solomonoff Induction

I experiment with simple predictive / generative models to approximate Solomonoff induction for a relatively simple synthetic data-generating process.

large language models solomonoff induction synthetic data algorithmic data n-gram models

June 17, 2023

Uses and limits of abstractions

I’ve been thinking about the power and limitations of abstractions in our understanding of the world. This blog post is from a chat I had with ChatGPT, which can be found here and here.

I’m not sure if this is a good blog post, but …

June 17, 2023

Working memory as an inductive bias

This blog post is from a chat I had with ChatGPT, which can be found here and here.

I’m not sure if this is a good blog post, but I’m posting it anyway. It’s remarkable how quickly you can slap stuff like this together, and …

Cognitive Science Machine Learning LLM Regularization