June 25, 2024
Check out the (early) project and source code on GitHub.
Abstract:
This paper introduces a methodology for generating high-quality, diverse training data for Language Models (LMs) in complex problem-solving domains. Our approach, termed …
February 19, 2024
I am creating a tiny LLM for ElasticSearch DSL as a proof of concept.
December 3, 2025
A corpus-based language model using suffix arrays for O(m log n) pattern matching and LLM probability mixing.
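To make the O(m log n) claim concrete, here is a minimal sketch of suffix-array lookup. It is not the project's code: the function names are hypothetical, the construction is a naive sort chosen for brevity, and the LLM probability mixing is left out. Each binary search takes O(log n) steps over the n suffixes, and each step compares at most m characters of the pattern.

```python
def build_suffix_array(text: str) -> list[int]:
    # Naive construction (illustration only): sort suffix start positions
    # by the suffix they begin. Real systems use faster builders.
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    # Return the sorted start positions of every occurrence of `pattern`.
    m = len(pattern)

    def prefix(k: int) -> str:
        # First m characters of the k-th smallest suffix.
        return text[sa[k]:sa[k] + m]

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Suffixes in [start, lo) all begin with the pattern.
    return sorted(sa[start:lo])


corpus = "banana"
sa = build_suffix_array(corpus)
print(find_occurrences(corpus, sa, "ana"))
```

The number of matches is the width of the range between the two bounds, so occurrence counts come straight out of the same two searches.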
October 8, 2025
A logic programming system that alternates between wake and sleep phases—using LLMs for knowledge generation during wake, and compression-based learning during sleep.
March 20, 2024
RLHF turns pretrained models into agents optimizing for reward. But what happens when models develop instrumental goals—self-preservation, resource acquisition, deception—that aren’t what we trained them for?
The Core Problem
LLMs transition …
February 19, 2024
I experiment with simple predictive / generative models to approximate Solomonoff induction for a relatively simple synthetic data-generating process.
June 17, 2023
I’ve been thinking about the power and limitations of abstractions in our
understanding of the world. This blog post is from a chat I had with ChatGPT,
which can be found here
and here.
I’m not sure if this is a good blog post, but …
June 17, 2023
This blog post is from a chat I had with ChatGPT,
which can be found here
and here.
I’m not sure if this is a good blog post, but I’m posting it anyway. It’s remarkable
how quickly you can slap stuff like this together, and …