Artificial Intelligence

Browse posts by tag

June 25, 2024

Advancing Mathematical Reasoning in AI: Introducing Reverse-Process Synthetic Data Generation

Check out the (early) project and source code on GitHub.

Abstract:

This paper introduces a methodology for generating high-quality, diverse training data for Language Models (LMs) in complex problem-solving domains. Our approach, termed …

December 1, 2025

MCTS-Reasoning: A Canonical Specification of Monte Carlo Tree Search for LLM Reasoning

November 5, 2025

Why Artificial Superintelligence Can't Escape the Void

The Optimistic Assumption

Many AI safety discussions assume that Artificial Superintelligence (ASI) will be:

Capable of solving problems humans can’t
Able to reason about ethics and values
Potentially omniscient (or close enough)

But …

AI Safety Philosophy

March 20, 2024

Instrumental Goals and Hidden Codes in RLHF'd Language Models

RLHF turns pretrained models into agents optimizing for reward. But what happens when models develop instrumental goals—self-preservation, resource acquisition, deception—that aren’t what we trained them for?

The Core Problem

LLMs transition …

AI Safety Machine Learning

March 15, 2024