Projects

Open source repositories spanning machine learning, cryptography, statistical computing, and software engineering

All Projects
Filter and Search Projects
Category
Primary Language

Unix-philosophy image manipulation CLI with lazy evaluation, JSON piping, and multi-image composition

A self-describing, TOML-backed personal metadata store. Unix-philosophy CLI for making personal details instantly available to coding agents and scripts.

Unified terminal graphics library with multiple renderers (braille, quadrants, sextants, ASCII, sixel, kitty)

aptus

Latin: fitted, adapted

REST API and Python client for remote LLM fine-tuning. Run the server on your GPU machine, submit training jobs from anywhere. …

A unified framework for sequential decision-making: from classical search to deep RL. All methods are approximations of expectimax with different representation …

Posts influenced by SICP—on abstraction, composition, and computation as a medium for expressing ideas

Photo Toolkit - CLI for managing personal photo libraries with AI-powered organization, SHA256 deduplication, and semantic search

dual

Forward-mode automatic differentiation via dual numbers for C++20.

Overview

Dual numbers are a simple yet powerful technique for computing exact …

clerk

A thin CLI for LLM agents to interact with email via IMAP/SMTP.

License: MIT Python 3.11+

Philosophy

Clerk is intentionally dumb. It’s a bridge, not a brain. …

Mail Toolkit - Personal email archive management with semantic search, relationship mapping, and privacy controls

A CLI-first, plaintext-native toolkit for capturing and organizing ideas, plans, tasks, and notes. Designed for the LLM era.

The Call of Asheron: An epic fantasy novel exploring forced migration, consciousness, and transformation through four protagonists on an alien world where …

A header-only C++20 library for zero-copy, prefix-free data representations with algebraic types and succinct data structures

Topology-aware RAG using complex network analysis. Features community detection, hub/bridge identification, and a YAML DSL for configuring field embeddings and …

Instrumental Goals and Latent Codes in RL-Fine-Tuned Language Models

A comprehensive theoretical and empirical analysis of mesa-optimization risks, deceptive …

NFA Tools: Regular Languages and Finite Automata

An elegant, pedagogical implementation of finite automata with NFA to DFA conversion, regex parsing, and …

Computational Basis Transforms - A header-only C++17 library for transformations between computational domains

Seqwise - Sequential Image Analysis with Vision Language Models

A simple, cost-free approach to analyzing sequences of images using local Vision Language Models …

The Dot Ecosystem

“What started as a single, humble function evolved into a complete, coherent ecosystem for manipulating data structures—a journey in …

Fisher Flow: A unified information-geometric framework for sequential inference revealing how modern optimizers (Adam, Natural Gradient, K-FAC, EWC) emerge as …

Cognitive MRI of AI Conversations: Network analysis of ChatGPT conversation logs using semantic embeddings to reveal knowledge topology, community structure, …

Reverse-Process Synthetic Data Generation: Automatically Generating Training Language Models for Complex Problem Solving

Abstract:

This paper introduces a …

ZeroIPC - Active Computational Substrate for Shared Memory

Overview

ZeroIPC transforms shared memory from passive storage into an active computational …

LangCalc: A Calculus for Language Models

An elegant mathematical framework for composing language models through algebraic operations, featuring efficient …

Tree Rewriter

A minimal term rewriting system. 15 lines of code. Infinite possibilities.

The Insight

What if we could express computational rules as simple …

A powerful, immutable-by-default tree manipulation library for Python with functional programming patterns, composable transformations, and advanced pattern …

A consistent API for hypothesis testing in R. Provides generic methods for p-values, test statistics, degrees of freedom, and significance testing. Includes LRT …


marp: true #theme: uncover math: mathjax

SLUUG Talk: Large Language Models

This repository contains the slides and code for the talk:

  • Demystifying Large …

Anonymous batch job execution system with Linux namespace/seccomp sandboxing, resource limits, and WebSocket streaming

ChatGPT chat search

This was the first python app I developed in quite some time. I wanted to host ChatGPT logs, experiment with heroku, and see how easy it …

R package for specifying and using likelihood models for statistical inference. Provides a flexible framework for independent likelihood contributions across …

Seeing how easy it is to convert an old project on Google App Engine to a modern framework with the help of ChatGPT

R package: md.tools

A miscellaneous set of tools for working with masked data and common features of masked data. The tool set takes inspiration from …

mdrelax

Relaxed Candidate Set Models for Masked Data in Series Systems

Overview

This R package implements likelihood-based inference for series systems with …

Apertures

A minimal Lisp-like language where “holes” (written ?x or ?ns.x) represent unknown values that can be filled later. This enables pausable, …

Alga

A mathematically elegant C++20 library for algebraic text processing and compositional parsing with fuzzy matching. Built on rigorous algebraic foundations …

Composable calculus expressions for C++20: symbolic differentiation, numerical integration, and algebraic composition

Closed-form MLE and Fisher information for exponential series systems with masked failure data. Includes theoretical results, proofs, and numerical validation.

Space-efficient approximate mappings using perfect hash functions. Supports arbitrary function approximation (X→Y) with configurable storage (8/16/32/64-bit) …

Bernoulli Data Type

A general framework for understanding and constructing probabilistic data structures with controlled error rates. This framework can also …