Projects
Open source repositories spanning machine learning, cryptography, statistical computing, and software engineering
Featured Projects
Highlighted work representing significant research and development contributions
memex
chartfold
longecho
clerk
Crier - Cross-Post Your Content Everywhere
CTK - Conversation Toolkit
Repoindex - Repository Management CLI
BTK - Bookmark Toolkit
Design vision for a next-generation 3D emergent physics simulation platform targeting 10M+ particles — mesoscale force models, Particle Mesh gravity, spring …
kofn
Component lifetime estimation from k-out-of-n system data — MLE for exponential and Weibull components under multiple observation schemes
Composition of Bernoulli sets — closed-form error rates for set-theoretic operations (complement, union, intersection, difference), monoidal structure, …
The Perfect Hash Filter: an efficient implementation of the positive Bernoulli set and Bernoulli map abstract data types
The Bernoulli Hash Function: Optimal Bernoulli Sets and Bernoulli Maps
sigmark
GPG signing for static site markdown content
Declarative language for constructing complex networks from structured data
bernoulli_sets
Bernoulli sets: a compositional algebra for random approximate sets with quantifiable false positive/negative error rates
Trapdoor computing: a paradigm for computing with values hidden behind one-way trapdoors. Cipher maps, Bernoulli approximations, and composable encrypted …
Information Recovery under Deterministic Masking in Exponential Series Systems — short communication
claude-anvil
A forge for Claude Code plugins — academic research, fiction worldbuilding, publication workflows, and more
Claude Code plugin for deets — personal metadata queries (identity, profiles, cross-platform disambiguation)
Claude Code plugin for academic research paper lifecycle management — thesis refinement, literature surveys, experimental design, editorial review, and …
mf
CLI toolkit for synchronizing external sources to a Hugo static site
Unified publication pipeline: R packages (CRAN/JOSS/JSS), Python packages (PyPI), and books (Amazon KDP)
Masked-cause likelihood models for series systems with arbitrary hazard components in R
Companion research directions for masked series systems likelihood framework
Masked Causes of Failure in Series Systems: A Likelihood Framework
narro
Narro: lightweight CPU text-to-speech
Series system distributions from flexible hazard rate components in R
minds-and-machines
AI alignment, moral agency, superintelligence, and the futures we might build
sequential-prediction
Research framework for sequential prediction algorithms, starting with Context Tree Weighting (CTW)
cryptoid
Client-side encrypted content for Hugo static sites with multi-user access control
pagevault
Password-protect semi-private content on static sites. Ideal for Hugo blogs, GitHub Pages, and static hosting.
posthumous
Federated deadman switch with TOTP authentication, post-trigger scheduling, and peer-to-peer state sync
R-universe package registry for queelius
curations
Multi-collection curated awesome-lists powered by curalist
chop
Unix-philosophy image manipulation CLI with lazy evaluation, JSON piping, and multi-image composition
deets
A self-describing, TOML-backed personal metadata store. Unix-philosophy CLI for making personal details instantly available to coding agents and scripts.
R package for automatic differentiation and gradient computation.
curalist
Multi-collection curated awesome-list manager
texwatch
Live-reloading TeX editor with PDF viewer in the browser
algebraic-cipher-types
Functorial framework for secure computation through homomorphic operations on encrypted algebraic structures
dapple
Unified terminal graphics library with multiple renderers (braille, quadrants, sextants, ASCII, sixel, kitty)
eidola
Generate a conversable persona from personal data: conversations, writings, emails, bookmarks, photos, reading notes
agentum
A unified framework for sequential decision-making: from classical search to deep RL. All methods are approximations of expectimax with different representation …
the-learning-problem
Essays on induction, inference, and the search for useful representations
sicp
Posts influenced by SICP—on abstraction, composition, and computation as a medium for expressing ideas
the-long-echo
Essays on digital legacy, graceful degradation, and designing systems that outlast their creators
ptk
Photo Toolkit - CLI for managing personal photo libraries with AI-powered organization, SHA256 deduplication, and semantic search
Paper: Preventing Ransomware Damages using In-Operation Off-Site Backup to Achieve a 10^-8 False-Negative Miss-Detection Rate (IEEE ICCI 2025)
mtk
Mail Toolkit - Personal email archive management with semantic search, relationship mapping, and privacy controls
jot
A CLI-first, plaintext-native toolkit for capturing and organizing ideas, plans, tasks, and notes. Designed for the LLM era.
The Call of Asheron: An epic fantasy novel exploring forced migration, consciousness, and transformation through four protagonists on an alien world where …
Echoes of the Sublime - A philosophical horror novel exploring AI safety, consciousness, and cognitive bandwidth limits (~103k words)
the-policy
The Policy - A literary SF novel exploring AI alignment, consciousness, and emergence
Symbolic likelihood models in Python. Build, compose, and analyze likelihood functions with automatic differentiation and symbolic manipulation.
A pattern matching and term rewriting library for Python. Define rewrite rules with intuitive DSL syntax and apply them to transform symbolic expressions.
pfc
A header-only C++20 library for zero-copy, prefix-free data representations with algebraic types and succinct data structures
Immutable graph library with 56+ algorithms, transformers, selectors, and lazy views.
Infinigram
Variable-length n-gram language models using suffix arrays.
High-performance N-dimensional sparse spatial hashing library for C++20
latent-codes-llm
Instrumental Goals and Latent Codes in RL-Fine-Tuned Language Models A comprehensive theoretical and empirical analysis of mesa-optimization risks, deceptive …
crypto-perf-hash
Theoretical analysis of cryptographic perfect hash functions with optimal space complexity
tex2html
Convert LaTeX documents to HTML using LaTeXML with theme and component support
llm-priors
LLMs as Intelligent Priors: Enhancing Classical Algorithms Through Learned Initialization
A virtual POSIX filesystem with content-addressable DAG structure. Features immutable nodes, Git-style hashing, functional transformations, and an embedded …
cbt
Computational Basis Transforms - A header-only C++17 library for transformations between computational domains
mcts-reasoning
Monte Carlo Tree Search for LLM-based reasoning with fluent API and advanced sampling strategies
seqwise
Seqwise - Sequential Image Analysis with Vision Language Models A simple, cost-free approach to analyzing sequences of images using local Vision Language Models …
dotsuite
The Dot Ecosystem “What started as a single, humble function evolved into a complete, coherent ecosystem for manipulating data structures—a journey in API …
A network-native functional language.
JSONL Algebra
A powerful relational algebra CLI and library for JSONL data manipulation.
ransomware-policy
Ransomware Detection using LLMs Research project exploring ransomware detection using Large Language Models, with the Active Detective Agent as the primary …
Research compendium: Cognitive MRI of AI Conversations. Conference paper (Complex Networks 2025, Springer) and journal extension (PLOS Complex Systems).
chatgpt-complex-net
Cognitive MRI of AI Conversations: Network analysis of ChatGPT conversation logs using semantic embeddings to reveal knowledge topology, community structure, …
dreamlog
Logic programming with LLM integration and wake-sleep learning cycles
src2md
Convert source code to structured, context-optimized markdown for LLMs with intelligent summarization.
EBK - eBook Toolkit
CLI tool for managing ebooks with semantic search, virtual libraries, annotations, and multi-format export. Part of the Long Echo toolkit for personal data …
JAF - Just Another Flow
A streaming data processing system for JSON with lazy evaluation, composable operations, and a fluent API.
fuzzy-infer
Unix-composable fuzzy logic inference with elegant Pythonic API
XTK - Expression Toolkit
A powerful symbolic expression toolkit for rule-based term rewriting.
space-sandbox-sim
RPSDG
Reverse-Process Synthetic Data Generation: Automatically Generating Training Language Models for Complex Problem Solving Abstract: This paper introduces a …
zeroipc
ZeroIPC - High-Performance Shared Memory IPC Zero-copy data sharing between processes in C++, Python, Go, and C. No serialization, no bindings — parallel native …
beautiful-deception
How 256 bits pretend to be infinity: A pedagogical exploration of random oracles and computational randomness
ollama_data_tools
Archived. This was an early experiment with Ollama tooling that didn’t pan out. The code is preserved for reference but is no longer maintained or …
AlgoTree
A powerful, immutable-by-default tree manipulation library for Python with functional programming patterns, composable transformations, and advanced pattern …
A consistent API for hypothesis testing in R. Provides generic methods for p-values, test statistics, degrees of freedom, and significance testing. Includes LRT …
sluug-talk-llm
marp: true #theme: uncover math: mathjax SLUUG Talk: Large Language Models This repository contains the slides and code for the talk: Demystifying Large …
elasticsearch-lm
ElasticSearch Query Fine-Tuning Training Data for Large Language Models
Scalable lock based on 2-thread Peterson lock.
When does model simplification matter? Consequence analysis and adaptive model selection for Weibull series systems with masked failure data.
A modern C++ header-only library implementing Disjoint Interval Sets as a complete Boolean algebra. Features elegant API, compile-time intervals, and …
[Archived] R package for Weibull series system estimation from masked failure data. Companion code for master's project (SIUE, 2023). Superseded by …
[Archived] Master's project (SIUE, 2023): MLE for series system reliability with Weibull components under right-censoring and masked failure data. See …
Flexible hazard rate distributions for survival analysis and reliability engineering in R
Likelihood model for series systems with masked component cause of failure and other censoring mechanisms
chatgpt-pysearch
ChatGPT chat search This was the first python app I developed in quite some time. I wanted to host ChatGPT logs, experiment with heroku, and see how easy it …
Likelihood model framework
R package for maximum likelihood estimation with compositional data.
Seeing how easy it is to convert an old project on Google App Engine to a modern framework with the help of ChatGPT
Like micrograd, but worse.
stepanov
Pedagogical blog posts on generic programming in C++, inspired by Alex Stepanov
R package: Algebra over distributions (random elements) with automatic simplification to closed forms
[Archived] Masked data tools for R. Superseded by internalized utilities in likelihood.model.series.md and dfr.lik.series.md.
algebraic.mle
Algebraic maximum likelihood estimators
mdrelax
mdrelax Relaxed Candidate Set Models for Masked Data in Series Systems Overview This R package implements likelihood-based inference for series systems with …
aperture
Apertures A minimal Lisp-like language where “holes” (written ?x or ?ns.x) represent unknown values that can be filled later. This enables pausable, …
Time series analysis of a confidentiality measure for an Encrypted search system We derive a confidentiality measure against an adversary deploying a …
Algebraic cipher types
Master's thesis on encrypted search: enabling standard IR on encrypted collections. Published via ProQuest (2014). Part of the oblivious-computing research …
Encrypted Search with Oblivious Bernoulli Types: Information-Theoretic Privacy through Controlled Approximation
Probabilistic framework for quantifying confidentiality of encrypted search systems using bootstrap methods and entropy analysis
Research code and data for the IEEE CloudCom 2016 paper on estimating confidentiality risks in encrypted search systems. The Moving Average Bootstrap (MAB) …
cipher_maps
A unified theoretical framework for oblivious function approximation through algebraic structures and Bernoulli models
Closed-form MLE and Fisher information for exponential series systems with masked failure data. Includes theoretical results, proofs, and numerical validation.
Modern C++20 header-only library for algebraic hash function composition with elegant DSL
maph
Space-efficient approximate mappings using perfect hash functions. Supports arbitrary function approximation (X→Y) with configurable storage (8/16/32/64-bit) …
Privacy-preserving set operations using cryptographic trapdoor functions. Minimal Python library implementing Bernoulli types framework with explicit error …
bernoulli_data_type
Bernoulli Data Type A general framework for understanding and constructing probabilistic data structures with controlled error rates. This framework can also …