Statistics

Browse posts by tag

February 18, 2026

Masked Failure Data: Looking Back, Looking Forward

A retrospective on three years of building R packages and writing papers for masked series system reliability, and what comes next.

Statistics Software Development

February 13, 2026

Observation Functors: Composable Censoring for Series System Simulation

Observation functors in maskedcauses: composable functions that separate the data-generating process from the observation mechanism, enabling mixed-censoring simulation and verified Monte Carlo studies.

Statistics Software Development

February 5, 2026

maskedcauses: Maximum Likelihood Estimation for Masked Series System Failures

The maskedcauses R package for MLE in series systems with masked component failures, built on composable likelihood contributions and validated through simulation.

Statistics Software Development

December 17, 2025

All of Statistics

December 17, 2025

Bayesian Data Analysis

Notes

Applied Bayesian inference with computing methods. The standard Bayesian statistics reference.

December 17, 2025

compositional.mle: SICP-Inspired Optimization

An R package where optimization solvers are first-class functions that compose through chaining, racing, and restarts.

programming statistics projects

December 17, 2025

Introduction to Probability and Mathematical Statistics

Notes

Rigorous graduate-level probability + statistics; useful for inference and ML foundations.

December 17, 2025

Statistical Inference

Notes

Standard rigorous text on estimation, hypothesis testing, and asymptotic theory.

December 17, 2025

The Elements of Statistical Learning

December 16, 2025

Graduate Statistics Problem Sets

My graduate coursework from SIUe's math program is up: time series, regression, computational stats, multivariate analysis, and statistical methods.

Personal Statistics

December 16, 2025

symlik: Symbolic Likelihood Models in Python

Define statistical models symbolically and automatically derive score functions, Hessians, and Fisher information. No numerical approximation.

Projects Statistics

December 12, 2025

hypothesize: Now on CRAN

My R package for hypothesis testing, hypothesize, is now available on CRAN.

R Publication

December 3, 2025

Likelihood Models for Series Systems with Masked Component Failure Data: An R Package for Maximum Likelihood Estimation

December 3, 2025

mdrelax: When Masking Conditions Don't Hold

Extending masked failure data analysis when the standard C1-C2-C3 masking conditions are violated.

Statistics Reliability

December 3, 2025

Model Selection for Weibull Series Systems: When Simpler Models Suffice

When can reliability engineers safely use simpler models? Likelihood ratio tests on Weibull series systems give sharp boundaries.

Statistics Reliability

December 2, 2025

Closed-Form Results for Masked Exponential Series Systems

Closed-form MLEs and Fisher information for exponential series systems with masked failure data. No numerical optimization required.

Statistics Mathematics

December 2, 2025

Statistical Inference for Series Systems from Masked Failure Time Data: The Exponential Case

October 7, 2025

Alea: A Modern C++ Library for Algebraic Random Elements

June 20, 2024

Fisher Flow: An Information-Geometric Framework for Sequential Estimation

June 15, 2024

Reliability Estimation in Series Systems: Maximum Likelihood Techniques for Right-Censored and Masked Failure Data

Maximum likelihood estimation of component reliability from masked failure data in series systems, with BCa bootstrap confidence intervals validated through extensive simulation studies.

March 1, 2024

Accumux: Compositional Online Statistical Reductions in C++

A C++20 library for composing online statistical accumulators with numerically stable algorithms and algebraic composition.

computer-science programming

February 19, 2024

Approximations of Solomonoff Induction

I experiment with simple predictive / generative models to approximate Solomonoff induction for a relatively simple synthetic data-generating process.

statistics machine learning inference

October 18, 2023

Math Master's Done: Post-Mortem

I defended my mathematics thesis. Three years, stage 3 cancer, and a second master's degree. Here is what worked and what did not.

October 7, 2023

Model Selection for Reliability Estimation in Series Systems

August 9, 2023

Reliability Estimation in Series Systems Maximum Likelihood Techniques for Right-Censored and Masked Failure Data

March 31, 2023

Problem Set Solutions

Graduate problem set solutions in computational statistics and numerical methods from my math master's at SIUe. Implementing things from scratch teaches you what the libraries are hiding.

March 29, 2023

Model Selection in Weibull Series Systems

In my paper, Reliability Estimation in Series Systems, I discarded a lot of research that may be interesting to pursue further. This one is about using homogeneous shape parameters for the Weibull series system, which can greatly simplify the …

statistics data-science inference likelihood-models

February 5, 2023

Numerical Methods for Maximum Likelihood Estimation

Numerical approaches to maximum likelihood estimation, covering the optimization methods and computational issues that come up in practice.

June 30, 2022

likelihood.model: Composable Likelihood Models in R

A generic R framework for composable likelihood models. Likelihoods are first-class objects that compose through independent contributions.

Statistics Software Development

April 18, 2022

Weibull Distributions: From Reliability Theory to My Own Survival Curve

Weibull distributions model time-to-failure in reliability engineering and cancer survival. I study both professionally. One of them became personal.

March 25, 2022

hypothesize: A Consistent Interface for Statistical Tests

An R package that gives hypothesis tests a consistent interface. Every test returns the same structure. You can write generic code that works across all of them.

Statistics Software Development

December 1, 2021

STAT 581 - Statistical Methods - SIUe - Fall 2021

Problem sets for STAT 581 - Statistical Methods at SIUe, taught by Dr. Neath during Fall 2021.

December 1, 2021

Statistical Methods - STAT 581 - Exam 1

An experiment is conducted to study the effect of fitness level on ego > strength. Random samples of college faculty members are selected from each

December 1, 2021

Statistical Methods - STAT 581 - Exam 2

A randomized complete block design is used to study the effect of caliper on the measured diameters

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 1

An experiment is designed to investigate whether the time to drill holes in rock holes using wet or dry drilling.

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 2

A product developer is investigating the tensile strength of a new synthetic fiber that will be used to make cloth for men’s shirts.

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 3 a

An experiment is conducted to study the effect of drilling method on drilling time. Each method (dry drilling, wet drilling) is used on $n = 12$ rocks.

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 3 b

An experiment to compare a new drug to a standard is in the planning stages. The response variable of interest is the clotting time (in minutes) of blood

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 4

The insulating life of protective fluids at an accelerated load is being studied. The experiment has been performed for four types of fluids, with $n = 5$

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 5

A factorial experiment is used to develop a nitride etch process on a single wafer plasma etching tool.

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 6

A soft drink bottler is interested in studying the effects on a filling process. A factorial experiment is run using three factors: percent carbonation (in %),

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 7

A paired comparisons design is used to study the effect of machine operator on > the measured running time (in secs.) of a fuse. A sample of $n = 10$ fuses is

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 8

An experiment is designed to test for systematic differences in the hardness > measurements provided by two devices (fixed effect, factor $A$).

December 1, 2021

Statistical Methods - STAT 581 - Problem Set 9

The surface finish of metal parts made on $a=4$ machines is under > investigation. > Each machine can be run by one of $b=3$ operators.

October 30, 2021

Computational Statistics - SIUe - STAT 575 - Problem Set 2

This problem set covers the E-M algorithm for right-censored normal data with known variance.

statistics R EM algorithm

October 30, 2021

Review: A Symbolic Representation of Time Series, with Implications for Streaming Algorithms

A review of SAX (Symbolic Aggregate approXimation), a method for converting real-valued time series into symbolic representations with guaranteed distance lower bounds.

statistics data science

October 30, 2021

SIUe - Computational Statistics (STAT 575) - Problem Set 4

This problem set covers sampling from a Gamma distribution using Metropolis-Hastings and acceptance-rejection methods.

September 10, 2021

Bootstrap Methods: When Theory Meets Computation

Bootstrap resampling trades mathematical complexity for computational burden. When you can't derive the variance analytically, you resample. For my thesis work on masked failure data, that trade is essential.

August 20, 2021

flexhaz: Specify the Hazard Function Directly

An R package for specifying hazard functions directly instead of picking from a catalog of named distributions. You write the hazard. It handles the rest.

Statistics Software Development

August 1, 2021

Regression Analysis - SIUe - STAT 482 - Probem Set 8

This problem set covers multicollinearity in regression analysis and the marginal and partial effects of predictor variables, among other topics.

August 1, 2021

STAT 482 - Regression Analysis - SIUe - Fall 2022

This is a problem set for STAT 482 - Regression Analysis at SIUe. These problem sets were given by Dr. Andrew Neath, a professor in the Department of Mathematics and Statistics at Southern Illinois University Edwardsville (SIUe) during the Fall 2022 …

August 1, 2021

STAT 575 - Computational Statistics - SIUe - Summer 2021

This is a problem set for STAT 575 - Computational Statistics at SIUe. These problem sets were given by Dr. Qiang Beidi, a professor in the Department of Mathematics and Statistics at Southern Illinois University Edwardsville (SIUe) during the Summer …

May 15, 2021

algebraic.mle: MLEs as Algebraic Objects

An R package that treats MLEs as algebraic objects. They carry Fisher information, compose through independent likelihoods, and propagate uncertainty correctly.

Statistics Software Development

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Exam 2

Discrete multivariate analysis exam covering log-linear models and categorical data analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Final Exam

Final exam for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 10

Problem set 10 for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 5

Problem set 5 for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 6

Problem set 6 for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 7

Problem set 7 for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 8

Problem set 8 for discrete multivariate analysis.

May 1, 2021

Discrete Multivariate Analysis - STAT 579 - Problem Set 9

Problem set 9 for discrete multivariate analysis.

May 1, 2021

STAT 478 - Time Series Analysis - SIUe - Spring 2021

Problem sets for STAT 478 - Time Series Analysis at SIUe, taught by Dr. Beidi during Spring 2021.

May 1, 2021

STAT 579 - Discrete Multivariate Analysis - SIUe - Spring 2021

Problem sets for STAT 579 - Discrete Multivariate Analysis at SIUe, taught by Dr. Andrew Neath during Spring 2021.

May 1, 2021

Time Series Analysis - STAT 478 - Exam 1

Time series analysis exam covering ARMA processes and model identification.

May 1, 2021

Time Series Analysis - STAT 478 - Exam 2

Time series analysis coursework.

May 1, 2021

Time Series Analysis - STAT 478 - Final Exam

Final exam for time series analysis course.

May 1, 2021

Time Series Analysis - STAT 478 - Problem Set 3

Problem set 3 for time series analysis.

May 1, 2021

Time Series Analysis - STAT 478 - Problem Set 4

Problem set 4 for time series analysis.

May 1, 2021

Time Series Analysis - STAT 478 - Problem Set 5

Problem set 5 for time series analysis.

May 1, 2021

Time Series Analysis - STAT 478 - Problem Set 6

Problem set 6 for time series analysis.

May 1, 2021

Time Series Analysis - STAT 478 - Project

Time series analysis project.

February 1, 2021

algebraic.dist: Distributions as Algebraic Objects in R

An R package that treats probability distributions as algebraic objects. They compose through standard operations. The algebra preserves distributional structure.

Software Development Statistics

October 1, 2020