Fisher scoring optimizer
fisher_scoring.RdSimilar to Newton-Raphson but uses expected Fisher information (negative expected Hessian) instead of observed Hessian. More stable for some problems.
Details
For regular exponential families, Fisher scoring is equivalent to Newton-Raphson since observed = expected information.
Fisher scoring: θ_n+1 = θ_n + I(θ_n)⁻¹ S(θ_n) where I = -EH (Fisher information) and S = gradient (score).
This implementation uses the observed Hessian as an approximation to the expected Hessian, making it identical to Newton-Raphson. For a true Fisher scoring implementation, one would need to compute EH analytically or via Monte Carlo.