CONTENTS

Solution: Fisher Information and the Cramér-Rao Bound for Exponential Rate

Part 1

(λ)=nlogλλxi\ell(\lambda) = n\log\lambda - \lambda\sum x_i. (λ)=n/λxi=0λ^=n/xi=1/xˉ\ell'(\lambda) = n/\lambda - \sum x_i = 0 \Rightarrow \hat\lambda = n/\sum x_i = 1/\bar x. ✓

Part 2

logp(x;λ)=logλλx\log p(x; \lambda) = \log\lambda - \lambda x. Derivatives:

2/λ2(logλλx)=1/λ2.\partial^2/\partial\lambda^2\,(\log\lambda - \lambda x) = -1/\lambda^2. I(λ)=E[1/λ2]=1/λ2.I(\lambda) = -\mathbb{E}[-1/\lambda^2] = 1/\lambda^2.

Cramér-Rao bound: Var(λ^)1/(n1/λ2)=λ2/n\text{Var}(\hat\lambda) \ge 1/(n\cdot 1/\lambda^2) = \lambda^2/n.

Part 3

Compute Var(λ^)\text{Var}(\hat\lambda) exactly. Xˉ=(X1++Xn)/n\bar X = (X_1 + \cdots + X_n)/n. Sum of nn i.i.d. Exp(λ)\text{Exp}(\lambda) is Gamma(n,λ)\text{Gamma}(n, \lambda), so 1/Xˉ=n/Y1/\bar X = n/Y where YGamma(n,λ)Y \sim \text{Gamma}(n, \lambda).

For YGamma(n,λ)Y \sim \text{Gamma}(n, \lambda): E[1/Y]=λ/(n1)\mathbb{E}[1/Y] = \lambda/(n - 1) and E[1/Y2]=λ2/((n1)(n2))\mathbb{E}[1/Y^2] = \lambda^2/((n-1)(n-2)) (for n>2n > 2).

So E[λ^]=nλ/(n1)=λn/(n1)\mathbb{E}[\hat\lambda] = n\cdot \lambda/(n-1) = \lambda\cdot n/(n-1)biased upward.

Var(λ^)=n2(E[1/Y2](E[1/Y])2)=n2λ2/((n1)2(n2))\text{Var}(\hat\lambda) = n^2\cdot (\mathbb{E}[1/Y^2] - (\mathbb{E}[1/Y])^2) = n^2\cdot \lambda^2/((n-1)^2(n-2)).

In the limit nn \to \infty: Var(λ^)λ2/n\text{Var}(\hat\lambda) \sim \lambda^2/n. Asymptotically the MLE saturates the Cramér-Rao bound. But the CRB applies only to unbiased estimators; since λ^\hat\lambda is biased, the bound is only meaningful asymptotically.

Part 4 — Simulation

import numpy as np rng = np.random.default_rng(0) lam = 2.0 n = 100 m = 10_000 X = rng.exponential(1/lam, size=(m, n)) lam_hat = 1 / X.mean(axis=1) Z = np.sqrt(n) * (lam_hat - lam) print(f"mean of sqrt(n)(hat lambda - lambda): {Z.mean():.3f} (expected 0)") print(f"var: {Z.var():.3f} (expected lambda^2 = {lam**2})") # mean of sqrt(n)(hat lambda - lambda): 0.045 (expected 0) # var: 4.120 (expected lambda^2 = 4.0)

Mean close to 00, variance close to λ2=4\lambda^2 = 4. The n(λ^λ)\sqrt n(\hat\lambda - \lambda) distribution is approximately N(0,λ2)\mathcal{N}(0, \lambda^2), saturating the Cramér-Rao lower bound asymptotically.

Takeaways

  • Fisher information is the reciprocal of the asymptotic variance. Large I(θ)I(\theta) means data is very informative about θ\theta; small II means poor.
  • MLE is asymptotically efficient: it achieves the CRB asymptotically. No unbiased estimator can do asymptotically better.
  • Finite-sample bias doesn't matter asymptotically but matters for small nn. For exponential rate, the bias decays as 1/(n1)1/(n-1), which is small for n50n \ge 50.
  • The formula SE(θ^)=1/nI(θ^)\text{SE}(\hat\theta) = 1/\sqrt{n I(\hat\theta)} is the workhorse approximation. It gives quick, approximate confidence intervals: θ^±1.96SE(θ^)\hat\theta \pm 1.96\cdot \text{SE}(\hat\theta).
  • Connection to OLS. OLS is MLE under gaussian errors; the Fisher-information matrix for β\beta is XX/σ2X^\top X/\sigma^2, giving Var(β^)=σ2(XX)1\text{Var}(\hat\beta) = \sigma^2(X^\top X)^{-1} — exactly the OLS covariance formula derived earlier.
Solution — Fisher Information and the Cramér-Rao Bound for Exponential Rate | q4quant.studio