Optional Stopping Theorem
Motivation: why this matters in quant finance
The
Optional Stopping Theorem (OST) is the sharpest form of "a fair game stays fair." Formally: if
(Mn) is a martingale and
τ is a stopping time satisfying one of a short list of regularity conditions, then
E[Mτ]=E[M0].
In one sentence:
you cannot cheat a martingale by choosing a clever exit time. The theorem sits at the heart of arbitrage arguments, gambler's-ruin computations, and the pricing of barrier and American options. It is also where "naive" probabilistic intuition most commonly goes wrong — the theorem has tight integrability hypotheses that are easy to miss, and its failure mode (the symmetric random walk that eventually hits
1) is a standard counter-example worth memorising.
In quant finance, OST is the tool that converts "this process is a martingale" into "my expected P&L at exit is zero" — the universal no-free-lunch statement. Every closed-form ruin probability, hitting-time distribution, and barrier-option price that has a slick derivation routes through it.
The informal idea
A martingale is a fair game:
E[Mn+1∣Fn]=Mn. "Fair" means zero expected profit per step. The natural question: can I add a
stopping rule — "exit when something happens" — to engineer a positive expected profit?
Intuitively, no. If at every step the expected next move is zero, then no matter when you exit, your expected final value should equal your starting value. This is the OST.
The subtlety: "no matter when you exit" is too strong. There are stopping rules that
exploit unboundedness. Classic example: symmetric random walk
Sn starting at
0, exit at
τ=inf{n:Sn=1}. The walk is recurrent, so
τ<∞ a.s. and
Sτ=1 a.s., so
E[Sτ]=1=0=E[S0].
What happened? The exit time τ is a stopping time, but has infinite expected value (E[τ]=∞). The strategy "bet until you're up 1 unit, then quit" works — eventually — but requires infinite expected time and an infinite credit line. The OST's hypotheses are designed to rule out exactly this pathological regime.
Formal statement
Discrete-time OST
Let
(Mn)n≥0 be a martingale with respect to a filtration
(Fn), and
τ a stopping time. Then
E[Mτ]=E[M0] provided
any one of the following holds:
- τ is bounded: τ≤K a.s. for some deterministic constant K.
- M is bounded: ∣Mn∣≤K a.s. for some K (equivalently, M bounded on {n≤τ}).
- E[τ]<∞ and martingale has bounded increments: E[τ]<∞ and there exists C with ∣Mn+1−Mn∣≤C a.s. for all n.
- Uniform integrability: (Mn∧τ) is uniformly integrable (subsumes (1)–(3)).
Why these hypotheses
The stopped process (Mn∧τ) is always a martingale (no condition), so E[Mn∧τ]=E[M0] for every fixed n. The OST question is whether limnE[Mn∧τ]=E[limnMn∧τ]=E[Mτ]. The limit-and-expectation interchange requires uniform integrability (or one of the sufficient conditions above).
Continuous-time OST
For a continuous-time right-continuous martingale (Mt) and a stopping time τ, the analogous statement holds under analogous conditions (bounded τ, bounded M, or uniform integrability of (Mt∧τ)).
Canonical applications
Application 1 — Gambler's ruin for a symmetric walk
Let Sn be a symmetric random walk starting at k∈{0,1,…,N}, stopped at τ=inf{n:Sn∈{0,N}} — the first exit time from (0,N).
- τ is finite almost surely (a recurrent walk exits any bounded interval).
- The stopped process Sn∧τ is bounded between 0 and N, so OST (condition 2) applies.
- E[Sτ]=k by OST.
- Sτ∈{0,N}, so E[Sτ]=N⋅P(Sτ=N). Setting the two equal:
P(hit N before 0, starting at k)=k/N.
This is the gambler's ruin formula in three lines.
Application 2 — Gambler's ruin for a biased walk
For a biased walk with
P(Xi=+1)=p,
P(Xi=−1)=q=1−p and
p=1/2, the walk
Sn is no longer a martingale, but the process
Mn=(q/p)Sn is (see the exponential-martingale exercise for the
discrete-time martingales lesson).
Apply OST to Mn at τ=inf{n:Sn∈{0,N}}. Stopped process is bounded (between (q/p)0=1 and (q/p)N), OST applies, and E[Mτ]=M0=(q/p)k. Since Sτ∈{0,N}:
E[Mτ]=P(hit 0)⋅1+P(hit N)⋅(q/p)N.
Letting ρ=q/p and P_N := \mathbb{P}(\text{hit Nbefore0}), solve:
PN=ρN−1ρk−1.
The classic biased-gambler's-ruin formula. For p<1/2 (ρ>1): PN→ρk−N as N→∞ — you will almost certainly ruin yourself playing against an unfavourable game, even starting far from the origin.
Application 3 — Expected hitting time for a random walk
Still with the symmetric walk
Sn starting at
k∈(0,N) and
τ=inf{n:Sn∈{0,N}}, use the martingale
Nn:=Sn2−n (see
Doob's decomposition exercise). OST condition (3) applies: increments are bounded (
∣Nn+1−Nn∣≤2max(Sn,1)+1≤2N+1 on
{τ>n}), and
E[τ]<∞ (provable separately).
E[Nτ]=k2−0=k2andE[Nτ]=E[Sτ2]−E[τ]=P(Sτ=N)⋅N2−E[τ].
Using P(Sτ=N)=k/N:
E[τ]=NkN2−k2=k(N−k).
Expected exit time is k(N−k) — quadratic in the distance to the barriers, maximal at the midpoint. This drives the "time between stops" estimates in drawdown risk.
Application 4 — Discounted asset price is a Q-martingale; hence, no arbitrage across any bounded stopping time
Under the risk-neutral measure Q, discounted prices S~n=e−rnΔtSn are a martingale. For any bounded stopping time τ (e.g. a trader's exit rule with a hard time cap), OST gives EQ[S~τ]=S~0. No strategy on (S~n) can generate risk-neutral expected profit — the no-free-lunch theorem, encoded in OST.
When OST fails — the symmetric-walk-hits-one example
Let Sn be a symmetric random walk starting at 0 and let τ=inf{n:Sn=1}. Facts:
- τ<∞ a.s. (symmetric walk on Z is recurrent).
- Sτ=1 a.s., so E[Sτ]=1.
- But E[S0]=0.
OST fails because none of the conditions hold:
- τ is not bounded.
- Sn is not bounded.
- Increments are bounded but E[τ]=∞.
- (Sn∧τ) is not uniformly integrable.
The naive interpretation of this failure is "doubling-up works": play long enough and you'll be up
+1 unit. This is true
in expectation over time, but requires infinite expected time and an infinite credit line. Real traders have finite patience and finite capital, which re-introduces OST's hypotheses and forces expected P&L back to zero.
Proof via uniform integrability (sketch)
The stopped process (Mn∧τ) is always a martingale, so E[Mn∧τ]=E[M0] for every n. We want E[Mτ]=E[M0], which requires:
n→∞limE[Mn∧τ]=E[n→∞limMn∧τ]=E[Mτ].
Uniform integrability is the exact condition for this interchange (Vitali's theorem). Each sufficient condition in the theorem implies uniform integrability:
- Bounded τ: the stopping occurs by a fixed time, so Mn∧τ=Mτ for n≥K.
- Bounded M: a uniformly bounded sequence is trivially uniformly integrable.
- Bounded increments + E[τ]<∞: ∣Mn∧τ∣≤∣M0∣+Cτ, and E[τ]<∞ suffices for uniform integrability by dominated convergence.
Common confusions and pitfalls
"OST says E[Mτ]=E[M0] for any martingale and any stopping time." Emphatically no. The hypotheses are required. The symmetric-walk-hits-one example is the sharpest counter-example; know it.
"If τ<∞ a.s., OST applies." Finite
τ a.s. is
not enough — you need
E[τ]<∞ (at minimum) plus bounded increments, or uniform integrability. Recurrent walks on unbounded state spaces are the canonical exception.
"OST only works for positive martingales." OST works for any martingale under its hypotheses. The sign of
M doesn't matter; what matters is control of the stopped process's tail.
"OST gives you the distribution of Mτ." Only its mean. Distributional information requires additional tools — martingale CLT, explicit hitting-time distributions, stochastic comparison. OST is the most elementary of a family of
moment identities.
"If I have a non-trivial martingale strategy in a real market, OST must be violated." No — real-world markets are not martingales under the real-world measure
P. They are martingales under
Q. A strategy can have positive
EP P&L (alpha) while still respecting OST under
Q (no arbitrage).
Where this goes next
- Stopping Times: Foundational definitions; OST is the payoff theorem.
- Martingales (Discrete Time): The class of processes to which OST applies.
- Brownian Motion: Continuous-time hitting-time problems; closed-form distributions for Brownian first-exit times come from continuous-time OST.
- Martingale I: Fundamental theorem of asset pricing; OST applied to discounted prices proves no-arbitrage.
- Barrier Options: Knock-in/knock-out options valued as expectations of stopped payoffs under Q — OST and hitting-time distributions are the core machinery.
- American Options and Optimal Stopping: The optimal exercise problem is a stopping-time optimisation where OST provides the key dual characterisation.
References
- Lawler, G. F. (2023). Stochastic Calculus: An Introduction with Applications. Ch. 1 §1.3 (Optional sampling theorem), Ch. 4 §4.1 (Martingales and local martingales).