Every numerical method in quant finance produces a sequence of approximations that (hopefully) converges to the true answer. The binomial tree with n steps gives a price Cn that converges to the Black-Scholes price as n→∞. A Monte Carlo simulation with N samples gives an estimator C^N that converges to the true expected value. Newton-Raphson iteration for implied volatility produces a sequence σ0,σ1,σ2,… that converges to σimpl.
The questions a practitioner needs answered are: does the sequence converge?, how fast?, and when can I stop? Convergence tests answer the first question. Convergence rates answer the second and third. This page provides the mathematical tools for both.
Convergence of sequences
Definition
A sequence {an}converges to L if:
∀ε>0,∃N:n>N⟹∣an−L∣<ε
We write an→L or limn→∞an=L. A sequence that does not converge diverges.
Monotone convergence theorem
A bounded, monotone sequence converges.
If {an} is increasing and bounded above, it converges to supnan. If decreasing and bounded below, it converges to infnan.
Finance application: Many iterative calibration algorithms produce sequences that are monotonically decreasing in error (e.g., each step of a least-squares minimisation reduces the objective). If the error is bounded below by zero, the monotone convergence theorem guarantees the sequence converges.
Convergence rates
The rate of convergence describes how fast an→L:
Rate
Condition
Example in finance
O(1/n)
$
a_n - L
O(1/n2)
$
a_n - L
O(1/n)
$
a_n - L
Geometric / O(rn)
$
a_n - L
The distinction matters enormously in practice. To halve the error of a Monte Carlo estimator (O(1/N)), you need 4× the samples. To halve the error of a binomial tree (O(1/n)), you need 2× the steps. Newton-Raphson for implied vol converges geometrically — each iteration roughly doubles the number of correct digits.
Cauchy sequences
A sequence {an} is a Cauchy sequence if:
∀ε>0,∃N:m,n>N⟹∣am−an∣<ε
In R, a sequence converges if and only if it is Cauchy. The practical significance: you can check convergence without knowing the limit. In numerical algorithms, you monitor ∣an+1−an∣ and stop when it falls below a tolerance — this is a Cauchy criterion applied in practice.
Convergence of series
Definition
A series ∑n=1∞an converges if the sequence of partial sums SN=∑n=1Nan converges: SN→S as N→∞.
Necessary condition (divergence test)
If ∑an converges, then an→0.
The contrapositive: if an→0, the series diverges. Warning:an→0 is necessary but not sufficient — the harmonic series ∑1/n diverges even though 1/n→0.
Convergence tests
Geometric series:∑n=0∞rn=1−r1 converges if ∣r∣<1. This is the basis of perpetuity valuation: PV=∑n=1∞c/(1+r)n=c/r for r>0. See Discounting.
p-series:∑n=1∞1/np converges if p>1, diverges if p≤1. This benchmark is used in comparison tests.
Comparison test: If 0≤an≤bn and ∑bn converges, then ∑an converges. The continuous analogue is the comparison test for improper integrals.
Ratio test: If L=limn→∞∣an+1/an∣ exists, then the series converges absolutely if L<1 and diverges if L>1. The ratio test is how you determine the radius of convergence of power series and Taylor series: R=1/L.
Root test: If L=limsupn→∞∣an∣1/n, then the series converges absolutely if L<1 and diverges if L>1.
Integral test: If f is positive, continuous, and decreasing on [1,∞) with an=f(n), then ∑an converges if and only if ∫1∞f(x)dx converges. This connects series convergence to improper integral convergence.
Alternating series test (Leibniz): If an>0, an is decreasing, and an→0, then ∑(−1)n+1an converges. The error after N terms is bounded by ∣aN+1∣.
Absolute vs conditional convergence
A series converges absolutely if ∑∣an∣ converges. It converges conditionally if ∑an converges but ∑∣an∣ diverges. Absolutely convergent series can be rearranged in any order without changing the sum; conditionally convergent series cannot (Riemann rearrangement theorem).
In quant finance, the series you encounter (Taylor series within their radius, geometric series for PV calculations, Fourier series for characteristic functions) are typically absolutely convergent, so rearrangement is safe.
Examples and applications
Example 1: Newton-Raphson for implied volatility
Given a market call price Cmkt, Newton-Raphson iterates:
σn+1=σn−V(σn)CBS(σn)−Cmkt
where V is vega. Near the root, the error satisfies:
∣σn+1−σ∗∣≤M⋅∣σn−σ∗∣2
This is quadratic convergence: each iteration doubles the number of correct digits. Starting from σ0=0.20 with true implied vol σ∗=0.2347, typical convergence is:
Iteration
σn
Error
0
0.2000
3.5×10−2
1
0.2340
7×10−4
2
0.2347
3×10−7
3
0.2347
machine precision
Three iterations suffice. This geometric convergence rate is why Newton-Raphson is the standard method for implied vol computation. See Implicit Differentiation for the sensitivity analysis.
Example 2: Monte Carlo convergence rate
A Monte Carlo estimator C^N=N1∑i=1Nf(ST(i)) estimates E[f(ST)]. By the Central Limit Theorem:
C^N≈N(C,NVar(f(ST)))
The standard error is σ/N — convergence rate O(1/N). To reduce the error by a factor of 10, you need 100× more samples. This is slow compared to Newton-Raphson but dimension-independent, which is why Monte Carlo dominates for high-dimensional problems (basket options, path-dependent payoffs).
Example 3: convergence of the binomial tree to Black-Scholes
as the number of steps n→∞. The convergence is not monotone — Cn oscillates around CBS (depending on whether the strike falls between or on tree nodes). But ∣Cn−CBS∣ is bounded by M/n for some constant M, so by the squeeze theorem, Cn→CBS.
Richardson extrapolation improves this to O(1/n2) by combining Cn and C2n to cancel the leading error term — a standard trick for accelerating convergence.
Example 4: present value as a geometric series
A bond paying coupon c semi-annually for T years with yield y:
The coupon sum is a finite geometric series with ratio r=1/(1+y/2)<1. As T→∞ (perpetual bond), the partial sums converge to c/y — the infinite geometric series. The ratio test confirms convergence: ∣r∣=1/(1+y/2)<1 for y>0.
Common confusions and pitfalls
"an→0 implies ∑an converges." No. The harmonic series ∑1/n diverges. The terms shrinking to zero is necessary but not sufficient for the series to converge.
"Monte Carlo converges faster with more dimensions." No. The O(1/N) rate is dimension-independent, which is Monte Carlo's strength — but it does not improve with dimension. Grid-based methods converge faster in low dimensions (O(1/N2/d) for d dimensions) but degrade exponentially with dimension. The crossover is typically around d=4–6.
"Newton-Raphson always converges." Not if the starting point is too far from the root or if the derivative (vega) is too small. For implied vol, starting with σ0=0.20 is usually safe, but for deep OTM or short-dated options where vega is tiny, Newton-Raphson can overshoot and diverge. Bisection (slower but guaranteed) is a safer fallback.
Confusing the rate of convergence with the number of iterations.O(1/n2) convergence means 10× fewer steps for the same accuracy compared to O(1/n). Quadratic (geometric) convergence like Newton-Raphson means 2–3 iterations often suffice regardless of the starting error, which is qualitatively different from polynomial rates.
Where this goes next
Convergence of sequences and series connects to:
Limits: Sequence convergence is the discrete version of the limit concept.
Squeeze Theorem and Bounds: Comparison and bounding arguments are the primary tools for proving convergence.
Power Series: The ratio and root tests determine the radius of convergence of power series.
Numerical Integration: Convergence rates of numerical methods determine computational cost.