CONTENTS

Convergence of Sequences and Series

Motivation: why this matters in quant finance

Every numerical method in quant finance produces a sequence of approximations that (hopefully) converges to the true answer. The binomial tree with nn steps gives a price CnC_n that converges to the Black-Scholes price as nn \to \infty. A Monte Carlo simulation with NN samples gives an estimator C^N\hat{C}_N that converges to the true expected value. Newton-Raphson iteration for implied volatility produces a sequence σ0,σ1,σ2,\sigma_0, \sigma_1, \sigma_2, \ldots that converges to σimpl\sigma_{\text{impl}}.
The questions a practitioner needs answered are: does the sequence converge?, how fast?, and when can I stop? Convergence tests answer the first question. Convergence rates answer the second and third. This page provides the mathematical tools for both.

Convergence of sequences

Definition

A sequence {an}\{a_n\} converges to LL if:
ε>0,  N:n>N    anL<ε\forall\,\varepsilon > 0, \;\exists\,N: n > N \implies |a_n - L| < \varepsilon
We write anLa_n \to L or limnan=L\lim_{n \to \infty} a_n = L. A sequence that does not converge diverges.

Monotone convergence theorem

A bounded, monotone sequence converges.

If {an}\{a_n\} is increasing and bounded above, it converges to supnan\sup_n a_n. If decreasing and bounded below, it converges to infnan\inf_n a_n.

Finance application: Many iterative calibration algorithms produce sequences that are monotonically decreasing in error (e.g., each step of a least-squares minimisation reduces the objective). If the error is bounded below by zero, the monotone convergence theorem guarantees the sequence converges.

Convergence rates

The rate of convergence describes how fast anLa_n \to L:
RateConditionExample in finance
O(1/n)O(1/n)$a_n - L
O(1/n2)O(1/n^2)$a_n - L
O(1/n)O(1/\sqrt{n})$a_n - L
Geometric / O(rn)O(r^n)$a_n - L

The distinction matters enormously in practice. To halve the error of a Monte Carlo estimator (O(1/N)O(1/\sqrt{N})), you need 4× the samples. To halve the error of a binomial tree (O(1/n)O(1/n)), you need 2× the steps. Newton-Raphson for implied vol converges geometrically — each iteration roughly doubles the number of correct digits.

Cauchy sequences

A sequence {an}\{a_n\} is a Cauchy sequence if:
ε>0,  N:m,n>N    aman<ε\forall\,\varepsilon > 0, \;\exists\,N: m, n > N \implies |a_m - a_n| < \varepsilon

In R\mathbb{R}, a sequence converges if and only if it is Cauchy. The practical significance: you can check convergence without knowing the limit. In numerical algorithms, you monitor an+1an|a_{n+1} - a_n| and stop when it falls below a tolerance — this is a Cauchy criterion applied in practice.

Convergence of series

Definition

A series n=1an\sum_{n=1}^{\infty} a_n converges if the sequence of partial sums SN=n=1NanS_N = \sum_{n=1}^{N} a_n converges: SNSS_N \to S as NN \to \infty.

Necessary condition (divergence test)

If an\sum a_n converges, then an0a_n \to 0.

The contrapositive: if an↛0a_n \not\to 0, the series diverges. Warning: an0a_n \to 0 is necessary but not sufficient — the harmonic series 1/n\sum 1/n diverges even though 1/n01/n \to 0.

Convergence tests

Geometric series: n=0rn=11r\sum_{n=0}^{\infty} r^n = \frac{1}{1-r} converges if r<1|r| < 1. This is the basis of perpetuity valuation: PV=n=1c/(1+r)n=c/r\text{PV} = \sum_{n=1}^{\infty} c/(1+r)^n = c/r for r>0r > 0. See Discounting.
pp-series: n=11/np\sum_{n=1}^{\infty} 1/n^p converges if p>1p > 1, diverges if p1p \leq 1. This benchmark is used in comparison tests.
Comparison test: If 0anbn0 \leq a_n \leq b_n and bn\sum b_n converges, then an\sum a_n converges. The continuous analogue is the comparison test for improper integrals.
Ratio test: If L=limnan+1/anL = \lim_{n \to \infty} |a_{n+1}/a_n| exists, then the series converges absolutely if L<1L < 1 and diverges if L>1L > 1. The ratio test is how you determine the radius of convergence of power series and Taylor series: R=1/LR = 1/L.
Root test: If L=lim supnan1/nL = \limsup_{n \to \infty} |a_n|^{1/n}, then the series converges absolutely if L<1L < 1 and diverges if L>1L > 1.
Integral test: If ff is positive, continuous, and decreasing on [1,)[1, \infty) with an=f(n)a_n = f(n), then an\sum a_n converges if and only if 1f(x)dx\int_1^{\infty} f(x)\,dx converges. This connects series convergence to improper integral convergence.
Alternating series test (Leibniz): If an>0a_n > 0, ana_n is decreasing, and an0a_n \to 0, then (1)n+1an\sum (-1)^{n+1} a_n converges. The error after NN terms is bounded by aN+1|a_{N+1}|.

Absolute vs conditional convergence

A series converges absolutely if an\sum |a_n| converges. It converges conditionally if an\sum a_n converges but an\sum |a_n| diverges. Absolutely convergent series can be rearranged in any order without changing the sum; conditionally convergent series cannot (Riemann rearrangement theorem).

In quant finance, the series you encounter (Taylor series within their radius, geometric series for PV calculations, Fourier series for characteristic functions) are typically absolutely convergent, so rearrangement is safe.

Examples and applications

Example 1: Newton-Raphson for implied volatility

Given a market call price CmktC_{\text{mkt}}, Newton-Raphson iterates:

σn+1=σnCBS(σn)CmktV(σn)\sigma_{n+1} = \sigma_n - \frac{C_{\text{BS}}(\sigma_n) - C_{\text{mkt}}}{\mathcal{V}(\sigma_n)}

where V\mathcal{V} is vega. Near the root, the error satisfies:

σn+1σMσnσ2|\sigma_{n+1} - \sigma^*| \leq M \cdot |\sigma_n - \sigma^*|^2
This is quadratic convergence: each iteration doubles the number of correct digits. Starting from σ0=0.20\sigma_0 = 0.20 with true implied vol σ=0.2347\sigma^* = 0.2347, typical convergence is:
Iterationσn\sigma_nError
00.20003.5×1023.5 \times 10^{-2}
10.23407×1047 \times 10^{-4}
20.23473×1073 \times 10^{-7}
30.2347machine precision
Three iterations suffice. This geometric convergence rate is why Newton-Raphson is the standard method for implied vol computation. See Implicit Differentiation for the sensitivity analysis.

Example 2: Monte Carlo convergence rate

A Monte Carlo estimator C^N=1Ni=1Nf(ST(i))\hat{C}_N = \frac{1}{N}\sum_{i=1}^{N} f(S_T^{(i)}) estimates E[f(ST)]\mathbb{E}[f(S_T)]. By the Central Limit Theorem:
C^NN(C,Var(f(ST))N)\hat{C}_N \approx \mathcal{N}\left(C, \frac{\text{Var}(f(S_T))}{N}\right)
The standard error is σ/N\sigma/\sqrt{N} — convergence rate O(1/N)O(1/\sqrt{N}). To reduce the error by a factor of 10, you need 100× more samples. This is slow compared to Newton-Raphson but dimension-independent, which is why Monte Carlo dominates for high-dimensional problems (basket options, path-dependent payoffs).

Example 3: convergence of the binomial tree to Black-Scholes

The CRR binomial tree price CnC_n satisfies:
CnCBS=O(1/n)|C_n - C_{\text{BS}}| = O(1/n)
as the number of steps nn \to \infty. The convergence is not monotone — CnC_n oscillates around CBSC_{\text{BS}} (depending on whether the strike falls between or on tree nodes). But CnCBS|C_n - C_{\text{BS}}| is bounded by M/nM/n for some constant MM, so by the squeeze theorem, CnCBSC_n \to C_{\text{BS}}.

Richardson extrapolation improves this to O(1/n2)O(1/n^2) by combining CnC_n and C2nC_{2n} to cancel the leading error term — a standard trick for accelerating convergence.

Example 4: present value as a geometric series

A bond paying coupon cc semi-annually for TT years with yield yy:

P=i=12Tc/2(1+y/2)i+F(1+y/2)2T=c/2y/2(11(1+y/2)2T)+F(1+y/2)2TP = \sum_{i=1}^{2T} \frac{c/2}{(1 + y/2)^i} + \frac{F}{(1 + y/2)^{2T}} = \frac{c/2}{y/2}\left(1 - \frac{1}{(1+y/2)^{2T}}\right) + \frac{F}{(1+y/2)^{2T}}

The coupon sum is a finite geometric series with ratio r=1/(1+y/2)<1r = 1/(1 + y/2) < 1. As TT \to \infty (perpetual bond), the partial sums converge to c/yc/y — the infinite geometric series. The ratio test confirms convergence: r=1/(1+y/2)<1|r| = 1/(1+y/2) < 1 for y>0y > 0.

Common confusions and pitfalls

"an0a_n \to 0 implies an\sum a_n converges." No. The harmonic series 1/n\sum 1/n diverges. The terms shrinking to zero is necessary but not sufficient for the series to converge.
"Monte Carlo converges faster with more dimensions." No. The O(1/N)O(1/\sqrt{N}) rate is dimension-independent, which is Monte Carlo's strength — but it does not improve with dimension. Grid-based methods converge faster in low dimensions (O(1/N2/d)O(1/N^{2/d}) for dd dimensions) but degrade exponentially with dimension. The crossover is typically around d=4d = 466.
"Newton-Raphson always converges." Not if the starting point is too far from the root or if the derivative (vega) is too small. For implied vol, starting with σ0=0.20\sigma_0 = 0.20 is usually safe, but for deep OTM or short-dated options where vega is tiny, Newton-Raphson can overshoot and diverge. Bisection (slower but guaranteed) is a safer fallback.
Confusing the rate of convergence with the number of iterations. O(1/n2)O(1/n^2) convergence means 10× fewer steps for the same accuracy compared to O(1/n)O(1/n). Quadratic (geometric) convergence like Newton-Raphson means 2–3 iterations often suffice regardless of the starting error, which is qualitatively different from polynomial rates.

Where this goes next

Convergence of sequences and series connects to:

  • Limits: Sequence convergence is the discrete version of the limit concept.
  • Squeeze Theorem and Bounds: Comparison and bounding arguments are the primary tools for proving convergence.
  • Power Series: The ratio and root tests determine the radius of convergence of power series.
  • Numerical Integration: Convergence rates of numerical methods determine computational cost.
  • Implicit Differentiation: Newton-Raphson convergence analysis for root-finding in calibration.
  • Central Limit Theorem: The CLT determines the O(1/N)O(1/\sqrt{N}) convergence rate of Monte Carlo.

References

  • Stewart, J. (2008). Single Variable Calculus: Early Transcendentals (6th ed.). Thomson Brooks/Cole. Ch. 11 Sections 11.1-11.6 (sequences, series, integral/comparison tests, alternating series).
Convergence of Sequences and Series | q4quant.studio