CONTENTS

Radon-Nikodym Theorem

Motivation: why this matters in quant finance

The Radon-Nikodym theorem is the measure-theoretic engine that underlies every change of measure in quantitative finance. It is the fact that, whenever two probability measures P\mathbb{P} and Q\mathbb{Q} on the same σ\sigma-algebra agree on "what is possible" (absolute continuity), there exists a non-negative random variable Z=dQ/dPZ = d\mathbb{Q}/d\mathbb{P} such that
EQ[X]=EP[ZX]\mathbb{E}^{\mathbb{Q}}[X] = \mathbb{E}^{\mathbb{P}}[Z\cdot X]
for every integrable XX. The function ZZ is the Radon-Nikodym derivative, the density of Q\mathbb{Q} with respect to P\mathbb{P}.

Concrete quant-finance applications:

  • Risk-neutral pricing. The fundamental theorem of asset pricing guarantees (under no-arbitrage) the existence of an equivalent martingale measure Q\mathbb{Q}. The Radon-Nikodym derivative dQ/dPd\mathbb{Q}/d\mathbb{P} encodes the "risk-neutralisation" that turns real-world drift into the risk-free rate.
  • Girsanov's theorem. The specific formula for dQ/dPd\mathbb{Q}/d\mathbb{P} when changing the drift of a Brownian motion — a Doléans-Dade exponential. Radon-Nikodym is the existence statement; Girsanov is the constructive formula.
  • Importance sampling in Monte Carlo. To price a deep OTM option, sample paths under Q\mathbb{Q} with higher probability in the interesting region, then re-weight by dQ/dPd\mathbb{Q}/d\mathbb{P} to keep the answer unbiased. Variance reduction; classical variance-reduction trick.
  • Esscher transforms. In actuarial and insurance pricing, the Esscher transform dQ/dPeθXd\mathbb{Q}/d\mathbb{P} \propto e^{\theta X} provides a tractable family of pricing measures for jump processes and Lévy models.
  • Density conversions. Going from a cumulative-distribution statement to a density statement, or from one parameterisation of a distribution to another, is always a Radon-Nikodym computation in disguise.

The theorem sits at an abstract level but its consequences are bread-and-butter. Every time you write "EQ[]=EP[Z]\mathbb{E}^{\mathbb{Q}}[\cdot] = \mathbb{E}^{\mathbb{P}}[Z\cdot]", you are invoking Radon-Nikodym.

The informal idea

Two probability measures P\mathbb{P} and Q\mathbb{Q} on a measurable space (Ω,F)(\Omega, \mathcal{F}) can have very different probabilities assigned to each event. But if Q\mathbb{Q} is absolutely continuous with respect to P\mathbb{P} (written QP\mathbb{Q} \ll \mathbb{P}) — meaning every P\mathbb{P}-null event is also a Q\mathbb{Q}-null event — then Q\mathbb{Q} can be encoded by a density Z:Ω[0,)Z: \Omega \to [0, \infty) relative to P\mathbb{P}:
Q(A)=AZdP=EP[Z1A]AF.\mathbb{Q}(A) = \int_A Z\,d\mathbb{P} = \mathbb{E}^{\mathbb{P}}[Z\cdot \mathbf{1}_A] \quad \forall A \in \mathcal{F}.
The function ZZ is the density of Q\mathbb{Q} relative to P\mathbb{P}. It's the same mental model as a continuous density fXf_X relative to Lebesgue measure — but generalised to arbitrary measures.
Why absolute continuity? If some P\mathbb{P}-null set AA has Q(A)>0\mathbb{Q}(A) > 0, then no density ZZ relative to P\mathbb{P} can produce that mass: AZdP=0\int_A Z\,d\mathbb{P} = 0 because P(A)=0\mathbb{P}(A) = 0. So absolute continuity is the exact condition needed for ZZ to exist.

Formal statement

Radon-Nikodym Theorem

Let (Ω,F)(\Omega, \mathcal{F}) be a measurable space and let μ,ν\mu, \nu be σ\sigma-finite measures on F\mathcal{F} with νμ\nu \ll \mu (i.e. μ(A)=0ν(A)=0\mu(A) = 0 \Rightarrow \nu(A) = 0). Then there exists a measurable function f:Ω[0,)f: \Omega \to [0, \infty), called the Radon-Nikodym derivative of ν\nu with respect to μ\mu, such that:
ν(A)=AfdμAF.\nu(A) = \int_A f\,d\mu \quad \forall A \in \mathcal{F}.

The function ff is unique up to μ\mu-null sets. We write f=dν/dμf = d\nu/d\mu.

In the probabilistic setting

If P,Q\mathbb{P}, \mathbb{Q} are probability measures on (Ω,F)(\Omega, \mathcal{F}) with QP\mathbb{Q} \ll \mathbb{P}, then there exists a non-negative random variable ZZ with EP[Z]=1\mathbb{E}^{\mathbb{P}}[Z] = 1 such that:
Q(A)=EP[Z1A]AF.\mathbb{Q}(A) = \mathbb{E}^{\mathbb{P}}[Z\cdot \mathbf{1}_A] \quad \forall A \in \mathcal{F}.

Equivalently, for any random variable X0X \ge 0 (or XL1(Q)X \in L^1(\mathbb{Q})):

EQ[X]=EP[ZX].\mathbb{E}^{\mathbb{Q}}[X] = \mathbb{E}^{\mathbb{P}}[Z\cdot X].

ZZ is denoted dQ/dPd\mathbb{Q}/d\mathbb{P}.

Equivalence of measures. If additionally PQ\mathbb{P} \ll \mathbb{Q}, we say P\mathbb{P} and Q\mathbb{Q} are equivalent (write PQ\mathbb{P} \sim \mathbb{Q}). In this case Z>0Z > 0 a.s. and dP/dQ=1/Zd\mathbb{P}/d\mathbb{Q} = 1/Z.

Standard properties

Property 1 — Chain rule

If RQP\mathbb{R} \ll \mathbb{Q} \ll \mathbb{P}, then:

dRdP=dRdQdQdPP-a.s.\frac{d\mathbb{R}}{d\mathbb{P}} = \frac{d\mathbb{R}}{d\mathbb{Q}}\cdot \frac{d\mathbb{Q}}{d\mathbb{P}} \quad \mathbb{P}\text{-a.s.}

So densities compose by multiplication — the natural calculus rule.

Property 2 — Inverse

If QP\mathbb{Q} \sim \mathbb{P} (equivalence), then Z=dQ/dPZ = d\mathbb{Q}/d\mathbb{P} is strictly positive P\mathbb{P}-a.s. and:

dPdQ=1ZQ-a.s.\frac{d\mathbb{P}}{d\mathbb{Q}} = \frac{1}{Z} \quad \mathbb{Q}\text{-a.s.}

Property 3 — Conditional expectation under change of measure (Bayes' formula)

If QP\mathbb{Q} \ll \mathbb{P} with density ZZ, and GF\mathcal{G} \subset \mathcal{F} is a sub-σ\sigma-algebra, then for any XL1(Q)X \in L^1(\mathbb{Q}):

EQ[XG]=EP[ZXG]EP[ZG]Q-a.s.\mathbb{E}^{\mathbb{Q}}[X \mid \mathcal{G}] = \frac{\mathbb{E}^{\mathbb{P}}[Z\cdot X \mid \mathcal{G}]}{\mathbb{E}^{\mathbb{P}}[Z \mid \mathcal{G}]} \quad \mathbb{Q}\text{-a.s.}
This Bayes formula for conditional expectations is the workhorse of change-of-measure pricing. It converts Q\mathbb{Q}-conditional expectations into P\mathbb{P}-conditional expectations weighted by ZZ.

Proof sketch

The proof is not elementary but the idea is constructive. Define:

f:=sup{g:g0 measurable,Agdμν(A) A}.f := \sup\{g: g \ge 0 \text{ measurable}, \int_A g\,d\mu \le \nu(A)\ \forall A\}.

Show that the supremum is attained (MCT on an increasing sequence achieving the supremum). Show that Afdμ=ν(A)\int_A f\,d\mu = \nu(A) — if equality failed for some AA (strict inequality), one could find a "larger" candidate, contradicting supremum maximality. σ\sigma-finiteness ensures this construction doesn't break due to infinite measure issues.

The full proof uses the Hahn decomposition of signed measures or the Jordan decomposition. See any graduate text (Folland, Royden, Billingsley).

Canonical examples

Example 1 — Densities on R\mathbb{R}

If XX has density fXf_X with respect to Lebesgue measure on R\mathbb{R}, then the law of XX, μX(B):=P(XB)\mu_X(B) := \mathbb{P}(X \in B), is absolutely continuous w.r.t. Lebesgue measure, and the density is fXf_X:

dμXd(Lebesgue)=fX.\frac{d\mu_X}{d(\text{Lebesgue})} = f_X.
This is the only Radon-Nikodym derivative most students have seen, disguised as "density."

Example 2 — Exponential tilting (Esscher transform)

Let XN(0,1)X \sim \mathcal{N}(0, 1) under P\mathbb{P}. Define Q\mathbb{Q} by:

dQdP=Z:=exp ⁣(θX12θ2).\frac{d\mathbb{Q}}{d\mathbb{P}} = Z := \exp\!\left(\theta X - \tfrac{1}{2}\theta^2\right).

Note EP[Z]=eθ2/2EP[eθX]=eθ2/2eθ2/2=1\mathbb{E}^{\mathbb{P}}[Z] = e^{-\theta^2/2}\cdot \mathbb{E}^{\mathbb{P}}[e^{\theta X}] = e^{-\theta^2/2}\cdot e^{\theta^2/2} = 1. ✓ (Q\mathbb{Q} is a probability measure.)

Compute the distribution of XX under Q\mathbb{Q}:

Q(Xx)=EP[Z1Xx]=xeθtθ2/212πet2/2dt.\mathbb{Q}(X \le x) = \mathbb{E}^{\mathbb{P}}[Z\cdot \mathbf{1}_{X \le x}] = \int_{-\infty}^x e^{\theta t - \theta^2/2}\cdot \frac{1}{\sqrt{2\pi}}e^{-t^2/2}\,dt.

The integrand is 12πexp((tθ)2/2)\frac{1}{\sqrt{2\pi}}\exp(-(t - \theta)^2/2) — density of N(θ,1)\mathcal{N}(\theta, 1). So under Q\mathbb{Q}, XN(θ,1)X \sim \mathcal{N}(\theta, 1).

Interpretation. Multiplying the P\mathbb{P}-density by eθXθ2/2e^{\theta X - \theta^2/2} shifts the mean from 00 to θ\theta without changing the variance. This is the discrete-time Girsanov theorem for a single normal random variable: change of drift for gaussian variables is an explicit exponential tilt.

Example 3 — Change of measure in a one-period binomial model

A stock goes up to uS0uS_0 with probability pp or down to dS0dS_0 with probability 1p1 - p under P\mathbb{P}. The risk-free rate over one period is rr. The risk-neutral probability is q:=(erd)/(ud)q := (e^r - d)/(u - d).

The Radon-Nikodym derivative is:

dQdP(ω)={q/pω=up(1q)/(1p)ω=down.\frac{d\mathbb{Q}}{d\mathbb{P}}(\omega) = \begin{cases} q/p & \omega = \text{up} \\ (1 - q)/(1 - p) & \omega = \text{down} \end{cases}.

Check: EP[dQ/dP]=pq/p+(1p)(1q)/(1p)=q+1q=1\mathbb{E}^{\mathbb{P}}[d\mathbb{Q}/d\mathbb{P}] = p\cdot q/p + (1-p)\cdot (1-q)/(1-p) = q + 1 - q = 1. ✓

The Radon-Nikodym derivative converts the real-world probability pp (which reflects drift and risk preferences) to the risk-neutral probability qq (which reflects no-arbitrage pricing).

Example 4 — Girsanov sketch

If (Wt)(W_t) is a P\mathbb{P}-Brownian motion and θ\theta is a (sufficiently regular) process, define:

Zt:=exp ⁣(0tθsdWs120tθs2ds)(a Doleˊans-Dade exponential).Z_t := \exp\!\left(-\int_0^t \theta_s\,dW_s - \tfrac{1}{2}\int_0^t \theta_s^2\,ds\right) \quad\text{(a Doléans-Dade exponential)}.
Under suitable integrability (Novikov's condition), ZtZ_t is a P\mathbb{P}-martingale with EP[Zt]=1\mathbb{E}^{\mathbb{P}}[Z_t] = 1, so Qt\mathbb{Q}_t defined by dQt/dP=Ztd\mathbb{Q}_t/d\mathbb{P} = Z_t is a probability measure. The Girsanov theorem says that under Qt\mathbb{Q}_t, the process W~t:=Wt+0tθsds\tilde W_t := W_t + \int_0^t \theta_s\,ds is a Brownian motion.
This is the machinery that enables changing from the real-world measure to the risk-neutral measure in continuous-time models. Full treatment in the Girsanov's Theorem lesson (if authored).

Common pitfalls

"dQ/dPd\mathbb{Q}/d\mathbb{P} is a single number." No — it's a random variable (or function on Ω\Omega). The density varies depending on the ω\omega.
"Radon-Nikodym requires P\mathbb{P} and Q\mathbb{Q} to have densities." No — the theorem constructs the density. The requirement is absolute continuity of one with respect to the other.
"If P\mathbb{P} and Q\mathbb{Q} are different, one must be absolutely continuous w.r.t. the other." False. Counter-example: take P\mathbb{P} supported on {0,1}\{0, 1\} and Q\mathbb{Q} supported on {2,3}\{2, 3\}. Each is singular w.r.t. the other; no Radon-Nikodym derivative exists in either direction. (Lebesgue's decomposition theorem handles the general case: any Q\mathbb{Q} can be written as Q=Qac+Qsing\mathbb{Q} = \mathbb{Q}_{\text{ac}} + \mathbb{Q}_{\text{sing}} where QacP\mathbb{Q}_{\text{ac}} \ll \mathbb{P} and QsingP\mathbb{Q}_{\text{sing}} \perp \mathbb{P}.)
"EQ[Z]=1\mathbb{E}^{\mathbb{Q}}[Z] = 1." EP[Z]=1\mathbb{E}^{\mathbb{P}}[Z] = 1, not under Q\mathbb{Q}. Under Q\mathbb{Q}, EQ[Z]=EP[Z2]\mathbb{E}^{\mathbb{Q}}[Z] = \mathbb{E}^{\mathbb{P}}[Z^2] — which can differ from 11.
"Girsanov is the same as Radon-Nikodym." Girsanov is a specific instance of Radon-Nikodym: it provides the explicit formula for dQ/dPd\mathbb{Q}/d\mathbb{P} when changing the drift of a Brownian motion. Radon-Nikodym is the general abstract existence theorem.

Where this goes next

  • Girsanov's Theorem: The continuous-time change-of-drift formula, the single most important instance of Radon-Nikodym in quant finance.
  • Martingale Measures: Risk-neutral pricing relies on Radon-Nikodym's guarantee of existence of an equivalent martingale measure under no-arbitrage.
  • Conditional Expectation: Bayes' formula for conditional expectations under change of measure uses Radon-Nikodym explicitly.
  • Importance Sampling: Monte Carlo variance-reduction via Radon-Nikodym tilts (a specific application of the change-of-measure idea).
  • The Derivation of the Black-Scholes Formula: The transition from the real-world SDE to the risk-neutral SDE uses the Radon-Nikodym derivative from Girsanov's theorem.

Exercises

Test your understanding with 3 exercises for this lesson.