Many asset-price models are easy to describe through the distribution of log returns but hard to price from a density. Heston, Variance Gamma, CGMY, and other affine or jump models often provide a closed-form characteristic function while the density is unavailable or expensive to evaluate. Fourier pricing methods exploit this: price from the transform, not from the density.
A characteristic function is also the transform that does not fail. Moment generating functions require exponential moments and can explode for heavy-tailed variables. Characteristic functions always exist because ∣eitX∣=1. That makes them the right language for the Central Limit Theorem, weak convergence, and heavy-tailed modelling.
Bertsekas develops real transforms / MGFs in Ch. 4.4 and uses them to identify distributions, read moments, and handle sums of independent random variables. Characteristic functions follow the same transform logic with s replaced by it, but their interpretation is Fourier rather than exponential-growth.
The informal idea
The characteristic function of X is
φX(t)=E[eitX],t∈R.
Using Euler's identity,
eitX=cos(tX)+isin(tX),
so φX records how the distribution of X oscillates at every frequency t. This is why the characteristic function is a Fourier transform of the probability law.
The main computational gift is that independent sums become products:
φX+Y(t)=φX(t)φY(t)
when X and Y are independent. Convolution in the density world becomes multiplication in the transform world.
Formal definition
For a real-valued random variable X, the characteristic function is
φX(t)=E[eitX],t∈R.
If X has density fX, then
φX(t)=∫−∞∞eitxfX(x)dx.
If X is discrete,
φX(t)=x∑eitxpX(x).
Unlike the MGF MX(s)=E[esX], this expectation is always finite because ∣eitX∣=1.
Key properties
Boundedness and normalisation
∣φX(t)∣≤1,φX(0)=1.
The bound is the reason the transform is always available.
Affine transformations
If Y=aX+b, then
φY(t)=eitbφX(at).
Shifting a distribution rotates the transform; scaling changes the frequency.
Independent sums
For independent X and Y,
φX+Y(t)=φX(t)φY(t).
This is the same multiplication principle Bertsekas develops for MGFs, now in a domain where existence is automatic.
Moments from derivatives
If E[∣X∣k]<∞, then
φX(k)(0)=ikE[Xk].
The condition is important. The characteristic function can exist even when the moment does not.
Uniqueness
The characteristic function determines the distribution. If φX(t)=φY(t) for all t, then X and Y have the same law.
Convergence
Lévy's continuity theorem says that pointwise convergence of characteristic functions, with a limit continuous at zero, implies convergence in distribution. This is the engine behind the characteristic-function proof of the CLT.
Worked examples
Example 1: normal characteristic function
For X∼N(μ,σ2),
φX(t)=exp(iμt−21σ2t2).
The linear term carries location; the quadratic decay carries variance.
Example 2: sum of independent normals
Let X∼N(μX,σX2) and Y∼N(μY,σY2) be independent. Then
By uniqueness, X+Y is normal with mean μX+μY and variance σX2+σY2.
Example 3: CLT mechanism
If Xi are i.i.d. with mean 0 and variance 1, then near zero
φX(u)=1−21u2+o(u2).
For Sn=(X1+⋯+Xn)/n,
φSn(t)=(φX(t/n))n→e−t2/2,
which is the characteristic function of N(0,1).
Example 4: option pricing from a log-price CF
Suppose a model gives a closed form for φlogST(t) but not for the density of ST. Fourier methods damp the payoff, transform it, multiply by the model CF, and invert numerically. The density never has to be written explicitly. This is the practical reason characteristic functions appear in Heston-style pricing engines.
Common confusions and pitfalls
"Complex-valued means non-probabilistic." The complex number is a transform coordinate. Probabilities and densities are recovered through inversion or distributional identification.
"The CF and MGF are interchangeable." They coincide only formally under s=it. The MGF may fail to exist; the CF always exists.
"A CF gives moments automatically." Only if the corresponding absolute moments exist. Cauchy variables have characteristic functions but no mean or variance.
"Multiplication works for any sum." Multiplication of transforms requires independence. Dependence requires a joint characteristic function.
"Knowing a few CF values is enough." The uniqueness theorem needs the full function, not isolated frequencies.
Bertsekas, D. P., & Tsitsiklis, J. N. (2008). Introduction to Probability (2nd ed.). Athena Scientific. Ch. 4 §4.4 (Transforms). Bertsekas develops MGFs rather than characteristic functions; this note uses the same transform principles with the standard Fourier-domain extension.
Exercises
Test your understanding with 3 exercises for this lesson.