CONTENTS

Random Variables

Motivation: why this matters in quant finance

A probability space contains abstract outcomes. A random variable turns those outcomes into numbers: a terminal stock price, a log return, a default indicator, a portfolio loss, or an option payoff. Without random variables, a probability model can say which state occurred but cannot produce the numerical quantities quants price, hedge, forecast, and stress.

Bertsekas introduces random variables by asking how probabilities of numerical values are induced from probabilities of underlying outcomes. That direction matters. The model does not start with a histogram floating in space; it starts with outcomes ω\omega, then a function X(ω)X(\omega), then probabilities such as P(X=x)\mathbb{P}(X=x) or P(Xx)\mathbb{P}(X\le x).

In finance, this distinction keeps three ideas separate: the state of the world ω\omega, the observable quantity ST(ω)S_T(\omega), and the distribution of STS_T. Two models may have the same terminal distribution but different path structure, which matters immediately for barriers, Asians, hedging, and filtrations.

The informal idea

A random variable is a measurement rule. It looks at the realised outcome and reports a number. If ω\omega is the complete future market path, then

  • ST(ω)S_T(\omega) reports the terminal stock price,
  • (ST(ω)K)+(S_T(\omega)-K)^+ reports the call payoff,
  • 1{ST>K}(ω)\mathbf{1}_{\{S_T>K\}}(\omega) reports whether a digital call pays,
  • log(ST(ω)/S0)\log(S_T(\omega)/S_0) reports the log return.

The distribution of XX forgets the identity of the original outcome and remembers only how much probability lands on each numerical value or interval. That loss of detail is useful, but it is also dangerous: a terminal distribution alone does not tell you what happened along the way.

Formal definitions

Random variable

Let (Ω,F,P)(\Omega,\mathcal{F},\mathbb{P}) be a probability space. A real-valued random variable is a measurable function
X:ΩR.X:\Omega\to\mathbb{R}.

Measurability means

{ωΩ:X(ω)x}Ffor every xR.\{\omega\in\Omega:X(\omega)\le x\}\in\mathcal{F}\qquad\text{for every }x\in\mathbb{R}.

Equivalently, X1(B)FX^{-1}(B)\in\mathcal{F} for every Borel set BRB\subseteq\mathbb{R}. The shorter condition with half-lines is enough because half-lines generate the Borel sigma-algebra.

Distribution and CDF

The distribution or law of XX is the probability measure on R\mathbb{R} induced by XX:
μX(B)=P(XB).\mu_X(B)=\mathbb{P}(X\in B).
The cumulative distribution function is
FX(x)=P(Xx).F_X(x)=\mathbb{P}(X\le x).

Bertsekas uses the CDF as the common language for discrete and continuous random variables: every random variable has a CDF, even when it has neither a convenient PMF nor a smooth PDF.

Discrete and continuous cases

If XX takes countably many values xix_i, its PMF is

pX(xi)=P(X=xi),ipX(xi)=1.p_X(x_i)=\mathbb{P}(X=x_i),\qquad \sum_i p_X(x_i)=1.

If XX has a density fXf_X, then

FX(x)=xfX(u)du.F_X(x)=\int_{-\infty}^{x} f_X(u)\,du.

Key properties

Random variables push probability forward

The event {XB}\{X\in B\} is a subset of Ω\Omega, but the distribution μX(B)\mu_X(B) is a probability on numerical sets BRB\subseteq\mathbb{R}. This is why the same abstract probability space can support many financial quantities at once.

The CDF characterises the distribution

Knowing FX(x)F_X(x) for every xx determines the law of XX. For integer-valued XX,

pX(k)=FX(k)FX(k1).p_X(k)=F_X(k)-F_X(k-1).

For continuous XX with differentiable CDF,

fX(x)=FX(x).f_X(x)=F_X'(x).

Functions of random variables remain random variables

If Y=g(X)Y=g(X) for a measurable function gg, then YY is a random variable. Payoffs are built this way: H=g(ST)H=g(S_T). The law of YY is derived from the law of XX plus the transformation gg.

Several random variables require joint structure

Marginal distributions of XX and YY do not determine the joint distribution. A two-asset option needs the joint law of (ST(1),ST(2))(S_T^{(1)},S_T^{(2)}), not only the two separate terminal distributions. Dependence enters through the joint law.

Equality has multiple meanings

Random variables can be equal pointwise, equal almost surely, or equal in distribution. In probability and finance, almost-sure equality is usually the operational notion: changing a payoff on a null event does not change its price.

Worked examples

Example 1: terminal stock price and payoff

In a one-period model with Ω={u,d}\Omega=\{u,d\},

ST(u)=110,ST(d)=90.S_T(u)=110,\qquad S_T(d)=90.

For a call with strike 100100,

H(ω)=(ST(ω)100)+,H(\omega)=(S_T(\omega)-100)^+,

so H(u)=10H(u)=10 and H(d)=0H(d)=0. The payoff is a random variable derived from the price random variable.

Example 2: a CDF from a discrete PMF

Let XX be the number of heads in two fair coin tosses. Then

pX(0)=14,pX(1)=12,pX(2)=14.p_X(0)=\frac14,\qquad p_X(1)=\frac12,\qquad p_X(2)=\frac14.

Its CDF is a step function:

FX(x)={0,x<0,1/4,0x<1,3/4,1x<2,1,x2.F_X(x)= \begin{cases} 0, & x<0,\\ 1/4, & 0\le x<1,\\ 3/4, & 1\le x<2,\\ 1, & x\ge2. \end{cases}

The jumps of the CDF are the point probabilities.

Example 3: why the joint law matters

Suppose two assets each have return +10%+10\% or 10%-10\% with probability 1/21/2. If they always move together, an equally weighted portfolio has return +10%+10\% or 10%-10\%. If one always rises when the other falls, the same portfolio has return 00 with certainty. The marginal distributions are identical; the joint distribution changes the risk completely.

Common confusions and pitfalls

"A random variable is random because its value changes by itself." The function XX is fixed. Randomness comes from the realised outcome ω\omega.
"The distribution is the same thing as the random variable." The distribution forgets the underlying state space. That is enough for vanilla terminal payoffs, but not for path-dependent products or conditioning on information.
"Every function of ω\omega is allowed." It must be measurable. In ordinary quant examples this is rarely a practical obstacle, but it is the condition that makes probabilities like P(Xx)\mathbb{P}(X\le x) meaningful.
"Equal in distribution means interchangeable." Not for joint questions. Two variables can have the same law but different dependence with other variables.
"The PDF is more fundamental than the CDF." The CDF exists for every real-valued random variable. PDFs exist only for continuous laws with suitable smoothness.

Where this goes next

References

  • Bertsekas, D. P., & Tsitsiklis, J. N. (2008). Introduction to Probability (2nd ed.). Athena Scientific. Ch. 2 §2.2 (Probability Mass Functions), §2.3 (Functions of Random Variables), Ch. 3 §3.2 (Cumulative Distribution Functions).
Random Variables | q4quant.studio