Conditional Expectation
Motivation: why this matters in quant finance
The conditioning object is not a single event like "the first coin toss was heads". It is the entire information available at time : prices, realised volatility, rates, defaults, and anything else the model allows the trader to know. Conditional expectation is the operation that converts a future random payoff into the best current forecast using that information.
Bertsekas builds conditional expectation from a simpler idea: first condition on an event, then on the value of another random variable, then view itself as a random variable determined by . The sigma-algebra version used in stochastic finance is the same idea with "the value of " replaced by "the information set ."
The informal idea
Conditional expectation averages only over distinctions the conditioning information cannot see. If tells you which cell of a partition occurred, then is constant on each cell and equals the average value of inside that cell.
This is why is a random variable. Before the outcome is realised, you do not know which cell you will be in. After the information in is revealed, the forecast takes the value attached to that cell.
In finance: before observing , the time- option value is random. Once and the rest of are observed, the value is known.
Formal definitions
Conditioning on an event
For ,
This is a number: the average of under the probability law restricted to .
Conditioning on a random variable
For discrete ,
The object is the random variable obtained by substituting into the function .
Conditioning on a sigma-algebra
Let be a sub-sigma-algebra and let be integrable. The conditional expectation is the almost-surely unique random variable satisfying:
- is -measurable.
- For every ,
The first condition says the forecast uses only the information in . The second says it preserves the correct average on every event that can distinguish.
Key properties
Law of iterated expectations
Bertsekas states the basic form as
In sigma-algebra form, if ,
This is the tower property. It says that forecasting with more information and then coarsening back to less information gives the same result as forecasting directly with less information.
Pulling out known quantities
If is -measurable, then
What is already known can be treated as a constant inside the conditional expectation.
Independence removes information value
If is independent of , then
For Brownian motion, this is the reason future increments have conditional mean zero given the past.
Full and trivial information
Full information leaves no uncertainty about ; no information leaves only the unconditional mean.
Conditional variance decomposition
Bertsekas derives the law of total variance:
The same idea separates average residual uncertainty from uncertainty in the conditional forecast.
Worked examples
Example 1: conditional expectation on a finite partition
Let for two fair coin tosses, and let be the number of heads. Suppose reveals only the first toss.
If the first toss is , the possible outcomes are , so the average of is . If the first toss is , the possible outcomes are , so the average is .
Thus is the random variable equal to on and on .
Example 2: Brownian motion is a martingale
For ,
The first term is known at time ; the second is an independent future increment with mean zero.
Example 3: risk-neutral pricing over time
A European payoff paid at has time- value
At , this is the usual pricing expectation. At later times, the conditioning information changes the distribution of the remaining uncertainty. The tower property is what makes the discounted price process dynamically consistent.
Example 4: forecast revision has zero prior mean
Bertsekas notes that if is a revised forecast after observing , then
Before seeing the information, the expected revision is zero. If it were systematically positive, the original forecast was too low.
Common confusions and pitfalls
Where this goes next
- Filtrations and Information: Supplies the time-indexed sigma-algebras used in .
- Martingales Discrete Time: Defines fair-game processes through conditional expectation.
- Optional Stopping Theorem: Studies conditional-expectation behaviour under random stopping times.
- Risk-Neutral Valuation: Interprets derivative prices as conditional expectations under .
- Radon-Nikodym Theorem: Provides the existence machinery behind the general conditional expectation definition.
References
- Bertsekas, D. P., & Tsitsiklis, J. N. (2008). Introduction to Probability (2nd ed.). Athena Scientific. Ch. 2 §2.6 (Conditioning), Ch. 4 §4.3 (Conditional Expectation and Variance Revisited). The sigma-algebra formulation extends the textbook's random-variable conditioning treatment.