CONTENTS

Coherent Risk Measures

Motivation: why this matters in quant finance

Every risk manager needs a single number that answers "how bad can this get?" — one scalar that compresses a random future P&L distribution into a tractable figure. But not all ways of compressing are equal: some violate basic common-sense properties, and the ones that do have been implicated in real market blow-ups.

Value at Risk (VaR) is the industry workhorse, but it is not coherent — it can say that combining two portfolios produces more risk than the sum of their individual risks, which contradicts the intuition that diversification should not hurt. This failure isn't academic: it matters whenever a bank aggregates desk-level risks into a bank-level figure, or allocates capital across subsidiaries.
Artzner, Delbaen, Eber & Heath (1999) gave a clean axiomatic answer: write down the four properties any reasonable risk measure ought to satisfy, then ask which measures satisfy them. The answer — coherent risk measures — opens the door to Expected Shortfall (ES), now the Basel regulatory standard for market risk, and to a whole family of risk measures derived from acceptance sets and scenario analyses.

This note states the four axioms, proves VaR fails one of them, and derives the key dual representation that makes ES tractable.

The informal idea

Think of a risk measure ρ(X)\rho(X) as the minimum amount of cash the firm must hold so that the portfolio's position XX (a random net worth at horizon TT) is acceptable. Four things ought to be true:
  1. If portfolio XX is always at least as good as YY, then XX should require no more capital than YY.
  2. Adding cash cc to the portfolio should reduce required capital by exactly cc.
  3. Scaling the portfolio by λ>0\lambda > 0 should scale required capital by λ\lambda.
  4. Combining two portfolios into one should not require more capital than holding them separately — "a merger is at least as good as two separate firms."

The fourth is the diversification principle, and it is the one VaR breaks.

Formal definitions

Let X,Y,X, Y, \ldots be random variables on a probability space (Ω,F,P)(\Omega, \mathcal{F}, \mathbb{P}) representing portfolio values at a fixed horizon TT. A risk measure is any function ρ:XR\rho : \mathcal{X} \to \mathbb{R} on a suitable space of random variables X\mathcal{X}.
ρ\rho is coherent if it satisfies:
(A1) Monotonicity. If XYX \le Y almost surely, then ρ(X)ρ(Y)\rho(X) \ge \rho(Y).
(A2) Translation invariance (cash invariance). For any constant cRc \in \mathbb{R}, ρ(X+c)=ρ(X)c\rho(X + c) = \rho(X) - c.
(A3) Positive homogeneity. For any λ0\lambda \ge 0, ρ(λX)=λρ(X)\rho(\lambda X) = \lambda\,\rho(X).
(A4) Subadditivity. ρ(X+Y)ρ(X)+ρ(Y)\rho(X + Y) \le \rho(X) + \rho(Y).
A risk measure satisfying only (A1) and (A2) is called a monetary risk measure. Adding (A4) gives a convex risk measure if we weaken (A3) to convexity ρ(λX+(1λ)Y)λρ(X)+(1λ)ρ(Y)\rho(\lambda X + (1-\lambda)Y) \le \lambda\rho(X) + (1-\lambda)\rho(Y) for λ[0,1]\lambda \in [0,1]. Coherent means all four.

Value at Risk (VaR) — definition

At confidence level α(0,1)\alpha \in (0, 1) (typically α=0.05\alpha = 0.05 or 0.010.01),

VaRα(X):=inf{xR:P(Xx)>α}=FX1(α),\text{VaR}_\alpha(X) := -\inf\{x \in \mathbb{R} : \mathbb{P}(X \le x) > \alpha\} = -F_X^{-1}(\alpha),

where FXF_X is the CDF of XX. In words: VaRα\text{VaR}_\alpha is the negative of the α\alpha-quantile of the P&L — the loss exceeded only with probability α\alpha.

Expected Shortfall (ES) — definition

ESα(X):=1α0αFX1(u)du=E[XXVaRα(X)](continuous case).\text{ES}_\alpha(X) := -\frac{1}{\alpha}\int_0^\alpha F_X^{-1}(u)\,du = -\mathbb{E}[X \mid X \le -\text{VaR}_\alpha(X)]\quad\text{(continuous case)}.
ES is the average loss in the worst α\alpha-fraction of cases — not the threshold of that worst fraction, but the expected loss once you're inside it.

Key properties

VaR is not subadditive

Claim. There exist X,YX, Y with VaRα(X+Y)>VaRα(X)+VaRα(Y)\text{VaR}_\alpha(X + Y) > \text{VaR}_\alpha(X) + \text{VaR}_\alpha(Y).
Proof sketch (canonical counter-example). Let XX and YY be the P&Ls of two independent defaultable bonds, each with face value 100100. Each bond defaults with probability 0.020.02 (and the bondholder loses 100100); otherwise the bondholder gains 55 in coupon. So individually P(X=5)=0.98\mathbb{P}(X = 5) = 0.98 and P(X=100)=0.02\mathbb{P}(X = -100) = 0.02.
At α=0.05\alpha = 0.05: VaR0.05(X)=5\text{VaR}_{0.05}(X) = -5 (negative of the 5%5\% quantile, which is still 55). The loss 100-100 occurs with probability only 2%<5%2\% < 5\%, so is not captured.

Now consider X+YX + Y with independent defaults. The probability that at least one defaults is 10.9820.0396<0.051 - 0.98^2 \approx 0.0396 < 0.05. So the 5%5\% worst case is "exactly one defaults," producing 100+5=95-100 + 5 = -95. Thus VaR0.05(X+Y)=95\text{VaR}_{0.05}(X+Y) = 95.

Compare: VaR0.05(X)+VaR0.05(Y)=5+5=10\text{VaR}_{0.05}(X) + \text{VaR}_{0.05}(Y) = -5 + -5 = -10.

So 95>1095 > -10 — diversifying into two bonds increases VaR by 105105. This violates (A4). \square

The pathology comes from VaR ignoring tail shape beyond the quantile. Anything heavier than the quantile contributes nothing to VaR; merging two such bets can push a previously hidden tail into the quantile range.

Expected Shortfall is coherent

Theorem (Acerbi-Tasche). ESα\text{ES}_\alpha is coherent.

The proof is easy on (A1)-(A3); subadditivity uses that ESα\text{ES}_\alpha has a dual representation (see below) as a supremum over a convex set of probability measures.

Dual representation

Theorem (Artzner et al.). Every coherent risk measure ρ\rho on a finite Ω\Omega admits a representation
ρ(X)=supQQEQ[X],\rho(X) = \sup_{\mathbb{Q} \in \mathcal{Q}} \mathbb{E}^\mathbb{Q}[-X],
where Q\mathcal{Q} is a convex set of probability measures absolutely continuous with respect to P\mathbb{P} (called the generating set or the set of scenarios).
Interpretation. A coherent risk measure is always a worst-case expected loss across a family of scenarios. VaR fails to have this representation because it is not a supremum over any convex scenario set — it's a quantile, which is discontinuous.

For Expected Shortfall, the generating set is explicit:

Qα={QP:dQ/dP1/α}.\mathcal{Q}_\alpha = \{\mathbb{Q} \ll \mathbb{P} : d\mathbb{Q}/d\mathbb{P} \le 1/\alpha\}.

This is why ES is sometimes called "the average of VaRs above the threshold" — it weights the tail probability uniformly on the worst α\alpha-fraction.

Worked example — portfolio of two assets

Let asset A return +10+10 with probability 0.90.9 and 50-50 with probability 0.10.1. Let asset B return +10+10 with probability 0.90.9 and 50-50 with probability 0.10.1, independently of A. Consider the equal-weight portfolio P=12(A+B)P = \tfrac12(A + B) at α=0.1\alpha = 0.1.
  • Individual VaR0.1(A)\text{VaR}_{0.1}(A). Quantile at 10%10\%: loss 5050 occurs with P=0.1\mathbb{P} = 0.1, so VaR0.1(A)=50\text{VaR}_{0.1}(A) = 50.
  • Individual VaR0.1(B)\text{VaR}_{0.1}(B). Same: 5050.
  • Portfolio distribution. Outcomes of PP: +10+10 (both up, prob 0.810.81), 20-20 (one up one down, prob 0.180.18), 50-50 (both down, prob 0.010.01). So P(P20)=0.19>0.1\mathbb{P}(P \le -20) = 0.19 > 0.1 but P(P50)=0.01<0.1\mathbb{P}(P \le -50) = 0.01 < 0.1, meaning the 10%10\% quantile is 20-20 and VaR0.1(P)=20\text{VaR}_{0.1}(P) = 20.
  • Subadditivity check for VaR. VaR0.1(A+B)=220=4050+50=100\text{VaR}_{0.1}(A+B) = 2 \cdot 20 = 40 \le 50 + 50 = 100. VaR happens to satisfy subadditivity here because the distributions are symmetric and the extreme tail is small enough.
  • ES0.1(A)\text{ES}_{0.1}(A) = average loss in worst 10% = 5050 (the entire worst decile is the loss event). Same for B: 5050.
  • ES0.1(P)\text{ES}_{0.1}(P). Worst 10% of PP consists of probability 0.01 on 50-50 and probability 0.09 on 20-20, so average =(0.0150+0.0920)/0.1=(0.5+1.8)/0.1=23= (0.01 \cdot 50 + 0.09 \cdot 20)/0.1 = (0.5 + 1.8)/0.1 = 23.
  • Subadditivity check for ES. ES0.1(A+B)=223=4650+50=100\text{ES}_{0.1}(A+B) = 2 \cdot 23 = 46 \le 50 + 50 = 100. \checkmark

Both measures diversify correctly here because the tail is not pathological. Change the probabilities to make the individual tails just barely under α\alpha, and VaR's subadditivity can flip.

Common confusions and pitfalls

  • "VaR is the expected loss." No — VaR is a quantile, not an expectation. ES is the conditional expectation in the tail.
  • "ES dominates VaR." ESα(X)VaRα(X)\text{ES}_\alpha(X) \ge \text{VaR}_\alpha(X) always, because ES averages the tail that begins at the VaR threshold.
  • Coherent ≠ good. Coherent is a minimum requirement; many other properties (comonotonic additivity, law invariance, elicitability) are additionally desirable. VaR is elicitable; ES is not — a technical cost that matters for backtesting.
  • ES at α=0.025\alpha = 0.025 ≈ VaR at α=0.01\alpha = 0.01. Under normality; the two are calibrated to roughly equal capital under Basel. Under fat tails the relationship breaks and ES is the conservative choice.
  • Empty scenarios. When fitting ES from a finite sample at small α\alpha, you might have only one or two sample losses in the tail. Careful estimation (e.g., extreme-value theory) is needed.

Where this goes next

  • Backtesting (Risk Models) — how to validate ES despite non-elicitability.
  • Risk Management — situates these measures in the regulatory and operational landscape.
  • Convex risk measures (weakening positive homogeneity) — the right framework when liquidity risk makes large positions non-linearly riskier than small ones.

Exercises

Test your understanding with 3 exercises for this lesson.