CONTENTS

Black-Scholes PDE Derivation

Motivation: why this matters in quant finance

There are two fundamentally different derivations of the Black-Scholes formula. The probabilistic route, already covered in the vault's main BS lesson, uses risk-neutral expectation: V0=erTEQ[payoff]V_0 = e^{-rT}\mathbb{E}^{\mathbb{Q}}[\text{payoff}]. This route generalises cleanly to Monte Carlo, path-dependent payoffs, and stochastic rates.
The PDE route — this note — derives Black-Scholes from a hedging argument that turns the option valuation problem into a second-order parabolic PDE:
Vt+12σ2S22VS2+rSVSrV=0.\frac{\partial V}{\partial t} + \tfrac{1}{2}\sigma^2 S^2\frac{\partial^2 V}{\partial S^2} + rS\frac{\partial V}{\partial S} - rV = 0.

This is the original (1973) Black-Scholes-Merton derivation. Its advantages:

  • Generalises to any European-style derivative with the same diffusion model: the PDE is derived once, and different boundary conditions give different products (call, put, cash-or-nothing, asset-or-nothing).
  • Natural for finite-difference numerical methods: American options, barrier options, and early-exercise features fit into PDE frameworks far more cleanly than into Monte Carlo.
  • Shows the hedging structure explicitly: the drift μ\mu disappears because the replicating argument eliminates it.

This note walks through the derivation and sets up the PDE for subsequent finite-difference lessons.

The informal idea

Construct a portfolio that is instantaneously riskless by combining the option with a short position in the right number of shares. If the portfolio is riskless, it must earn the risk-free rate (no-arbitrage). Compare the resulting expression with Itô's expansion of the option's dynamics, match coefficients, and out pops the PDE.

The key insight: a "riskless portfolio" means the dWdW term vanishes. To make that happen, you choose the hedge ratio Δ=V/S\Delta = \partial V/\partial S. Everything else — the dtdt part, time-decay, gamma — falls into the PDE.

Formal derivation

Setup

Assume the underlying follows geometric Brownian motion:

dS=μSdt+σSdW.dS = \mu S\,dt + \sigma S\,dW.

Let V(S,t)V(S, t) be the value of a derivative depending on SS and time tt. Assume VC2,1V \in C^{2,1} (twice differentiable in SS, once in tt).

Step 1: Itô expansion of VV

By Itô's lemma,

dV=Vtdt+VSdS+122VS2(dS)2.dV = \frac{\partial V}{\partial t}\,dt + \frac{\partial V}{\partial S}\,dS + \frac{1}{2}\frac{\partial^2 V}{\partial S^2}(dS)^2.

Using (dS)2=σ2S2dt(dS)^2 = \sigma^2 S^2\,dt:

dV=[Vt+μSVS+12σ2S22VS2]dt+σSVSdW.dV = \left[\frac{\partial V}{\partial t} + \mu S\frac{\partial V}{\partial S} + \tfrac{1}{2}\sigma^2 S^2\frac{\partial^2 V}{\partial S^2}\right]dt + \sigma S\frac{\partial V}{\partial S}\,dW.

Step 2: Construct the hedged portfolio

Let Πt:=VΔtS\Pi_t := V - \Delta_t S with Δt:=V/S\Delta_t := \partial V/\partial S (a function of S,tS, t). The dynamics:

dΠ=dVΔdS=[Vt+12σ2S22VS2]dt+[σSVSΔσS]=0dW.d\Pi = dV - \Delta\,dS = \left[\frac{\partial V}{\partial t} + \tfrac{1}{2}\sigma^2 S^2\frac{\partial^2 V}{\partial S^2}\right]dt + \underbrace{\left[\sigma S\frac{\partial V}{\partial S} - \Delta\sigma S\right]}_{=0}dW.

Choosing Δ=V/S\Delta = \partial V/\partial S kills the stochastic term. The portfolio is locally riskless.

Subtle technical point. The hedge ratio Δt\Delta_t changes over time, so dΠd\Pi should in principle include terms from dΔdSd\Delta \cdot dS and dΔd\Delta itself. The derivation above treats Δ\Delta as instantaneously fixed (so d(ΔS)=ΔdS+SdΔd(\Delta S) = \Delta\,dS + S\,d\Delta). The shortcut works here because we choose Δ\Delta after taking the differential, not before; the dΔd\Delta terms contribute only to rebalancing and are cancelled by self-financing. A rigorous version uses the self-financing condition explicitly.

Step 3: No-arbitrage requires Π\Pi to earn rr

A riskless portfolio must earn the risk-free rate (else arbitrage). So

dΠ=rΠdt=r(VΔS)dt.d\Pi = r\Pi\,dt = r(V - \Delta S)\,dt.

Equating the two expressions for dΠd\Pi:

Vt+12σ2S22VS2=r(VVSS).\frac{\partial V}{\partial t} + \tfrac{1}{2}\sigma^2 S^2\frac{\partial^2 V}{\partial S^2} = r\left(V - \frac{\partial V}{\partial S}S\right).

Rearranging:

Vt+12σ2S22VS2+rSVSrV=0.\boxed{\frac{\partial V}{\partial t} + \tfrac{1}{2}\sigma^2 S^2\frac{\partial^2 V}{\partial S^2} + rS\frac{\partial V}{\partial S} - rV = 0.}
This is the Black-Scholes PDE.

Step 4: Terminal and boundary conditions

The PDE applies for (S,t)(0,)×[0,T)(S, t) \in (0, \infty) \times [0, T). To pin down VV, supply boundary conditions at the boundaries of the domain.

Terminal: at t=Tt = T, the option value equals the payoff:
  • European call: V(S,T)=max(SK,0)V(S, T) = \max(S - K, 0).
  • European put: V(S,T)=max(KS,0)V(S, T) = \max(K - S, 0).
Asymptotic (as S0S \to 0):
  • Call: V(0,t)=0V(0, t) = 0 (a worthless stock gives a worthless call).
  • Put: V(0,t)=Ker(Tt)V(0, t) = Ke^{-r(T-t)} (deeply-ITM; exercise value minus discounting).
Asymptotic (as SS \to \infty):
  • Call: V(S,t)SKer(Tt)V(S, t) \to S - Ke^{-r(T-t)} (deep ITM; the call is nearly equivalent to a forward).
  • Put: V(S,t)0V(S, t) \to 0.

Step 5: The PDE admits the Black-Scholes formula as its unique solution (for the call)

Substitute V(S,t)=SΦ(d1)Ker(Tt)Φ(d2)V(S, t) = S\Phi(d_1) - Ke^{-r(T-t)}\Phi(d_2) into the PDE and verify. The verification is algebra-intensive but mechanical — use the derivatives Δ=Φ(d1)\Delta = \Phi(d_1), Γ=φ(d1)/(SσTt)\Gamma = \varphi(d_1)/(S\sigma\sqrt{T-t}), Θ=Sφ(d1)σ/(2Tt)rKer(Tt)Φ(d2)\Theta = -S\varphi(d_1)\sigma/(2\sqrt{T-t}) - rKe^{-r(T-t)}\Phi(d_2), and check that they satisfy Θ+12σ2S2Γ+rSΔrV=0\Theta + \tfrac12\sigma^2 S^2 \Gamma + rS\Delta - rV = 0. This is Exercise 2.

Key properties

  • The drift μ\mu does not appear. This is the signature Black-Scholes result: replication eliminates directional-drift dependence. Two assets with the same volatility but different drifts have the same option value — under Black-Scholes assumptions.
  • The PDE is parabolic. Second-order in SS, first-order in tt. Parabolic PDEs have a unique solution for the forward problem (given initial data) and also for the backward problem used here (given terminal data) thanks to time reversal.
  • Feynman-Kac connection. The same PDE arises from the Feynman-Kac formula applied to the risk-neutral expectation V=er(Tt)EQ[payoffSt]V = e^{-r(T-t)}\mathbb{E}^{\mathbb{Q}}[\text{payoff} \mid S_t]. So the PDE and probabilistic derivations are rigorously equivalent — two views of the same problem.
  • Generalisation. Dividend yield qq modifies to (rq)SV/S(r - q)S\partial V/\partial S. Time-varying rr and σ\sigma keep the PDE form but require numerical methods. American options become a free-boundary problem because early-exercise adds an inequality.

Transformation to the heat equation

Let τ=Tt\tau = T - t (time to expiry), x=lnSx = \ln S, and u(x,τ)=erτV(ex,Tτ)u(x, \tau) = e^{r\tau}V(e^x, T - \tau). Substituting into the Black-Scholes PDE:

uτ=12σ22ux2+(r12σ2)ux.\frac{\partial u}{\partial \tau} = \tfrac{1}{2}\sigma^2\frac{\partial^2 u}{\partial x^2} + (r - \tfrac{1}{2}\sigma^2)\frac{\partial u}{\partial x}.

A further change u=vexp(ax+bτ)u = v\cdot\exp(ax + b\tau) with suitable a,ba, b eliminates the drift term, reducing the PDE to the heat equation

vτ=12σ22vx2.\frac{\partial v}{\partial \tau} = \tfrac{1}{2}\sigma^2\frac{\partial^2 v}{\partial x^2}.

This is how Black, Scholes, and Merton originally derived the closed form: reduce to heat equation, use the known Gaussian fundamental solution.

Worked example — verifying the PDE for a call

Set r=0.05r = 0.05, σ=0.20\sigma = 0.20, T=1T = 1, S=100S = 100, K=100K = 100, t=0.5t = 0.5. Compute:

  • d1=[ln(S/K)+(r+σ2/2)(Tt)]/(σTt)=[0+(0.05+0.02)0.5]/(0.20.5)=0.035/0.1414=0.2475d_1 = \left[\ln(S/K) + (r + \sigma^2/2)(T-t)\right]/(\sigma\sqrt{T-t}) = [0 + (0.05 + 0.02)\cdot 0.5]/(0.2\sqrt{0.5}) = 0.035/0.1414 = 0.2475.
  • d2=d1σTt=0.24750.1414=0.1061d_2 = d_1 - \sigma\sqrt{T-t} = 0.2475 - 0.1414 = 0.1061.
  • Φ(d1)0.5977\Phi(d_1) \approx 0.5977, Φ(d2)0.5423\Phi(d_2) \approx 0.5423, φ(d1)0.3866\varphi(d_1) \approx 0.3866.

Greeks:

  • V=1000.5977100e0.0250.5423=59.7752.89=6.88V = 100 \cdot 0.5977 - 100 \cdot e^{-0.025}\cdot 0.5423 = 59.77 - 52.89 = 6.88.
  • Δ=0.5977\Delta = 0.5977.
  • Γ=0.3866/(1000.20.5)=0.3866/14.14=0.02734\Gamma = 0.3866/(100 \cdot 0.2 \cdot \sqrt{0.5}) = 0.3866/14.14 = 0.02734.
  • Θ=1000.38660.2/(20.5)0.05100e0.0250.5423=5.4692.644=8.113\Theta = -100 \cdot 0.3866 \cdot 0.2/(2\sqrt{0.5}) - 0.05 \cdot 100 \cdot e^{-0.025}\cdot 0.5423 = -5.469 - 2.644 = -8.113.

PDE check (per year):

Θ+12σ2S2Γ+rSΔrV\Theta + \tfrac12\sigma^2 S^2 \Gamma + rS\Delta - rV =8.113+12(0.04)(10000)(0.02734)+0.05(100)(0.5977)0.05(6.88)= -8.113 + \tfrac{1}{2}(0.04)(10000)(0.02734) + 0.05(100)(0.5977) - 0.05(6.88) =8.113+5.469+2.9890.3440.0010.= -8.113 + 5.469 + 2.989 - 0.344 \approx 0.001 \approx 0.

The PDE is satisfied (up to rounding). \checkmark

Common confusions and pitfalls

  • "The drift μ\mu disappears, so drift doesn't matter." It matters for the real-world probability of ITM-ness, for expected returns, for risk. What disappears is its role in the pricing — because the replication argument hedges away direction. A different μ\mu doesn't change the option's price but does change how much you actually expect to make trading it.
  • Risk-neutral Q\mathbb{Q} in PDE land. The PDE has no μ\mu because the argument implicitly evaluates expectations under Q\mathbb{Q}, where EQ[dS/S]=rdt\mathbb{E}^{\mathbb{Q}}[dS/S] = r\,dt. This is the same underlying change of measure, viewed PDE-style.
  • Only works for complete markets. The derivation assumes every option can be replicated by trading the underlying and cash. In incomplete markets (stochastic vol, jumps), no riskless hedge exists and the argument breaks — the PDE becomes an inequality (super-replication) or requires an extra pricing kernel.
  • PDE numerics require care at the boundaries. The Black-Scholes PDE is posed on S(0,)S \in (0, \infty). For finite-difference methods you truncate to S[0,Smax]S \in [0, S_{\max}] with SmaxS_{\max} chosen so the error at the boundary is small. Too-small SmaxS_{\max} destroys accuracy.
  • The argument is formal, not rigorous. The step "choose Δ\Delta after taking the differential" sweeps subtlety under the rug. A full self-financing derivation (via the stochastic integral) is cleaner but longer.

Where this goes next

Exercises

Test your understanding with 3 exercises for this lesson.