
Fair value estimation

Introduction

In this chapter we introduce techniques to estimate the fair value of a financial instrument exploiting the pricing information available in financial markets.

Fair value is the price at which a financial instrument is economically equivalent, at a given time and under a given information set, to its future cash flows or payoffs, once transaction costs, opportunity costs and risk are appropriately accounted for.

In this chapter we will focus on two conceptually different ways to estimate fair value. The first one assumes markets are efficient enough that prices observed in the market are our best estimate of fair value, bar some idiosyncratic components coming from trading frictions like liquidity premiums, dealer spreads, and transaction costs. In this case, fair value estimation can be seen as a filtering problem, and we will study a simple albeit powerful model to carry out this task: the Kalman filter.

When financial instruments are highly illiquid or do not trade directly in organized markets, fair value estimation must rely on economic—often referred to as fundamental—valuation models. These models infer the value of a financial instrument from the future cash flows specified by its contractual structure. Central to this approach is the concept of the time value of money, which states that a payment received in the future is not economically equivalent to the same payment received today, due to opportunity costs. As a result, future cash flows must be converted into present values through the application of a discount factor, which renders cash flows occurring at different points in time comparable.

Time discounting, however, is not the only challenge faced by fundamental valuation models. Future cash flows are frequently contingent on information that is not known at the valuation date, such as future prices of financial assets or macroeconomic variables. Examples include dividends paid by a company or the value of the underlying asset referenced by a derivative contract. A simple, albeit theoretically naive, approach consists in valuing the instrument as the expected value of its future cash flows, treated as random variables. This approach, however, fails to account for heterogeneity in investors’ risk preferences. Once cash flows are stochastic, the realized return on the investment becomes uncertain, and uncertainty is not valued equally by all investors. To address this limitation, one can adopt a utility indifference pricing framework, in which risk preferences explicitly enter the valuation.

In many situations, particularly for illiquid flow instruments, this is the most refined valuation approach available. However, for the specific case of derivative instruments, stronger theoretical results can be obtained under additional assumptions. As shown by Fischer Black, Myron Scholes, and Robert C. Merton in the 1970s, it is possible to construct dynamic replication portfolios that reproduce the payoffs of a derivative using traded instruments. In such settings, the fair value of the derivative becomes independent of investors’ risk aversion, since any deviation from this price would give rise to risk-free arbitrage opportunities. This insight leads to the arbitrage-free pricing framework, which will be introduced in the final section of this chapter.

Finally, a unifying pricing framework that can accommodate both the risk-aversion profile of the investor and the arbitrage-free constraints is the stochastic discount factor pricing framework Cochrane, 2005, which we will briefly describe at the end of the chapter.

Filtering models for fair value estimation

As mentioned in the introduction, for financial instruments that are relatively liquid, we can aim at extracting all the pricing information from price indications and trades in the market, without having to resort to economic theories of fair value. In this setup, we take as the fair value the price that market participants are willing to pay.

The issue, though, is that price indications and trades cannot themselves be considered pure observations of fair value, since they might be affected by market frictions: bid-ask spreads, particularities of the negotiation mechanism, liquidity fluctuations, specific needs of market participants at a given time, etc. When instruments trade in limit order books, a popular estimate of the fair value is the mid-price, the arithmetic average of the best bid and ask. However, if bid-ask spreads are wide or liquidity is thin in the first levels, such an estimate is not necessarily very precise. Trades provide a lot of information, since they are real transactions and not indications of interest; the larger they are, in principle, the more information they carry. Still, they are subject to the aforementioned market frictions, which reduce their reliability.

This makes all these price observations noisy estimates of the fair value, so if we want to estimate fair value from them we need to be able to separate the signal from the noise, or in other words, filter the observations. This is precisely what, under certain model assumptions, a Kalman filter does.

The Kalman filter was introduced in the chapter on Bayesian Theory. It is a Bayesian filtering algorithm that allows us to perform exact inference, i.e. to compute the closed-form distribution of the latent state vector in a Linear Gaussian State Space Model (LG-SSM).

Recall that a State Space Model (SSM) describes dynamic systems where we have a non- or partially observable state, a vector $\mathbf{x}$, whose dynamics in time is described by a so-called transition equation:

$$\mathbf{x}_{t+\Delta t} = f(\mathbf{x}_t, \mathbf{u}_t) + \boldsymbol{\epsilon}_t$$

where $f(\mathbf{x}_t, \mathbf{u}_t)$ is a general function, $\mathbf{u}_t$ are inputs (or controls) that affect the dynamics, $\Delta t$ is the time-step between observations, and $\boldsymbol{\epsilon}_t$ is a transition noise with a given distribution. The state is observed indirectly via a proxy vector $\mathbf{y}$ through the observation equation:

$$\mathbf{y}_t = g(\mathbf{x}_t, \mathbf{u}_t) + \boldsymbol{\eta}_t$$

where $g(\mathbf{x}_t, \mathbf{u}_t)$ is another general function and $\boldsymbol{\eta}_t$ the observation noise, meaning that observations carry a degree of uncertainty with respect to the latent state.

A Linear Gaussian State Space Model (LG-SSM) is a specific case of the SSM where both the transition and observation functions are linear and the noise terms are Gaussian. In this case, we can use the Kalman filter algorithm to compute the distribution of the state vector at any time, given the observations and the transition and observation models. If some or all of the parameters of these models are not known, they can be estimated using standard techniques like Maximum Likelihood Estimation (MLE), or Expectation Maximization (EM) when the former becomes computationally intractable due to the latent state vector.
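As a concrete illustration of this machinery, the following is a minimal sketch of the Kalman filter recursion for a generic LG-SSM; the matrix names (`F`, `H`, `Q`, `R`) and the function signature are our own choices for illustration, not notation fixed by the text:

```python
import numpy as np

def kalman_filter(y, F, H, Q, R, m0, P0):
    """Exact filtering in a Linear Gaussian SSM:
    x_{t+1} = F x_t + eps,  eps ~ N(0, Q)
    y_t     = H x_t + eta,  eta ~ N(0, R)
    Returns the filtered means and covariances of the latent state."""
    m, P = m0, P0
    means, covs = [], []
    for yt in y:
        # predict step: propagate the state distribution through the dynamics
        m = F @ m
        P = F @ P @ F.T + Q
        # update step: incorporate the new observation
        S = H @ P @ H.T + R             # innovation covariance
        K = P @ H.T @ np.linalg.inv(S)  # Kalman gain
        m = m + K @ (yt - H @ m)
        P = (np.eye(len(m)) - K @ H) @ P
        means.append(m.copy())
        covs.append(P.copy())
    return np.array(means), np.array(covs)
```

In the scalar case with `F = H = 1`, this reduces to the local level filter used throughout the rest of the section.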

For non-linear or non-Gaussian models, there are extensions of the Kalman filter that can be used, such as the Extended Kalman Filter and the Unscented Kalman Filter.

A simple pricing model

Let us consider a simple setup where we aim to infer the distribution of the fair value $m_t$ of a financial instrument that follows a random walk:

$$m_{t+\Delta t} = m_t + \epsilon_t, \qquad \epsilon_t \sim N(0, \sigma_\epsilon^2 \Delta t)$$

We don’t observe this fair value directly, only trades, which we consider noisy observations of it since they include transaction costs and potentially other external factors like dealer inventory positions:

$$p_t = m_t + \nu_t, \qquad \nu_t \sim N(0, \sigma_\nu^2)$$

Readers will recognize that this is the local level model discussed extensively in the chapter on Bayesian Modelling. For the observation noise we can introduce prior business knowledge about the confidence we have in trade observations as a source of pricing information. In his book Option Trading (Sinclair, 2010), Euan Sinclair describes a simple model that quantifies the information provided by a trade based on its size, $v$:

$$\sigma_\nu(v) = \sigma_p \left(\frac{v_\text{max}}{v} - 1\right)^+$$

where $\sigma_p$ is a baseline observation noise and $v_\text{max}$ is an input to the model: the trade size we believe saturates the information provided, in the sense that our mid estimate will essentially move to the price of the trade. In contrast, trades of small size, $v \ll v_\text{max}$, will have $\sigma_\nu(v) \rightarrow \infty$ and will provide negligible pricing information. An alternative simple model is:

$$\sigma_\nu(v) = \sigma_p \frac{v_0}{v}$$

where $v_0$ in this case is a size scale that separates the regimes where the information provided by the trade is negligible, $v \ll v_0$, or relevant, $v \gg v_0$, but it does not saturate at a specific trading size, as in Sinclair's model. Of course, nothing prevents us from using more prior business knowledge to enrich the observation model with other observable characteristics of the trade or the wider market context.
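The two observation-noise models above can be sketched as simple functions; the names `sigma_sinclair` and `sigma_power` are ours, chosen purely for illustration:

```python
import numpy as np

def sigma_sinclair(v, sigma_p, v_max):
    # Sinclair's model: noise vanishes for v >= v_max (fully informative trade)
    # and grows without bound for small trade sizes.
    return sigma_p * np.maximum(v_max / v - 1.0, 0.0)

def sigma_power(v, sigma_p, v0):
    # Alternative 1/v model: no saturation; v0 sets the scale at which
    # trades start to carry relevant pricing information.
    return sigma_p * v0 / v
```

For example, in Sinclair's model a trade of exactly `v_max` is treated as a noiseless observation, while in the power model the noise keeps shrinking as the trade size grows.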

Estimation of the simple pricing model

As discussed in Bayesian Modelling, the standard way to estimate the parameters of a Kalman Filter is using the Expectation Maximization (EM) algorithm, suitable for probabilistic models with latent variables. However, the properties of the simple pricing model can be exploited to obtain closed-form estimators for its parameters using moment matching.

Let us start by working with a model where observation errors have no dependency on the volume: $\sigma_\nu(v) = \sigma_\nu$. The key is to compute statistics of the first differences of trade prices:

$$d_t \equiv p_{t+\Delta t} - p_t = (m_{t+\Delta t} - m_t) + (\nu_{t+\Delta t} - \nu_t) = \epsilon_t + (\nu_{t+\Delta t} - \nu_t)$$

which depend only on observed trades. First we compute the variance:

$$\text{Var}[d_t] = \text{Var}[\epsilon_t] + \text{Var}[\nu_{t+\Delta t} - \nu_t] = \sigma_\epsilon^2 \Delta t + 2\sigma_\nu^2$$

where we have used that $\epsilon_t$, $\nu_{t+\Delta t}$ and $\nu_t$ are independent random variables. This expression links the variance of the first differences in trade prices with the parameters to estimate. We need, though, a second expression to solve for each parameter separately. For that we compute the lag-1 auto-covariance of $d_t$:

$$\text{Cov}[d_t, d_{t-\Delta t}] = \text{Cov}[\epsilon_t + (\nu_{t+\Delta t} - \nu_t),\, \epsilon_{t-\Delta t} + (\nu_t - \nu_{t-\Delta t})] = -\text{Var}[\nu_t] = -\sigma_\nu^2$$

where again we have used the independence between the noise terms. Our estimators then read:

$$\hat{\sigma}_\epsilon^2 \Delta t = \frac{1}{N-1} \sum_{i=1}^{N-1} d_{t_i}^2 - 2\hat{\sigma}_\nu^2$$
$$\hat{\sigma}_\nu^2 = -\frac{1}{N-2} \sum_{i=2}^{N-1} d_{t_i} d_{t_i - \Delta t}$$
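A quick numerical check of these moment-matching estimators on synthetic data; the true parameter values ($\sigma_\epsilon = 0.01$, $\sigma_\nu = 0.05$, $\Delta t = 1$) are assumptions of ours, chosen purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
N, dt = 1_000_000, 1.0
sigma_eps, sigma_nu = 0.01, 0.05   # assumed "true" parameters of the simulation

# latent random walk fair value and noisy trade observations
m = np.cumsum(rng.normal(0.0, sigma_eps * np.sqrt(dt), N))
p = m + rng.normal(0.0, sigma_nu, N)

d = np.diff(p)                     # first differences of trade prices
var_d = np.mean(d**2)              # ~ sigma_eps^2 dt + 2 sigma_nu^2
cov1 = np.mean(d[1:] * d[:-1])     # ~ -sigma_nu^2

sigma_nu_hat = np.sqrt(-cov1)
sigma_eps_hat = np.sqrt((var_d + 2.0 * cov1) / dt)
```

The negative lag-1 autocovariance isolates the observation noise, and the remainder of the variance of the differences identifies the random walk volatility.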

The results have an interesting interpretation:

In the case of volume dependent observation errors, we can still compute these statistics, which now read:

$$\text{Var}[d_t] = \sigma_\epsilon^2 \Delta t + \sigma_\nu^2(v_{t+\Delta t}) + \sigma_\nu^2(v_t)$$
$$\text{Cov}[d_t, d_{t-\Delta t}] = -\sigma_\nu^2(v_t)$$

The statistical estimator of the variance of the first differences can still be used, by accounting for the variability of the error with volume (heteroskedasticity):

$$E\left[\frac{1}{N-1} \sum_{i=1}^{N-1} d_{t_i}^2\right] = \frac{1}{N-1} \sum_{i=1}^{N-1} \left(\sigma_\epsilon^2 \Delta t + \sigma_\nu^2(v_{t_i+\Delta t}) + \sigma_\nu^2(v_{t_i})\right) = \sigma_\epsilon^2 \Delta t + \frac{1}{N-1} \sum_{i=1}^{N-1} \left(\sigma_\nu^2(v_{t_i+\Delta t}) + \sigma_\nu^2(v_{t_i})\right)$$

Moving to the lag-1 covariance, we have:

$$E\left[\frac{1}{N-2} \sum_{i=2}^{N-1} d_{t_i} d_{t_i - \Delta t}\right] = -\frac{1}{N-2} \sum_{i=2}^{N-1} \sigma_\nu^2(v_{t_i})$$

As long as the volume dependency has a single parameter to fit, we can still use these two equations to solve for the parameters. If we use, for instance, the simple model $\sigma_\nu(v) = \sigma_p \frac{v_0}{v}$, where $v_0$ is given and $\sigma_p$ is to be estimated from the data (notice that we could simply estimate the product $\sigma_p v_0$; the factorization is useful for business interpretation):

$$E\left[\frac{1}{N-1} \sum_{i=1}^{N-1} d_{t_i}^2\right] = \sigma_\epsilon^2 \Delta t + \frac{\sigma_p^2}{N-1} \sum_{i=1}^{N-1} \left(\frac{v_0^2}{v_{t_i+\Delta t}^2} + \frac{v_0^2}{v_{t_i}^2}\right)$$
$$E\left[\frac{1}{N-2} \sum_{i=2}^{N-1} d_{t_i} d_{t_i - \Delta t}\right] = -\frac{\sigma_p^2}{N-2} \sum_{i=2}^{N-1} \frac{v_0^2}{v_{t_i}^2}$$

In this simple case, the estimation of the parameters $\sigma_p$ and $\sigma_\epsilon$ is straightforward. More complex functions, like the one from Sinclair, cannot be estimated with only two moments. Further moments can be computed to provide extra equations, although at this point it might be worthwhile to resort to standard estimation techniques like EM, if available.

Inference on the simple pricing model

We can use the general Kalman filter equations described in Bayesian Modelling to derive the distribution of our fair value at the next time $t + \Delta t$ at which a trade happens.

The Kalman filter algorithm operates sequentially over observations, applying two steps: the predict step, where we compute the distribution of the fair value based purely on the random walk model, and the update step, in which we incorporate the information provided by the observation of a new trade. We denote by $m_{t+\Delta t}^t$ the fair value at $t+\Delta t$ before observing the trade, and by $m_{t+\Delta t}^{t+\Delta t}$ the fair value afterwards.

Let us apply first the predict step. The distribution of $m_{t+\Delta t}^t$ is Gaussian with mean and variance given by:

$$\bar{m}_{t+\Delta t}^t = \bar{m}_t^t$$
$$(\sigma_{m,t+\Delta t}^t)^2 = (\sigma_{m,t}^t)^2 + \sigma_\epsilon^2 \Delta t$$

Since our model uses drift-less random walk dynamics for the evolution of the fair value, the predicted mean does not change and the variance increases proportionally to the time step $\Delta t$.

Now we use the update step to incorporate the information from a trade happening at $t + \Delta t$:

$$\bar{m}_{t+\Delta t}^{t+\Delta t} = \bar{m}_{t+\Delta t}^t + K_t (p_{t+\Delta t} - \bar{m}_{t+\Delta t}^t)$$
$$(\sigma_{m,t+\Delta t}^{t+\Delta t})^2 = \frac{(\sigma_{m,t+\Delta t}^t)^2 \sigma_\eta^2}{(\sigma_{m,t+\Delta t}^t)^2 + \sigma_\eta^2}$$

where $K_t$ is the Kalman gain, given by:

$$K_t = \frac{(\sigma_{m,t+\Delta t}^t)^2}{(\sigma_{m,t+\Delta t}^t)^2 + \sigma_\eta^2}$$

The updated mean is an interpolation between the predicted mean and the trade observation, weighted by the Kalman gain.

If we look at the new standard deviation, we also find similar limiting behaviors:

One interesting consequence of the optimality of the Kalman filter is that the updated standard deviation cannot be larger than the predicted one, and for any finite $\sigma_\eta$ it is always smaller: the information from the trade always improves our estimation of the fair value. This is easily seen by writing:

$$\sigma_{m,t+\Delta t}^{t+\Delta t} = \sigma_{m,t+\Delta t}^t \frac{1}{\sqrt{\left(\frac{\sigma_{m,t+\Delta t}^t}{\sigma_\eta}\right)^2 + 1}}$$

Since $\frac{\sigma_{m,t+\Delta t}^t}{\sigma_\eta}$ is non-negative, the denominator is never lower than 1.
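A minimal sketch of one predict-update cycle for this local level model; the function names and example numbers are ours, chosen for illustration:

```python
def predict(m, var, sigma_eps2, dt):
    # drift-less random walk: mean unchanged, variance grows linearly in time
    return m, var + sigma_eps2 * dt

def update(m, var, p, sigma_eta2):
    # incorporate a trade price p observed with variance sigma_eta2
    K = var / (var + sigma_eta2)           # Kalman gain, in [0, 1)
    m_new = m + K * (p - m)                # interpolation prior <-> observation
    var_new = var * sigma_eta2 / (var + sigma_eta2)
    return m_new, var_new
```

Note that `var_new` is strictly smaller than both the predicted variance and the observation variance, in line with the argument above.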

In some applications of the local level model to pricing we might also be interested in the Kalman smoothing algorithm. Recall that the difference with the Kalman filtering we have just seen is that in smoothing we estimate the latent variable using all the available data, including the future. Of course this means Kalman smoothing does not make sense for online price inference, but there are other applications of this pricing model where using the best estimation of the latent fair value is relevant:

Multiple observations of the same instrument

In many real pricing situations, we might have different sources that reveal information about the fair value of an instrument, for example trades on different platforms, information from composites, or pricing information derived indirectly from trading indicators like the hit&miss. We will explore those sources in detail later. If they happen asynchronously, we can just use the simple pricing model introduced in the previous section, adjusting the observation error depending on the pricing source.

If they happen synchronously, though, we need to extend the Kalman filter to cope with simultaneous observations. This requires changing the observation model to a system of equations:

$$p_{t,i} = m_t + \nu_{t,i}, \qquad \nu_{t,i} \sim N(0, \sigma_{\nu,i}^2)$$

We can compute the Kalman gain in this case, which is now a vector:

$$K_{t,i} = \frac{(\sigma_{m,t+\Delta t}^t)^2 / \sigma_{\nu,i}^2}{1 + (\sigma_{m,t+\Delta t}^t)^2 \Lambda}$$

where:

$$\Lambda = \sum_{i=1}^n \frac{1}{\sigma_{\nu,i}^2}$$

is the total observation precision. Notice that, as a sanity check, in the case of a single observation we recover the Kalman gain derived in the previous section. For multiple observations, the update equation then reads:

$$\bar{m}_{t+\Delta t}^{t+\Delta t} = \bar{m}_{t+\Delta t}^t + \sum_{i=1}^n K_{t,i} (p_{t+\Delta t,i} - \bar{m}_{t+\Delta t}^t)$$

The relative influence of each observation depends on its precision relative to the total precision, with noisier observations having a smaller effect on the update.
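A sketch of the synchronous multi-observation update, written in the precision form implied by the equations above; the helper name `update_multi` is ours:

```python
import numpy as np

def update_multi(m, var, prices, sigma_nu2):
    """Precision-weighted fusion of n synchronous observations of one instrument.
    prices and sigma_nu2 are arrays of the observed prices and their variances."""
    Lam = np.sum(1.0 / sigma_nu2)                # total observation precision
    K = (var / sigma_nu2) / (1.0 + var * Lam)    # per-observation Kalman gains
    m_new = m + np.sum(K * (prices - m))
    var_new = var / (1.0 + var * Lam)            # posterior variance shrinks with Lam
    return m_new, var_new
```

With a single observation this reduces to the scalar gain $(\sigma_{m}^2)/(\sigma_{m}^2 + \sigma_\nu^2)$ of the previous section, and each extra synchronous observation further reduces the posterior variance.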

Multiple correlated instruments

The Kalman filter model for pricing becomes even more relevant when we include information from other financial instruments that are historically correlated with the one whose fair value we are estimating. Typical situations are:

The simple pricing model we have analyzed so far can be easily extended to include information from a set of N instruments. Notice that in this case what we are actually doing is estimating the fair values of all the instruments in the set, not necessarily only the one of interest. The evolution of the fair values is now modelled using:

$$\vec{m}_{t+\Delta t} = \vec{m}_t + \vec{\epsilon}_t, \qquad \vec{\epsilon}_t \sim N(\vec{0}, \Sigma_\epsilon \Delta t)$$

where $\Sigma_\epsilon$ is now a covariance matrix that takes into account the effect of correlations in the price movements. Price observations follow the model:

$$\vec{p}_t = \vec{m}_t + \vec{\nu}_t, \qquad \vec{\nu}_t \sim N(\vec{0}, \Sigma_\nu)$$

In this case, since we are already modelling correlations at the fair value level, a typical choice is to take $\Sigma_\nu$ diagonal, i.e. $\Sigma_\nu = \text{diag}(\sigma_{\nu,1}^2, ..., \sigma_{\nu,N}^2)$, although in certain setups one might want to include some form of bid-ask spread correlation between instruments.

With this model specification, we can directly use the filtering, smoothing and EM equations discussed in the Bayesian Modelling chapter. Let us though specifically focus on the case of $N=2$ instruments, where we can work out the Kalman filter equations in detail to get further insight into the model's inner workings.

The predict step in the Kalman filter is given by the following equations:

$$\vec{m}_{t|t-1} = \vec{m}_{t-1|t-1}$$
$$\Sigma_{t|t-1} = \Sigma_{t-1|t-1} + \Sigma_\epsilon \Delta t$$

which are relatively simple, as expected. The update step equations are more interesting, since they include the effect of observations and, critically, the impact of one instrument's trades on the fair value of the other:

$$\vec{m}_{t|t} = \vec{m}_{t|t-1} + K_t (\vec{p}_t - \vec{m}_{t|t-1})$$
$$\Sigma_{t|t} = (I - K_t) \Sigma_{t|t-1}$$

with the Kalman gain being:

$$K_t = \Sigma_{t|t-1} (\Sigma_{t|t-1} + \Sigma_\nu)^{-1}$$

For the two instrument case, the Kalman gain can be expanded into:

$$K_t = \frac{1}{(\sigma_{\nu,1}^2 + (\sigma_{1,t}^{t-1})^2)(\sigma_{\nu,2}^2 + (\sigma_{2,t}^{t-1})^2) - (\rho_t^{t-1}\sigma_{1,t}^{t-1}\sigma_{2,t}^{t-1})^2} \begin{pmatrix} (\sigma_{1,t}^{t-1})^2 \sigma_{\nu,2}^2 + (\sigma_{1,t}^{t-1})^2(\sigma_{2,t}^{t-1})^2(1-(\rho_t^{t-1})^2) & \rho_t^{t-1} \sigma_{1,t}^{t-1}\sigma_{2,t}^{t-1}\sigma_{\nu,1}^2 \\ \rho_t^{t-1} \sigma_{1,t}^{t-1}\sigma_{2,t}^{t-1}\sigma_{\nu,2}^2 & (\sigma_{2,t}^{t-1})^2\sigma_{\nu,1}^2 + (\sigma_{1,t}^{t-1})^2(\sigma_{2,t}^{t-1})^2(1-(\rho_t^{t-1})^2) \end{pmatrix}$$
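As a sanity check, we can verify numerically that the expanded expression coincides with the matrix formula $K_t = \Sigma_{t|t-1}(\Sigma_{t|t-1} + \Sigma_\nu)^{-1}$; the parameter values below are illustrative choices of ours:

```python
import numpy as np

# illustrative predicted std devs, correlation and observation noises
s1, s2, rho = 0.03, 0.04, 0.9
sn1, sn2 = 0.02, 0.05

Sigma = np.array([[s1**2, rho * s1 * s2],
                  [rho * s1 * s2, s2**2]])    # predicted covariance
Sigma_nu = np.diag([sn1**2, sn2**2])          # diagonal observation covariance

# matrix form of the Kalman gain
K_matrix = Sigma @ np.linalg.inv(Sigma + Sigma_nu)

# expanded two-instrument form, entry by entry
det = (sn1**2 + s1**2) * (sn2**2 + s2**2) - (rho * s1 * s2)**2
K_explicit = np.array([
    [s1**2 * sn2**2 + s1**2 * s2**2 * (1 - rho**2), rho * s1 * s2 * sn1**2],
    [rho * s1 * s2 * sn2**2, s2**2 * sn1**2 + s1**2 * s2**2 * (1 - rho**2)],
]) / det
```

The off-diagonal entries of the gain are what propagate a trade in one instrument into the fair value update of the other; they vanish when $\rho = 0$.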

Let us analyze some particular cases:

Let’s evaluate this model in one of the typical scenarios for fair value discovery discussed above: two correlated instruments traded in markets with periods of non-overlapping trading. The objective is to leverage their correlation to estimate fair values for the instrument whose market is closed. The underlying principle is that new information affecting the price of the actively traded instrument during its market hours would similarly impact the closed-market instrument, if it were tradable.

For that, we first generate synthetic fair values using a correlated Brownian motion with $\rho = 0.9$, $\sigma_1 = 5 \times 10^{-4}$ and $\sigma_2 = 4 \times 10^{-4}$. Then we generate trades over 22 days, each day consisting of 60 time-steps to keep the simulation efficient. We consider three situations: one in which only the first instrument is traded, one in which only the second instrument is traded, and a third one in which both are simultaneously traded. We use a diagonal observation covariance to generate the trades, i.e. we assume that there is no correlation between the spreads with respect to the fair value, so correlation is driven exclusively by fair value correlations. To generate the trades, we use standard deviations in the observation covariance of 0.032 and 0.045, respectively. Then we use Expectation Maximization (EM) over the first half of the synthetic trade data to estimate the parameters of the model, and run the Kalman filter over the second half of the data to compare the fair value estimates with the true simulated values. The results can be seen in the following figure:


Figure 1: Estimation of the fair value of instruments when their market is closed, using information from correlated instruments that trade at those times. The results are based on a simulation in which the fair values are generated first (blue lines) and trades (blue dots) are simulated when the market is open, which happens half of the day. Notice that for a third of the day both instruments trade simultaneously. The orange lines are the fair values estimated using the Kalman filter, which is trained on half of the data using EM and then run over the second half. The figure focuses on four days of test data.

As we see, the Kalman filter successfully exploits the correlation between instruments to update the fair value of the instruments when the market is closed. The updates are not perfect but they capture the overnight trends, improving over typical baselines like the closing price of the instrument. In general, the true fair values lie within one standard deviation of the estimates, depicted as the shaded grey area in the figure.

Pricing sources and models

When it comes to feeding the Kalman filter with information to improve the estimates of the fair value, there is a variety of sources that are typically used. Let us discuss the most common ones: those based on Limit Order Book (LOB) data and those based on Request for Quote (RfQ) data. As we have seen, these sources can be used whether they belong to the instrument of interest or to correlated ones. However, there are some considerations to take into account when using correlated instruments, which we discuss at the end.

One point that will become a common theme in this section is the information content of the observations. Intuitively, not every source of pricing is equally informative. For example, as we discussed above, we expect that trades with larger volumes are more informative than small trades. In the same way, a cancellation of a limit order deep in a Limit Order Book might not carry meaningful new pricing information. These effects need to be incorporated case by case into the pricing model.

LOB data

Limit Order Books (LOBs) are complex structures that contain extensive pricing information. Nevertheless, as discussed in Chapter Market microstructure, the primary references for price discovery are the best bid and ask quotes, the mid-price, defined as the average of these two, and the prices of executed trades. To improve robustness and reduce the impact of potential market manipulation, it is standard practice to compute volume-weighted averages of the first K levels on both the bid and ask sides, and to define a robust mid-price based on these averages.

Mid-price information

As mentioned above, a robust mid-price indicator in a LOB is:

$$P_{\text{mid},t}^{\text{LOB}} = \frac{1}{2}\left(\frac{\sum_{i=1}^K v_{b,i} P_{b,i}}{\sum_{i=1}^K v_{b,i}} + \frac{\sum_{i=1}^K v_{a,i} P_{a,i}}{\sum_{i=1}^K v_{a,i}}\right)$$

where $v_{b/a,i}$ and $P_{b/a,i}$ are volumes and prices of bid and ask levels, respectively, and $K$ is the number of levels taken into account, for instance $K = 4$. We have omitted the time subscript in volumes and prices, but they are also time dependent. Similarly, we can compute a robust bid-ask spread in the form:

$$S_t^{\text{LOB}} = \frac{\sum_{i=1}^K v_{a,i} P_{a,i}}{\sum_{i=1}^K v_{a,i}} - \frac{\sum_{i=1}^K v_{b,i} P_{b,i}}{\sum_{i=1}^K v_{b,i}}$$

which has to be positive, since any bid and ask limit orders with the same price are automatically matched in the LOB. With this information, we can build a simple model of fair value based on LOB information:

$$m_t^{\text{LOB}} \sim N(P_{\text{mid},t}^{\text{LOB}}, (S_t^{\text{LOB}})^2)$$
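A minimal sketch of these two robust LOB indicators; the helper name and the example book below are ours:

```python
import numpy as np

def robust_mid_spread(bid_p, bid_v, ask_p, ask_v, K=4):
    """Volume-weighted averages of the first K levels on each side of the book,
    returning the robust mid-price and robust bid-ask spread."""
    vb = np.average(bid_p[:K], weights=bid_v[:K])  # weighted bid side
    va = np.average(ask_p[:K], weights=ask_v[:K])  # weighted ask side
    return 0.5 * (vb + va), va - vb
```

For example, a two-level book with bids at 99 and 98 and asks at 101 and 102, all with unit volume, yields a robust mid of 100 and a robust spread of 3.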

The choice of a Gaussian distribution is for simplicity, and it captures well our subjective view of the pricing information that the LOB contains: as we saw in the chapter on Bayesian Modelling, if we only identify the mean and variance as constraints, the maximum entropy distribution associated with them is the Gaussian distribution. Admittedly, there is an extra constraint in the form of positivity of the fair value, but for prices far from the zero boundary we can safely omit it. A more grounded criticism of this model is that the Gaussian distribution allows the fair value to lie beyond the best bid and ask prices, and those prices are tradeable: if the market consensus of fair value were beyond those prices, market participants would be willing to trade at them until the fair value lay between the bid and ask. However, since liquidity might be thin at the first levels, and therefore those prices might not be available for bulk transactions, we consider a soft constraint more appropriate, even when we use the robust bid-ask spread instead of the best bid and ask spread.

Of course, in many situations this pricing source might not be sufficient for actual applications, since in illiquid markets the bid-ask spreads are large and therefore the pricing source has a large uncertainty. Those situations are precisely where the Kalman filter model plays a part.

Using $m_t^{\text{LOB}}$ directly as an observation in the Kalman filter is problematic. The feed is available in streaming, so even if we limit the updates of the Kalman filter to the moments in which the LOB changes (i.e. arrival of new orders, cancellations, modifications), the pricing observations might jam the Kalman filter estimation, making the model consider $m_t^{\text{LOB}}$ a perfect source of pricing information. To see this, let us examine the effect of a second observation following an initial one. Recall that the first predict plus update steps gave:

$$\bar{m}_{t+\Delta t}^{t+\Delta t} = \bar{m}_t^t + K_t (p_{t+\Delta t} - \bar{m}_t^t)$$
$$(\sigma_{m,t+\Delta t}^{t+\Delta t})^2 = \frac{((\sigma_{m,t}^t)^2 + \sigma_\epsilon^2 \Delta t)\,\sigma_\eta^2}{(\sigma_{m,t}^t)^2 + \sigma_\epsilon^2 \Delta t + \sigma_\eta^2}$$

with:

$$K_t = \frac{(\sigma_{m,t}^t)^2 + \sigma_\epsilon^2 \Delta t}{(\sigma_{m,t}^t)^2 + \sigma_\epsilon^2 \Delta t + \sigma_\eta^2}$$

If we approach the limit $\Delta t \rightarrow 0$, this simplifies to:

$$\bar{m}_{t+\Delta t}^{t+\Delta t} = \bar{m}_t^t + K_t (p_{t+\Delta t} - \bar{m}_t^t)$$
$$(\sigma_{m,t+\Delta t}^{t+\Delta t})^2 = \frac{(\sigma_{m,t}^t)^2 \sigma_\eta^2}{(\sigma_{m,t}^t)^2 + \sigma_\eta^2}$$
$$K_t = \frac{(\sigma_{m,t}^t)^2}{(\sigma_{m,t}^t)^2 + \sigma_\eta^2}$$

Let us apply a second predict plus update step on top of this, keeping the $\Delta t \rightarrow 0$ limit:

$$\begin{aligned} \bar{m}_{t+2\Delta t}^{t+2\Delta t} &= \bar{m}_t^t + K_t (p_{t+\Delta t} - \bar{m}_t^t) + K_{t+\Delta t} (p_{t+2\Delta t} - \bar{m}_{t+\Delta t}^{t+\Delta t}) \\ &= \bar{m}_t^t \left(1 - K_t(1 - K_{t+\Delta t}) - K_{t+\Delta t}\right) + K_t(1 - K_{t+\Delta t})\, p_{t+\Delta t} + K_{t+\Delta t}\, p_{t+2\Delta t} \end{aligned}$$
$$(\sigma_{m,t+2\Delta t}^{t+2\Delta t})^2 = \frac{(\sigma_{m,t}^t)^2 \sigma_\eta^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2}$$

Since:

$$K_{t+\Delta t} = \frac{(\sigma_{m,t+\Delta t}^{t+\Delta t})^2}{(\sigma_{m,t+\Delta t}^{t+\Delta t})^2 + \sigma_\eta^2} = \frac{(\sigma_{m,t}^t)^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2}$$
$$1 - K_{t+\Delta t} = \frac{(\sigma_{m,t}^t)^2 + \sigma_\eta^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2}$$
$$K_t (1 - K_{t+\Delta t}) = \frac{(\sigma_{m,t}^t)^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2} = K_{t+\Delta t}$$

then:

$$\bar{m}_{t+2\Delta t}^{t+2\Delta t} = (1 - 2K_{t+\Delta t})\,\bar{m}_t^t + K_{t+\Delta t}(p_{t+\Delta t} + p_{t+2\Delta t}) = \frac{\sigma_\eta^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2}\,\bar{m}_t^t + \frac{(\sigma_{m,t}^t)^2}{2(\sigma_{m,t}^t)^2 + \sigma_\eta^2}(p_{t+\Delta t} + p_{t+2\Delta t})$$

If we continue applying $n$ predict-plus-update steps in the $\Delta t \rightarrow 0$ limit, the induction principle gives the following result:

$$\bar{m}_{t+n\Delta t}^{t+n\Delta t} = \frac{\sigma_\eta^2}{n (\sigma_{m,t}^t)^2 + \sigma_\eta^2}\, \bar{m}_{t}^{t} + \frac{(\sigma_{m,t}^t)^2}{n (\sigma_{m,t}^t)^2 + \sigma_\eta^2} \sum_{i=1}^n p_{t+i \Delta t}$$
$$(\sigma_{m,t+n\Delta t}^{t+n\Delta t})^2 = \frac{(\sigma_{m,t}^{t})^2 \sigma_\eta^2}{n (\sigma_{m,t}^{t})^2 + \sigma_\eta^2}$$

If we now take the limit $n \rightarrow \infty$ we converge to:

$$\bar{m}_{t+n\Delta t}^{t+n\Delta t} \rightarrow \frac{1}{n} \sum_{i=1}^n p_{t+i \Delta t}$$
$$(\sigma_{m,t+n\Delta t}^{t+n\Delta t})^2 \rightarrow 0$$

Therefore, when LOB updates arrive at high frequency but do not provide significant new pricing information, as is the case for liquid instruments, the Kalman filter will converge to the mid-price with zero uncertainty.
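This convergence can be checked numerically. Below is a minimal Python sketch (all parameter values are hypothetical) of the $\Delta t \rightarrow 0$ update recursion: the posterior mean collapses onto the running sample mean of the observed prices, while the posterior variance shrinks to zero.

```python
import numpy as np

rng = np.random.default_rng(0)

true_mid = 100.0
sigma_eta = 0.5          # observation noise (assumed)
m, var_m = 95.0, 4.0     # prior mean and variance (assumed)

prices = true_mid + sigma_eta * rng.standard_normal(5000)

for p in prices:
    K = var_m / (var_m + sigma_eta**2)                     # Kalman gain
    m = m + K * (p - m)                                    # posterior mean
    var_m = var_m * sigma_eta**2 / (var_m + sigma_eta**2)  # posterior variance

# With no process noise, the filter collapses onto the running sample mean,
# the prior's weight vanishing as 1/n, and the posterior variance goes to zero
print(abs(m - prices.mean()))
print(var_m)
```

The absence of a process-noise term in the predict step is exactly what makes the prior weight vanish; adding one would keep the gain bounded away from zero.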

A simple way to fix this issue is to estimate the Kalman filter on other pricing information (trades, RfQs, see next section) and combine it with the LOB mid-price as two separate fair value estimations at any given time. The best linear combination of two estimators is the one that minimizes the variance:

$$\hat{m}_t = \alpha m_t^{\text{LOB}} + (1-\alpha) m_t^{\text{kalman}}$$
$$\text{Var}(\hat{m}_t) = \alpha^2 (S_t^{\text{LOB}})^2 + (1-\alpha)^2 (\sigma_{m,t}^t)^2$$

which is achieved for:

$$\hat{\alpha} = \frac{(\sigma_{m,t}^t)^2}{(S_t^{\text{LOB}})^2 + (\sigma_{m,t}^t)^2}$$

Notice that we have treated them as independent estimators; otherwise there would be an extra term accounting for the correlation. Therefore, the best linear estimator is:

$$\hat{m}_t = \frac{(\sigma_{m,t}^t)^2}{(S_t^{\text{LOB}})^2 + (\sigma_{m,t}^t)^2}\, m_t^{\text{LOB}} + \frac{(S_t^{\text{LOB}})^2}{(S_t^{\text{LOB}})^2 + (\sigma_{m,t}^t)^2}\, m_t^{\text{kalman}}$$

This makes intuitive sense: the smaller the relative error of one estimator compared to the other, the more weight the combined estimate assigns to it. Importantly, the resulting variance is lower than that of either individual estimator.

$$\text{Var}(\hat{m}_t) = \frac{(S_t^{\text{LOB}})^2(\sigma_{m,t}^t)^2}{(S_t^{\text{LOB}})^2 + (\sigma_{m,t}^t)^2}$$

To check this, simply take the ratio of the final variance to the variance of either of the independent predictors, for example:

$$\frac{\text{Var}(\hat{m}_t)}{(S_t^{\text{LOB}})^2} = \frac{(\sigma_{m,t}^t)^2}{(S_t^{\text{LOB}})^2 + (\sigma_{m,t}^t)^2} \leq 1$$

with equality when $S_t^{\text{LOB}} = 0$, i.e. when it was already a perfect predictor and there is, therefore, no way to improve the prediction.

Notice that, since the combined fair value estimator is reconstructed independently at each point in time, without relying on past estimates, it is not affected by the high-frequency sampling issue observed when using the LOB mid-price as a Kalman filter observation.
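As a quick numerical illustration, here is a minimal Python sketch of the optimal combination above; the price and spread values are hypothetical.

```python
def combine(m_lob, s_lob, m_kal, s_kal):
    """Minimum-variance linear combination of two independent estimators.

    m_lob, s_lob: LOB mid-price and its half-spread error.
    m_kal, s_kal: Kalman-filter fair value and its posterior std.
    """
    alpha = s_kal**2 / (s_lob**2 + s_kal**2)     # weight on the LOB mid
    m_hat = alpha * m_lob + (1 - alpha) * m_kal
    var_hat = (s_lob**2 * s_kal**2) / (s_lob**2 + s_kal**2)
    return m_hat, var_hat

# Hypothetical numbers: wide LOB spread, tighter trade-based Kalman estimate,
# so the combination leans toward the Kalman value
m_hat, var_hat = combine(m_lob=100.10, s_lob=0.20, m_kal=100.00, s_kal=0.05)
print(m_hat, var_hat)
```

Note that the combined variance is below both input variances, as the text states.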

Trades

Trades happening in the LOB are a valuable source of pricing information, since they correspond to real transaction prices and not mere interests to trade in the form of limit orders. When a trade happens, the exchange publicly reports the time, the size and the price, but not the parties or the orders involved. The latter is particularly relevant since a key piece of pricing information is the side (buy or sell) of the aggressive order, meaning the one that consumed the liquidity in the order book. As we discussed in Market microstructure, this can typically be a market order or a limit order at a price that is equal to or better than the prices available on the opposite side. Reverse engineering the side from a trade is an inference problem, and requires a model. A widely used simple one is the so-called tick-rule model, which consists in comparing the price of the trade with the mid-price available in the order book just before the trade: trades above the mid are classified as buyer-initiated, and trades below the mid as seller-initiated.

This model is not perfect, however. It does not account for hidden liquidity that might exist at more favorable prices than those displayed, which would alter the reference mid-price and, consequently, the trade classification logic. Moreover, it assumes a sequential processing of orders, whereas in practice, multiple orders may arrive simultaneously. In such cases, the exchange’s internal matching engine determines execution priority and order pairing through mechanisms that are not observable externally, meaning the apparent sequence of trades and quote updates in public data may not reflect the true matching process. This lack of transparency can lead to misclassification when applying the tick rule or similar models.
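A minimal sketch of such a tick-rule classifier might look as follows; the fallback to the previous trade's sign for trades exactly at the mid is one common convention, assumed here rather than prescribed by the text.

```python
def tick_rule(trade_price, mid_before, last_sign=0):
    """Classify the aggressor side of a trade from the pre-trade mid-price.

    Returns +1 (buyer-initiated) or -1 (seller-initiated); trades at the mid
    fall back on the previous classification (the 'tick test' convention).
    """
    if trade_price > mid_before:
        return +1
    if trade_price < mid_before:
        return -1
    return last_sign

print(tick_rule(100.02, 100.00))       # +1: buy aggressor
print(tick_rule(99.98, 100.00))        # -1: sell aggressor
print(tick_rule(100.00, 100.00, -1))   # -1: falls back on previous sign
```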

Once we have the relevant pricing information for the trade, namely time, side, size and of course price, it can be used to update the current fair value estimation of the financial instrument. The size is useful to include some measure of the information content of the order, as discussed in the simple pricing model section when introducing the Sinclair model: intuitively, a very small trade should not be as relevant as a large one when updating our fair value estimation. Sinclair proposes to model a trade observation as a Gaussian random variable ${\mathcal N}(P_t^{\text{trade}}, \sigma^2(v))$, where $P_t^{\text{trade}}$ is the observed trade price at time $t$, and $\sigma(v)$ is given by:

$$\sigma(v) = \sigma_p \left(\frac{v_{\text{max}}}{v}-1\right)^+$$

where $\sigma_p$ is a constant to be estimated, and $v_{\text{max}}$ is an exogenous parameter that provides a typical scale above which trades are considered informative. It can be given by prior business knowledge or estimated from the statistical distribution of trade sizes. Notice that this error has the desirable property of becoming zero at the scale $v_{\text{max}}$ and above, i.e. trades above this scale are considered maximally informative and the fair value is instantaneously updated to this value. It also becomes infinitely large as the volume tends to zero, which makes the Kalman filter essentially ignore those observations.

Sinclair’s model is not the only way to introduce this behavior into the model. Other choices of $\sigma(v)$ are also valid. For example, the model:

$$\sigma(v) = \sigma_0 \frac{\exp(-\frac{v}{v_0})}{1+ \exp(-\frac{v}{v_0})}$$

might be more realistic in the sense that volume adjusts the degree of information but small trades are never completely ignored: for them, the error tends to $\sigma_0/2$. Another alternative, one that never completely agrees with trades of large size, is the following:

$$\sigma(v) = \sigma_{\min} + (\sigma_0 - \sigma_{\min})\, e^{-\frac{v}{v_0}}$$

which tends to $\sigma_{\min}$ as $v \rightarrow \infty$.
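The three observation-error profiles can be sketched side by side as follows (parameter names are illustrative):

```python
import numpy as np

def sigma_sinclair(v, sigma_p, v_max):
    # Sinclair: zero error at and above v_max, diverging as v -> 0
    return sigma_p * np.maximum(v_max / v - 1.0, 0.0)

def sigma_logistic(v, sigma_0, v_0):
    # logistic variant: tends to sigma_0/2 for tiny trades, to 0 for large ones
    return sigma_0 * np.exp(-v / v_0) / (1.0 + np.exp(-v / v_0))

def sigma_floor(v, sigma_0, sigma_min, v_0):
    # floored variant: never fully trusts large trades (floor at sigma_min)
    return sigma_min + (sigma_0 - sigma_min) * np.exp(-v / v_0)

print(sigma_sinclair(10.0, 1.0, 10.0))   # 0.0 at v = v_max
print(sigma_logistic(1e-9, 1.0, 5.0))    # ~0.5 = sigma_0 / 2
print(sigma_floor(1e6, 1.0, 0.1, 5.0))   # ~0.1 = sigma_min
```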

So far we have not used the side information inferred from the tick-rule model. There are different ways this information can be factored into the pricing. In markets that are quite unbalanced, the observation of a trade on the opposite side to where the market is prevalently trading might be considered more informative. This means potentially adjusting the error function with the side information. Another alternative is to build separate Kalman filters for buy and sell information, which are then combined into a final fair value estimation using the model discussed in the previous section for optimally combining two predictors, although in this case neglecting the correlation between bid and ask estimations might not be a solid modelling choice. We leave it as an exercise to the reader to derive the optimal linear predictor with correlation.

RfQ traded

We have discussed the Request for Quote (RfQ) protocol in the chapter on Market Microstructure. In terms of pricing information, the main difference with LOBs is the asymmetry in information between the different participants in the process: dealers and clients. Let us focus on the case of negotiation via Multi-Dealer-to-Client platforms, which has the richest set of cases, from the point of view of the dealer, who is typically the one actively trying to calculate the fair value of the instruments. The pricing information that the dealer receives is the following:

Apart from the pricing information gained via the trading platform, in order to improve transparency and price formation, there are private and public initiatives to share pricing information post-trade, particularly in the bond markets:

Composites

Modelling composites is similar to modelling mid-prices from order books, since they share the same information structure, a continuous feed of bid and ask prices, except that for composites these are indicative prices. Again, given that the frequent updates in the feed might not carry new pricing information, it makes sense to model a composite as an independent fair value source that can then be aggregated with other estimations, like those using Kalman filters on trades and other information.

The model is therefore:

$$m_t^{\text{comp}} \sim N(P_{\text{mid},t}^{\text{comp}}, (S_t^{\text{comp}})^2)$$

with:

$$P_{\text{mid},t}^{\text{comp}} = \frac{1}{2}(a_t^{\text{comp}} + b_t^{\text{comp}})$$
$$S_t^{\text{comp}} = \frac{1}{2}(a_t^{\text{comp}} - b_t^{\text{comp}})$$

where $b_t^{\text{comp}}$ and $a_t^{\text{comp}}$ are, respectively, the bid and ask composite prices published by the platform.

RfQs

The information from RfQs depends, as discussed above, on the final status of the dealer in the process. If the dealer wins the RfQ, the observation corresponds to a trade, and the modelization is overall similar to the one discussed in the context of trades in the LOB. There is, though, one key difference with the case of LOBs: the cover price is also reported to the dealer.

Intuitively, the closer the cover and the trade price, the more confidence we might put on the trade price as a pricing source, since we have the agreement of a second dealer quoting a similar level. Or, put differently, if the dealer wins the trade at a price far from the cover price, it might signal a clear mispricing that needs to be adjusted, not a reliable pricing source. One way to incorporate this into the model is to make the observation error a function of the distance to the cover:

$$\sigma(v, d_t^{\text{cover}}) = \sigma(v)\, g(d_t^{\text{cover}})$$

where $d_t^{\text{cover}} = |P_t^{\text{trade}} - P_t^{\text{cover}}|$ is the distance to the cover, and $g(\cdot)$ a modulating function with a minimum at zero.

A second case is the one in which the dealer misses the RfQ, but her price was the second best quoted. The fact that, among the dealers quoting competitively, the quoted price was the second best carries significant information that can be used to adjust the fair value. One way to do this is to fit a probabilistic model that predicts the distance to cover based on features $f_i$ available in the negotiation, for example a simple linear regression model on $d_t^{\text{cover}}$:

$$d_t^{\text{cover}} = \sum_i w_i f_i + \epsilon$$

where $w_i$ are the regression weights and $\epsilon \sim N(0, \sigma_d^2)$. Alternatively, if we want to explicitly model that the distance to cover cannot be negative, we can use a log transformation, although then we will need a non-linear Kalman filter to account for this observation. Assuming the simple linear regression model, we can add the observation to the Kalman filter:

$$P_t^{\text{inf, cover}} \sim N(P_t^{\text{cover}} + \text{side} \cdot E[d_t^{\text{cover}}|\{f_i\}],\, \sigma_d^2)$$

where $\text{side} = \pm 1$ depending on the side of the RfQ (buy or sell).

A third case happens when the client trades and the dealer misses the RfQ, but her quote was not even the second best (the cover). The information in this setup is much weaker, in the form of a bound on the traded price (the quoted price). Although, in principle, a probabilistic inference on the traded price could be performed using this information, the potential model risk coming from the plethora of required model assumptions typically outweighs the information gains, so we will not dive deeper here.

Hit & Miss

Another source of pricing information comes from analyzing patterns in the dealer’s own trading information. In particular, it is useful to analyze the hit & miss, i.e. the ratio of won RfQs to the total traded by the client with any dealer (i.e. excluding those where the client did not trade at all), over a certain window of time. Intuitively, a dealer with an abnormally high or low hit & miss might be using an incorrect fair value estimation when quoting. Of course, the tricky question here is to assess what the normal levels of hit & miss are, which has to take into account the influence of factors like inventory levels on the spreads quoted: if a dealer has relatively high inventory holdings, she is likely to skew quotes to reduce inventory risk, hence the hit & miss will be higher if more quotes are, overall, in the direction of risk reduction.

If the dealer has built a model that estimates the probability of winning RfQs, and the model is well calibrated, it can be evaluated over the same windows of RfQs as the empirical hit & miss (H&M). If we compute the latter at time $t$ using the last $n$ RfQs, it is given by:

$$\text{H\&M}_t = \frac{1}{n} \sum_{i=1}^n 1_{\text{win}_i}$$

where $\text{win}_i$ is an abbreviation of the condition $\text{status}(i) = \text{win}$. For a well calibrated model, this empirical hit & miss should be close to the one expected by the model:

$$E[\text{H\&M}_t] = \frac{1}{n} \sum_{i=1}^n P(\text{win}_i|\mathcal{F}_{t_i})$$

Here, $\mathcal{F}_{t_i}$ are the filtrations at the request time $t_i$ of the i-th RfQ, which include the spreads $\delta_i$ quoted by the dealer:

$$\delta_i = \text{side}\, (P_{i} - m_{t_i})$$

with $P_{i}$ the price quoted for the i-th RfQ and $m_{t_i}$ the estimation of the fair value at time $t_i$.

If we consider the RfQs independent of each other, the variance of the hit & miss provides us with a scale for the natural variability of our estimation versus the empirical hit & miss:

$$\text{Var}[\text{H\&M}_t] = \frac{1}{n^2} \sum_{i=1}^n P(\text{win}_i|\mathcal{F}_{t_i}) \left(1-P(\text{win}_i|\mathcal{F}_{t_i})\right)$$

Intuitively, then, if we observe a persistent deviation between $\text{H\&M}_t$ and $E[\text{H\&M}_t]$ that cannot be explained by the expected variability from the randomness of the RfQ process, it can be attributed to a potential bias in the estimation of $m_t$, which becomes a further pricing source for our fair value estimation model. Further modelization is required, though, to inject this information into the Kalman filter, which goes beyond the scope of this book. For us, it suffices to point out how hit & miss deviations from the target can potentially be converted into pricing information.
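A minimal sketch of this calibration check, assuming a well calibrated win-probability model is available, computes the empirical hit & miss, its model expectation and a z-score for the deviation (the sample numbers below are hypothetical):

```python
import numpy as np

def hit_miss_zscore(wins, p_model):
    """Compare empirical hit & miss against a calibrated win-probability model.

    wins: 0/1 outcomes of the last n traded RfQs.
    p_model: model win probabilities for the same RfQs.
    Returns (empirical H&M, expected H&M, z-score of the deviation).
    """
    wins, p_model = np.asarray(wins, float), np.asarray(p_model, float)
    n = len(wins)
    hm_emp = wins.mean()
    hm_exp = p_model.mean()
    # variance of the mean of independent Bernoulli outcomes
    var_hm = (p_model * (1.0 - p_model)).sum() / n**2
    z = (hm_emp - hm_exp) / np.sqrt(var_hm)
    return hm_emp, hm_exp, z

hm_emp, hm_exp, z = hit_miss_zscore([1, 0, 1, 0, 1], [0.6, 0.4, 0.7, 0.5, 0.6])
print(hm_emp, hm_exp, z)   # small |z|: deviation within natural variability
```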

Correlated instruments

As we discussed above, if a set of instruments exhibit historical price correlations and we have reasons to believe those are structural correlations, i.e. they will continue to hold going forward, we can do a joint estimation of the fair values using a multivariate Kalman filter. This way, information about the price of one instrument can be used to improve the estimation of the others. The pricing sources for these instruments can be any of those discussed previously.

There are some caveats though to take into account when using this source of pricing information:

Fundamental models for fair value estimation

Fundamental models estimate the fair value of a financial instrument by analysing the value of its future cash-flows. Recall from our introductory chapter on Financial Markets that financial instruments are essentially contracts that promise to pay back funds to the investor that purchases them, under conditions specified in the contract.

The present value theory of fair value

Even the simplest financial instrument, a promise to pay back a deterministic amount of money at a fixed future date, requires some theoretical hypotheses to estimate its fair value. The basic idea is that a unit of currency received today is worth more than the same unit received in the future, because it can be invested in the interim. Consider a risk-free deposit that pays a deterministic interest rate $r$. An amount of one unit invested today grows to $(1+r)^T$ units after $T$ periods, as long as interest payments are reinvested at the same rate (compound interest). Conversely, receiving one unit in $T$ periods is economically equivalent to receiving $1/(1+r)^T$ units today. This opportunity cost argument implies that any future cash flow must be discounted by the factor $1/(1+r)^T$ to make it comparable with cash today. Such economic consideration is referred to as the time value of money.

Formally, for an investment that delivers a future cash flow $C_T$ at time $T$, the present value $PV$ satisfies

$$PV = \frac{C_T}{(1+r)^T}$$

where $r$ is the per-period interest rate, compounded once per period. The factor $1/(1+r)^T$ is called the discount factor, since it is used to discount future cash-flows. The present value becomes our fair value estimation within this framework.

A typical hypothesis that provides useful mathematical simplifications is that of continuously accrued interest rates. This is also a good approximation for real situations where interest is paid daily, for instance in money market funds. Consider an account that pays interest every time period of size $\Delta$. The interest paid is $r \Delta$ over a unitary notional. If we reinvest the interest, at time $T$ we have accumulated $(1+ r \Delta)^{T/\Delta}$. If we now take the limit $\Delta \rightarrow 0$:

$$\lim_{\Delta \rightarrow 0} (1+ r \Delta)^{T/\Delta} = \lim_{\Delta \rightarrow 0} e^{\frac{T}{\Delta} \log (1 + r \Delta)} = e^{rT}$$

This gives us the expression for the discount factor with continuously paid interest rates, given by the inverse, $e^{-rT}$. Under this approximation, the present value of our simple financial instrument becomes:

$$PV = e^{-rT} C_T$$

Exponential functions provide a lot of mathematical simplifications, hence the usefulness of this limit, particularly when applied to the computation of fair value for more complex financial instruments.
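The continuous-compounding limit can be verified numerically; for example, with hypothetical values $r = 3\%$ and $T = 2$:

```python
import math

r, T = 0.03, 2.0
for delta in (1.0, 1/12, 1/365, 1/100000):
    growth = (1.0 + r * delta) ** (T / delta)   # compounding every delta
    print(delta, growth)

# continuous-compounding limit e^{rT}, approached as delta -> 0
print(math.exp(r * T))
```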

If we also include the price paid for the financial instrument, say we initially pay $C_0$, we define the net present value (NPV) as:

$$NPV = e^{-rT} C_T - C_0$$

Notice that, therefore, a rational investor will only be willing to invest in this financial instrument if $C_0 \leq e^{-rT} C_T$; otherwise its NPV would be negative.

We can now generalize this expression to a financial instrument that pays a stream of deterministic cash-flows $C_{t_i}$ at times $t_i$, $i = 1, \dots, N$. This is the case, for example, of standard government bonds issued by most countries. The fair value given by the present value then becomes:

$$PV = \sum_{i=1}^N e^{-r t_i} C_{t_i}$$
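A minimal sketch of this formula, applied to a hypothetical 3-year bond paying a 5% annual coupon on a notional of 100 under a flat continuously compounded rate:

```python
import math

def present_value(cashflows, r):
    """PV of deterministic cash flows under continuous compounding.

    cashflows: iterable of (t_i, C_i) pairs.
    r: flat continuously compounded discount rate.
    """
    return sum(c * math.exp(-r * t) for t, c in cashflows)

# Hypothetical bond: 5% annual coupon, principal repaid at maturity
bond = [(1, 5.0), (2, 5.0), (3, 105.0)]
print(present_value(bond, 0.03))   # below par: rate discounting bites
print(present_value(bond, 0.0))    # with r = 0, PV is just the sum: 115.0
```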

What happens if these future cash-flows are contingent on information not known at the present day? For instance, we can have bonds that pay floating interest rates depending on reference rates that are not fixed until a future date. Shares pay dividends contingent on the financial results of the corporation that issues them. And derivatives are financial instruments whose value depends on the future price of an underlying instrument, hence the name “derivative”. A simple, naive extension of the theory of present value would treat these future cash-flows as random variables, replacing their future value by an expectation:

$$PV_0 = {\mathbb E}\left[\sum_{i=1}^N e^{-r t_i} C_{t_i}\,\middle|\,F_0\right] \equiv {\mathbb E}_0 \left[\sum_{i=1}^N e^{-r t_i} C_{t_i}\right]$$

where we have conditioned the expectation on the information available at the time of estimation, the filtration $F_0$. If discount factors are not themselves stochastic (an assumption we will revisit in the last section of this chapter), this becomes:

$$PV_0 = \sum_{i=1}^N e^{-r t_i}\, {\mathbb E}_0 \left[C_{t_i}\right]$$

The issue with this approach is that, once future cash-flows become uncertain, their expected value is a point estimate that neglects the rest of the potential scenarios that can happen. Recall from our discussion in chapter Bayesian Modelling that, in terms of decision theory (and fair value estimation is in the end linked to the decision to buy or sell a financial instrument), representing a random variable by its expected value makes sense when the investor penalizes estimation errors using a square loss function. The behavior of real rational investors, though, exhibits a more asymmetric loss function, in which potential extra gains are generally valued less than equivalent potential losses. This kind of behavior is better captured by using utility functions, as we will discuss in the next section.

The utility indifference theory of fair value estimation

To ground the discussion in another example, let us consider a specific case of a future uncertain cash-flow whose value depends on the price of another instrument at the time of payment, $S_T$, for example a stock. The cash-flow is therefore $C_T = f(S_T)$. Notice that this is a specific case of a derivative contract. The function $f(S_T)$ is called the pay-off of the derivative, the instrument whose price is $S_T$ is called the underlying of the derivative, and the time $T$ is the expiry date of the derivative. Apart from the value $S_T$, the pay-off can also depend on other parameters that are deterministic. For instance, for a European call option we have $C_T = (S_T-K)^+$, where $K$ is called the strike of the option. A European put option has a payoff $C_T = (K-S_T)^+$. There are also American options, which can be exercised before the expiry date, so that the exercise time becomes itself a random variable.

This derivative pays in the future a quantity that is contingent on the future value of the underlying, whose value is known today but is uncertain in the future. To get an estimation, we need to use probability theory to put some bounds on our uncertainty, so we characterize $S_T$ by a distribution function $g(S_T)$. As we saw in chapter Stochastic Calculus, a popular model that allows us to compute such a future distribution is the random walk, or the geometric random walk, the latter being a natural choice for prices that cannot be negative. In those cases the future distribution can be computed in closed form: a normal distribution in the first case and a log-normal distribution in the second, e.g. in the case of stocks (although for short expiries the plain random walk can also be a good model for stocks). These models sometimes give us closed-form solutions, but more realistic models that better capture the empirical distributions of prices can be used.

As mentioned in the previous section, we cannot just value this cash-flow using the expected value of the pay-off, since that would ignore the risk profile of the investor. As discussed in more detail in chapter Stochastic optimal control, utility functions provide a mathematical formalism that allows us to capture realistic risk behaviors. Utility describes the value that the cash-flows derived from the financial instrument have for the investor. Typical utility functions exhibit marginally decreasing utility for increasingly larger cash-flows. In situations where cash-flows are random variables, such behavior models investors that are risk averse, meaning that they need to be compensated increasingly more to take on extra risk.

To apply the utility function framework to the problem of fair pricing, we need to compute expected utilities to characterize the value that the investor places on the contract. Using an exponential utility function for simplicity, this means:

$$\mathbb{E}_t[U] = 1- \mathbb{E}_t\left[e^{-\gamma_i \left(e^{-r(T-t)}f(S_T)\right)}\right]$$

where $\gamma_i$ is the risk aversion coefficient of the investor. We have discounted the payoff at $T$ by the discount factor $e^{-r(T-t)}$ in order to account for the time value of money, as discussed in the previous section, although now, for generality, we consider an initial time $t$.

The fair value in this formalism is the so-called premium of the derivative, denoted $C_t$, that the investor is willing to pay (or be paid, depending on the pay-off function) to enter into the derivative contract today. This changes the utility calculation, since it needs to take the premium into account:

$$\mathbb{E}_t[U] = 1- \mathbb{E}_t\left[e^{-\gamma_i \left(e^{-r(T-t)}f(S_T) - C_t\right)}\right]$$

When modelling rational risk-averse agents with utility functions, we model their decisions as those that maximize expected utility. However, in this case that criterion cannot be used to compute the premium, since the premium that maximizes the investor’s utility is, naturally, $C_t = -\infty$! The problem is, of course, that it does not take into account the utility maximization of the dealer selling the derivative, who would not enter into the contract at this premium. The same framework can be used to model the dealer’s payoff, which is the reverse of the investor’s, albeit with a different risk aversion, $\gamma_d$:

$$\mathbb{E}_t[U] = 1- \mathbb{E}_t\left[e^{-\gamma_d \left(C_t - e^{-r(T-t)}f(S_T)\right)}\right]$$

but even introducing the dealer’s utility function, how could we compute the value of the premium?

For the answer, we first need to frame the problem in other terms: what is the maximum premium that the investor would be willing to pay to enter into the contract? Since not entering into the contract implies a zero payoff with total certainty, whose expected utility in this framework is 0, we can argue that the investor is willing to buy the derivative as long as the premium makes him/her better off, i.e. $\mathbb{E}_t[U] > 0$. For the value of the premium such that $\mathbb{E}_t[U] = 0$, the investor is indifferent between buying and not buying. This value of the premium is called the reservation price or the utility indifference price of the investor. Of course, the same computation can be done for the dealer, obtaining a different reservation price. An agreement will only happen if the maximum premium that the investor is willing to pay is above the minimum premium that the dealer is willing to receive.

Let us first see the problem from the dealer’s point of view. In real situations, it is typically the investor who comes to the dealer and requests a price for the derivative. The minimum premium that the dealer would be willing to accept to provide the contract, i.e. the reservation price of the dealer, is the one that solves:

$$1- \mathbb{E}_t\left[e^{-\gamma_d \left(C_t - e^{-r(T-t)}f(S_T)\right)}\right] = 0$$

We can now obtain a general expression for the premium:

$$C_t^d = \frac{1}{\gamma_d} \log \mathbb{E}_t\left[e^{\gamma_d e^{-r(T-t)}f(S_T)}\right] = \frac{1}{\gamma_d} \log \int dS_T\, g(S_T)\, e^{\gamma_d e^{-r(T-t)}f(S_T)}$$

For dealers that have zero risk aversion, i.e. that are risk neutral, by taking the limit $\gamma_d \rightarrow 0$ we get:

$$C_{t}(0) = \mathbb{E}_t\left[e^{-r(T-t)}f(S_T)\right] = \int dS_T\, g(S_T)\, e^{-r(T-t)}f(S_T)$$

And for small, but positive risk aversion:

$$C_t^d = C_{t}(0) + \frac{\gamma_d}{2}\,\text{Var}_t\left[e^{-r(T-t)}f(S_T)\right] + O(\gamma_d^2)$$

Deriving the same expression for an investor, we get:

$$C_t^i = -\frac{1}{\gamma_i} \log \mathbb{E}_t\left[e^{-\gamma_i e^{-r(T-t)}f(S_T)}\right] = -\frac{1}{\gamma_i} \log \int dS_T\, g(S_T)\, e^{-\gamma_i e^{-r(T-t)}f(S_T)}$$

If the investor has a small but positive risk aversion:

$$C_t^i = C_{t}(0) - \frac{\gamma_i}{2}\,\text{Var}_t\left[e^{-r(T-t)}f(S_T)\right] + O(\gamma_i^2)$$

We see immediately that $C_t^i \leq C_t^d$, so there is only agreement if both investor and dealer are risk neutral, or at least one of them is risk prone, which is not a normal situation. Therefore, according to this theory of pricing, there would be no trading of derivatives! However, we know empirically that this is not the case. So what was wrong in our theory? We will see that the dealer is not simply taking the opposite bet to the investor, and therefore we need to modify this analysis. Before that, though, let us see particular examples of the computation of the premium for investors.
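The ordering of investor and dealer reservation prices around the risk-neutral value can be checked by Monte Carlo on any distribution of discounted payoffs. The sketch below uses a hypothetical lognormal payoff and a common risk aversion of 0.5; only the ordering matters, not the specific numbers.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical draws of the discounted payoff e^{-r(T-t)} f(S_T)
X = np.exp(0.1 * rng.standard_normal(200_000))

def dealer_price(x, gamma):
    # C^d = (1/gamma) log E[exp(+gamma X)]
    return np.log(np.mean(np.exp(gamma * x))) / gamma

def investor_price(x, gamma):
    # C^i = -(1/gamma) log E[exp(-gamma X)]
    return -np.log(np.mean(np.exp(-gamma * x))) / gamma

c0 = X.mean()                 # risk-neutral value C_t(0)
cd = dealer_price(X, 0.5)
ci = investor_price(X, 0.5)
print(ci, c0, cd)             # ci <= c0 <= cd: no trade between risk-averse parties
```

The inequalities follow from Jensen's inequality applied to the convex exponential, which is exactly why two risk-averse parties cannot agree in this framework.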

Example: pricing of a simple contingent claim

A contingent claim is a contract that pays off only under the realization of an uncertain event. Many derivative contracts, like options, are contingent claims. The simplest contingent claim pays 1\$ under the realization of a specific uncertain event, and zero in all other cases. These contingent claims are called Arrow-Debreu securities, and are of theoretical interest since we could in principle decompose any contingent claim as a linear combination of these securities. Therefore, if we know the prices (premiums) of the Arrow-Debreu securities, we can price any contingent claim. We say in this case that we have a complete market, where we can trade instruments linked to any future state of the market.

For our purposes, though, we just want to discuss a simple example of reservation prices. Let us consider a contingent claim in which the dealer pays the investor 1\$ if we get heads when tossing a fair coin in the present. In our framework, the underlying now is the side of the coin, heads or tails, with probabilities $p_H = p_T = 1/2$. We also set $T = t$ since we toss the coin in the present. The reservation price for the investor then reads:

$$C_t^i = -\frac{1}{\gamma_i} \log \left(\frac{1}{2}e^{-\gamma_i} + \frac{1}{2}\right) = \frac{1}{2} - \frac{1}{\gamma_i}\log \cosh\left(\frac{\gamma_i}{2}\right)$$

For a risk-neutral investor, taking $\gamma_i \rightarrow 0$, we get simply $C_t^i = 1/2$, which makes sense: the investor is willing to pay 0.5\$ to make the game fair, or in other terms, to make the expected value of the game zero. A fully risk averse investor, for whom $\gamma_i \rightarrow \infty$, has $C_t^i = 0$, i.e. is only willing to buy the contract when there is a guarantee of no losses under any scenario. In the middle, the premium lies between those two values: the investor is willing to pay more than 0\$ to trade, as long as the payoff is skewed in her favor.
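The closed-form reservation price and its two limits can be checked directly (a minimal sketch):

```python
import math

def coin_claim_price(gamma):
    """Investor reservation price for a claim paying 1 on heads (p = 1/2)."""
    if gamma == 0.0:
        return 0.5   # risk-neutral limit
    return -math.log(0.5 * math.exp(-gamma) + 0.5) / gamma

print(coin_claim_price(0.0))     # 0.5: risk-neutral investor
print(coin_claim_price(5.0))     # strictly between 0 and 0.5
print(coin_claim_price(100.0))   # -> 0 for a very risk-averse investor
```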

Example: Forward on a non-dividend paying stock

Let us now focus on a more realistic case and find the maximum premium that a risk averse investor would be willing to pay for a forward contract on a non-dividend paying stock [1]. The buyer of a forward has the obligation to buy the stock at the expiry $T$ at a pre-agreed price $K$. Therefore, the payoff function reads:

$$f(S_T) = S_T - K$$

The maximum premium that the investor is willing to pay reads then:

$$C_t^i = - \frac{1}{\gamma_i} \log \int dS_T \, g(S_T)\, e^{-\gamma_i e^{-r(T-t)}(S_T-K)}$$

which in the case of a risk-neutral investor reduces to:

$$C_{t}^i(0) = \int dS_T\, g(S_T)\, e^{-r(T-t)}(S_T-K) = e^{-r(T-t)}\left(E[S_T] - K\right)$$

i.e. the price is simply the discounted expected payoff. The expectation represents the investor's belief about the value of the stock at expiry. It is model-free, meaning that we don't need to specify a model for the evolution of the stock to compute the maximum premium, although of course an investor could use a model to compute it. The value of the premium has the following dependencies:
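To see how risk aversion lowers the premium relative to the risk-neutral value, we can evaluate both expressions for the forward by Monte Carlo under a lognormal (GBM) belief for $S_T$. A sketch; the parameter values and the risk aversion $\gamma$ are illustrative, not from the text:

```python
import numpy as np

rng = np.random.default_rng(0)
S_t, K, mu, r, sigma, tau = 100.0, 100.0, 0.10, 0.05, 0.20, 1.0

# Sample S_T under the investor's GBM belief.
Z = rng.standard_normal(200_000)
S_T = S_t * np.exp((mu - 0.5 * sigma**2) * tau + sigma * np.sqrt(tau) * Z)

Y = np.exp(-r * tau) * (S_T - K)   # discounted forward payoff

C_neutral = Y.mean()               # risk-neutral premium, e^{-r tau}(E[S_T] - K)
gamma = 0.05                       # illustrative risk aversion
C_averse = -np.log(np.mean(np.exp(-gamma * Y))) / gamma

print(C_neutral, C_averse)         # the risk-averse premium is strictly lower
```

The ordering `C_averse < C_neutral` holds by Jensen's inequality, sample by sample.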

Example: European Call Options

The calculation for a forward was relatively tractable since the payoff of the derivative was linear in the stock price. What about non-linear payoffs? This is the case, for instance, of a European call option on a stock, which has the payoff:

$$f(S_T) = (S_T - K)^+$$

where $K$ is the strike of the option. The risk-averse investor will be willing to pay the dealer a maximum premium of:

$$C_t^i = -\frac{1}{\gamma_i} \log \int dS_T\, g(S_T)\, e^{-\gamma_i e^{-r(T-t)}(S_T-K)^+}$$

In the limit of a risk-neutral investor, the premium is:

$$C_{t}^i(0) = \int dS_T\, g(S_T)\, e^{-r(T-t)} (S_T-K)^+$$

Let us consider the case of a non-dividend paying stock, which we model as a geometric Brownian motion to ensure prices remain non-negative:

$$d S_t = \mu S_t dt + \sigma S_t d W_t$$

where $\mu$ and $\sigma$ are the drift and volatility of the stock, respectively, and $W_t$ is a Wiener process. Integrating this SDE up to $T$:

$$S_T = S_t e^{(\mu - \frac{\sigma^2}{2})(T-t) + \sigma \sqrt{T-t} Z}$$

where $Z \sim N(0,1)$. We can further decompose the expression of the premium as:

$$C_{t}^i(0) = \int_0^\infty dS_T\, g(S_T)\, e^{-r(T-t)} (S_T-K)^+ = \int_K^\infty dS_T\, g(S_T)\, e^{-r(T-t)} (S_T - K)$$
$$= e^{-r(T-t)} \left(\int_K^\infty dS_T\, g(S_T)\, S_T - K \int_K^\infty dS_T\, g(S_T)\right)$$

Let us now change variables from $S_T$ to $Z$ in the integration. The first integral becomes:

$$\int_K^\infty dS_T\, g(S_T)\, S_T = S_t \int_{\frac{1}{\sigma \sqrt{T-t}} \left(\log \frac{K}{S_t} - (\mu - \frac{\sigma^2}{2})(T-t)\right)}^\infty \frac{dZ}{\sqrt{2\pi}} e^{-\frac{Z^2}{2}} e^{(\mu - \frac{\sigma^2}{2})(T-t) + \sigma \sqrt{T-t} Z}$$
$$= S_t e^{(\mu - \frac{\sigma^2}{2})(T-t)} \int_{-d_2(\mu)}^\infty \frac{dZ}{\sqrt{2\pi}} e^{-\frac{Z^2}{2}} e^{\sigma \sqrt{T-t} Z} = S_t e^{\mu (T-t)} \int_{-d_2(\mu)-\sigma\sqrt{T-t}}^\infty \frac{dZ}{\sqrt{2\pi}} e^{-\frac{Z^2}{2}}$$
$$= S_t e^{\mu (T-t)} \left(1-N(-d_1(\mu))\right)$$

where we have defined the functions:

$$d_1(\mu) = \frac{1}{\sigma \sqrt{T-t}} \left(\log \frac{S_t}{K} + \left(\mu + \frac{\sigma^2}{2}\right)(T-t)\right)$$
$$d_2(\mu) = \frac{1}{\sigma \sqrt{T-t}} \left(\log \frac{S_t}{K} + \left(\mu - \frac{\sigma^2}{2}\right)(T-t)\right) = d_1(\mu) - \sigma\sqrt{T-t}$$

and $N(x)$ is the cumulative distribution function of the standard normal distribution. The second integral is then:

$$\int_K^\infty dS_T\, g(S_T) = \int_{-d_2(\mu)}^\infty \frac{dZ}{\sqrt{2 \pi}} e^{-\frac{Z^2}{2}} = 1-N(-d_2(\mu))$$

Putting everything together we get:

$$C_{t}^i(0)=S_t e^{(\mu-r)(T-t)}N(d_1(\mu))-K e^{-r(T-t)} N(d_2(\mu))$$

where we have used the property $1-N(-x) = N(x)$. Let us analyze this formula, which, recall, is the maximum premium that an investor is willing to pay for the call option. It depends on the following parameters:

We plot those dependencies in the following picture:


Figure 2: Dependencies of a call option premium for a risk-neutral investor, derived as the maximum premium the investor is willing to pay (reservation or indifference price). We use the parameters $S_t=100$, $K=100$, $T-t=1$ in years, $r=0.05$, $\mu=0.1$, $\sigma=0.2$.
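The closed-form premium is straightforward to implement. A sketch (the function name is ours; $N$ is the standard normal CDF, with $d_1 = d_2 + \sigma\sqrt{T-t}$; note that for $\mu = r$ the formula reduces to the standard Black-Scholes value):

```python
from math import log, sqrt, exp, erf

def norm_cdf(x: float) -> float:
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

def call_premium_risk_neutral(S, K, tau, r, mu, sigma):
    """Maximum premium of a risk-neutral investor for a European call:
    C = S e^{(mu - r) tau} N(d1) - K e^{-r tau} N(d2)."""
    d1 = (log(S / K) + (mu + 0.5 * sigma**2) * tau) / (sigma * sqrt(tau))
    d2 = d1 - sigma * sqrt(tau)
    return S * exp((mu - r) * tau) * norm_cdf(d1) - K * exp(-r * tau) * norm_cdf(d2)

# Parameters of Figure 2.
print(call_premium_risk_neutral(100, 100, 1.0, 0.05, 0.10, 0.20))
```

A bullish belief ($\mu > r$) increases the premium the investor is willing to pay, as the plot shows.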

The arbitrage-free theory of derivatives pricing

When we applied the utility indifference theory of derivatives pricing to both parties in the transaction, the investor and the dealer, we found the issue that, according to this theory, there would be no trade if both of them are risk averse, since they take opposite sides of the bet on the payoff. In reality, both dealers and investors are generally risk-averse and we do observe transactions, so what are we missing in our theory?

The answer, as we anticipated above, is that the dealer is not really simply taking the opposite of the bet. There are mainly two ways a dealer will conduct this business:

As we discussed in the first part of this chapter, if we are in the first situation, we might not need a derivatives pricing model, since we can extract the prices of derivatives directly from observations of trades or requests for quotes that are not closed. It is in the second case that we need a theory of derivatives pricing that takes into account potential hedging strategies mitigating the risk of the dealer. We anticipate that in this framework, the minimum price or premium at which the dealer will accept to sell the derivative is one that compensates it for the costs of hedging plus the residual risk.

More interestingly, we will see that under certain theoretical conditions a perfect hedging strategy might exist, so the minimum price will be exactly the cost of the hedging strategy, which in this setup is also called the perfect replication strategy (since a perfect hedge implies no risk, and therefore a replication of the payoff using other financial instruments). A consequence of the existence of such a replication strategy is that dealers are forced to price derivatives consistently; otherwise they would be generating risk-free arbitrage opportunities, where dealers who price the derivative correctly trade with the one mispricing it and pocket the difference without risk. Hence, this theory of derivatives pricing is also called the arbitrage-free theory of derivatives pricing.

Let us revisit the case of forwards and options under this optic.

Example: Forward on a non-dividend paying stock

Under the modelling hypotheses used in the previous sections to value the premium of a forward, namely that 1) the interest rate is locked during the period of the forward, and 2) there is no counterparty risk, i.e. no risk that the investor will not satisfy its obligations, there is actually a simple replication strategy that hedges all the risk of the contract. If the dealer is selling the forward to the investor, therefore guaranteeing a price $K$ at which the investor buys a share at time $T$, then:

If we consider that daily accrual of interest can be well approximated by continuous compounding, then the dealer needs to repay at $T$ an amount $S_t e^{r(T-t)}$. The payoff for the dealer at expiry is therefore:

$$K - S_t e^{r(T-t)}$$

which is deterministic under the hypotheses of the model. A rational dealer of course will not accept a deterministic loss, so the minimum premium it will command for this contract is minus the discounted value of this payoff:

$$C_{F,t} = S_t - K e^{-r(T-t)}$$

In practice, forward markets work by quoting the strike such that the premium is zero, hence:

$$F_t \equiv K = S_t e^{r(T-t)}$$

This is the arbitrage-free price of a forward contract. As mentioned above, it is called arbitrage-free since any other price would represent a risk-free arbitrage opportunity for another dealer. For example, let's assume this dealer quotes a strike $\bar{K}_t < F_t$. Another dealer could buy the forward from this dealer, and use the opposite replication strategy:

The payoff for this dealer is then:

$$S_t e^{r(T-t)} - \bar{K}_t > S_t e^{r(T-t)} - S_t e^{r(T-t)} = 0$$

i.e. a risk-free profit!
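The arbitrage can be illustrated with a few lines of arithmetic. Suppose a dealer quotes a strike below the arbitrage-free forward price (all numbers illustrative):

```python
from math import exp

S_t, r, tau = 100.0, 0.05, 1.0
F = S_t * exp(r * tau)      # arbitrage-free forward price
K_bar = F - 2.0             # mispriced quote, K_bar < F

# Arbitrageur: buys the forward at K_bar, shorts the stock at S_t and
# deposits the proceeds at the risk-free rate. At expiry the deposit is
# worth S_t e^{r tau}; paying K_bar for the share closes the short.
profit_at_T = S_t * exp(r * tau) - K_bar

print(F, profit_at_T)  # the profit equals the mispricing, with no risk taken
```

The profit is deterministic: it does not depend on where the stock ends up at $T$.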

Example: European Call Options

In the case of options there is no such obvious static replication strategy if we are only allowed to use the underlying stock and repo contracts. By static replication strategy we mean that we don't need to modify the positions of the replication portfolio (the stock and the repo) during the life of the contract. If we are able to trade other derivatives, however, there is actually a static replication strategy. If the dealer sells the call option to an investor, then immediately:

The payoff at the expiry is:

$$-(S_T-K)^+ + (K-S_T)^+ + (S_T-K) = 0$$

meaning that the portfolio of a put and a forward replicates the call option. At inception, the dealer is paid a premium $C_{C,t}$ for the call and pays $C_{P,t}$ for the put and $C_{F,t}$ for the forward, hence the payoff at inception is:

$$C_{C,t} - C_{P,t} - C_{F,t}$$

In order to avoid losses, the minimum premium that the dealer must command is therefore:

$$C_{C,t} = C_{P,t} + C_{F,t}$$

which is called the put-call parity relationship. The premium for the forward can be derived as discussed in the previous section; however, we are left with a sort of chicken-and-egg problem with regards to the call and put premiums: given one, we can determine the other, but we don't yet have a replication strategy for the put to derive the call premium, and vice-versa.
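The replication behind put-call parity is a pathwise identity: for every terminal stock price the combined payoff vanishes, as a quick numerical check of $-(S_T-K)^+ + (K-S_T)^+ + (S_T-K) = 0$ confirms:

```python
K = 100.0
for S_T in [0.0, 50.0, 99.9, 100.0, 100.1, 150.0, 1000.0]:
    call = max(S_T - K, 0.0)   # payoff of the sold call
    put = max(K - S_T, 0.0)    # payoff of the bought put
    fwd = S_T - K              # payoff of the bought forward
    assert -call + put + fwd == 0.0
print("parity holds pathwise")
```

Because the identity holds state by state, no probabilistic model is needed to derive the parity, only the law of one price.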

If no static replication is available, is it possible to find a dynamic one that reproduces the payoff without uncertainty? Or are we left with strategies that, though they might minimize uncertainty, don't remove it, so that we need to go back to our utility indifference theory?

The Black-Scholes-Merton Model

Fischer Black and Myron Scholes Black & Scholes, 1973, and separately Robert Merton Merton, 1973, provided an answer to this question: under certain theoretical conditions, we can indeed find a dynamic replication strategy, based on the underlying stock and a risk-free account, that reproduces the payoff with no uncertainty. The main conditions are the following:

$$d S_t = \mu S_t dt + \sigma S_t d W_t$$
We have considered the drift $\mu$ and volatility $\sigma$ constant, but the model can be generalized to time-dependent deterministic drifts and volatilities. Other dynamics can also be considered within the same framework. Also, we will consider a non-dividend paying stock, but the model can be adjusted to consider deterministic dividends.
$$\Pi_t = \Delta_t S_t + \beta_t$$
The model can be generalized easily to time-dependent risk-free interest rates. The original model considers a generic cash account, although in practice it is more realistic to assume a repo on the stock is used for funding.
$$d\Pi_t = \Delta_t d S_t + d \beta_t = \Delta_t d S_t + r \beta_t dt = \Delta_t d S_t + r (\Pi_t - \Delta_t S_t) dt$$
Notice that the position in the stock depends on time, but it is always adjusted ex-post, i.e. after the market moves.

If the portfolio $\Pi_t$ indeed replicates the option payoff at maturity $T$ in any scenario, we must have $\Pi_T = C_T = (S_T - K)^+$, where we have considered a call option but the argument applies to any other payoff function. Since by construction the portfolio $\Pi_t$ is self-financing, if such a dynamic strategy exists we must have $\Pi_t = C_t$ for any time $t \leq T$. Otherwise, there would be a risk-free arbitrage opportunity: e.g. if $C_t > \Pi_t$, selling the option at price $C_t$, buying the replicating portfolio $\Pi_t$ with the cash from the premium, and making an instantaneous gain of the difference $C_t - \Pi_t$. The same reasoning applies if $C_t < \Pi_t$, only in this case we buy the option, paying a discounted premium. Therefore, if we can find such a strategy we immediately solve the problem of option pricing, since by the no-arbitrage argument the premium is equal to the cost of replication.

The condition $\Pi_t = C_t$ is equivalent to $\Pi_T = C_T$ and $d\Pi_t = dC_t$. Additionally, this equality implies that $C_t = C(t, S_t)$, i.e. the premium of the option has to be a function of time and the current value of the stock. We can then use Ito's lemma to further understand the requirements for such a strategy to exist:

$$d C_t = \frac{\partial C_t}{\partial t} dt + \frac{\partial C_t}{\partial S_t} dS_t + \frac{1}{2}\frac{\partial^2 C_t}{\partial S_t^2} \sigma^2 S_t^2 dt = d\Pi_t = \Delta_t d S_t + r\beta_t dt$$

Grouping the terms proportional to $dt$ and $dS_t$ separately we have:

$$\left(\frac{\partial C_t}{\partial t} + \frac{1}{2} \sigma^2 S_t^2 \frac{\partial^2 C_t}{\partial S_t^2} - r\beta_t\right) dt + \left(\frac{\partial C_t}{\partial S_t} - \Delta_t\right) dS_t = 0$$

Since the equality must hold for arbitrary $dt$ and $dS_t$, each term in parentheses must vanish separately. Starting with the second term we have:

$$\Delta_t = \frac{\partial C_t}{\partial S_t}$$

which is called the delta-hedging condition, given its obvious connection with linear hedging strategies, where we try to neutralize the exposure of a financial portfolio to a risk factor, in this case the stock price. By delta-hedging the portfolio we ensure that its value is independent of the random dynamics of the stock, at least during an infinitesimal time. However, a strategy that delta-hedges the portfolio at every time is not necessarily self-financing, i.e. we might need to add extra cash during the life of the portfolio. The problem is that a priori the cash required for delta-hedging would depend on the path of the stock, making it a random variable. In that case the premium would also be a random quantity with a certain distribution.

In order to have a deterministic premium the portfolio has to be self-financing, which requires that the first term of the equality above is also zero, namely:

$$\frac{\partial C_t}{\partial t} + \frac{1}{2} \sigma^2 S_t^2 \frac{\partial^2 C_t}{\partial S_t^2} + r S_t \frac{\partial C_t}{\partial S_t} = rC_t$$

where we have used the self-financing condition $\beta_t = \Pi_t - \Delta_t S_t = C_t - \frac{\partial C_t}{\partial S_t} S_t$. The resulting equation is the celebrated Black-Scholes-Merton (BSM) partial differential equation. Solving this equation with the terminal condition $C_T = (S_T - K)^+$ allows us to compute the value of the premium for any time $t \leq T$ deterministically. Therefore, by virtue of the replicating portfolio and no-arbitrage arguments, if the Black-Scholes-Merton conditions are satisfied, the price of an option is no longer a random quantity as in the utility indifference framework.
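One can verify numerically that a candidate pricing function satisfies the BSM PDE by approximating the partial derivatives with central finite differences. A sketch, using the standard BSM closed-form call price as the candidate (illustrative parameters; helper names are ours):

```python
from math import log, sqrt, exp, erf

def N(x):
    """Standard normal CDF."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

def bsm_call(S, t, K=100.0, T=1.0, r=0.05, sigma=0.2):
    tau = T - t
    d1 = (log(S / K) + (r + 0.5 * sigma**2) * tau) / (sigma * sqrt(tau))
    d2 = d1 - sigma * sqrt(tau)
    return S * N(d1) - K * exp(-r * tau) * N(d2)

def pde_residual(S, t, r=0.05, sigma=0.2, h=1e-3):
    """Residual of C_t + 0.5 sigma^2 S^2 C_SS + r S C_S - r C."""
    C = bsm_call(S, t)
    C_t = (bsm_call(S, t + h) - bsm_call(S, t - h)) / (2 * h)
    C_S = (bsm_call(S + h, t) - bsm_call(S - h, t)) / (2 * h)
    C_SS = (bsm_call(S + h, t) - 2 * C + bsm_call(S - h, t)) / h**2
    return C_t + 0.5 * sigma**2 * S**2 * C_SS + r * S * C_S - r * C

print(pde_residual(100.0, 0.5))  # ~0 up to discretization error
```

The residual vanishes up to finite-difference error, confirming the closed form is a solution of the PDE with the call's terminal condition.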

Before discussing the solution to this equation, we can get another insight simply by inspecting it: the option premium does not depend on the drift $\mu$ of the stock, only on its volatility. In the utility indifference framework, the estimation of the drift plays a big role in the price that the investor is willing to pay for the option, since it affects the probability of exercising the option. However, for a dealer pricing the option, as far as the BSM framework holds, the directionality of the market is irrelevant, since investing the premium into the BSM dynamic portfolio and implementing the dynamic strategy guarantees a replication of the payoff in any scenario.

Solving the Black-Scholes-Merton equation

There are different ways to solve the BSM equation. Most introductory textbooks on the topic (see for example Joshi, 2003, Wilmott, 2007) follow the derivation used in the seminal paper, which uses an ansatz that transforms the equation into the heat equation, whose analytical solution is well-known Evans, 2010. Here, we will take a different approach and use the Feynman-Kac theorem introduced in Chapter (#stochastic_calculus), section (#feynman_kac). Recall that the Feynman-Kac theorem provides a general solution to a family of partial differential equations in terms of an expected value. In the interest of the reader we review it again here: the solution to the PDE

$$\frac{\partial u}{\partial t} + \mu(x,t) \frac{\partial u}{\partial x} + \frac{1}{2} \sigma^2(x,t) \frac{\partial^2 u}{\partial x^2} - r(x,t)u = 0$$

with the boundary condition $u(x,T) = f(x)$, is the following expected value:

$$u(x,t) = \mathbb{E}\left[ e^{-\int_t^T r(X_s,s) ds} f(X_T) \Big| X_t = x \right]$$

where $X_t$ satisfies the general stochastic differential equation:

$$d X_t = \mu(X_t, t) dt + \sigma(X_t, t) dW_t$$

The BSM equation is a specific case of this general PDE with $X_t \rightarrow S_t$, $u(x,t) \rightarrow C(s, t)$, $\mu(x,t) \rightarrow r x$, $\sigma(x,t) \rightarrow \sigma x$, $r(x,t) \rightarrow r$. The solution of the BSM equation can then be expressed as:

$$C(s, t) = \mathbb{E}\left[ e^{-r(T-t)} (S_T - K)^+ \Big| S_t = s \right]$$

where $S_t$ satisfies the SDE:

$$d S_t = r S_t dt + \sigma S_t dW_t$$

The first crucial observation is that the solution is remarkably close to the one analyzed in the context of utility indifference pricing for a risk-neutral investor, with one caveat: the expected value is taken with respect to an SDE for the stock price that has the drift $\mu$ replaced by the risk-free interest rate $r$. In the former case, the investor was taking the risk of the contract but was indifferent to risk. In the latter, the dealer might be risk averse, but since the BSM dynamic hedging strategy neutralizes the risk (if the hypotheses are correct), there is no risk taken, only a deterministic cost to execute the hedging strategy, which is charged to the investor as the premium (plus potentially a margin for the service). In the jargon, the dealer prices the derivative as a risk-neutral investor computing probabilities in a different probability measure, where the drift is $r$. Such a measure is usually called the risk-neutral measure. This framework can be generalized to other derivatives, allowing dealers to compute prices directly, skipping the construction of the hedging portfolio and the solution of the PDE. It is appropriately referred to as risk-neutral derivatives pricing.
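In this risk-neutral representation the premium can be estimated directly by simulating the stock under the drift $r$ and averaging the discounted payoff. A sketch with illustrative parameters (for these values the textbook closed-form price is about 10.45):

```python
import numpy as np

rng = np.random.default_rng(42)
S_t, K, r, sigma, tau = 100.0, 100.0, 0.05, 0.20, 1.0

# Simulate S_T under the risk-neutral measure: the drift is r, not mu.
Z = rng.standard_normal(500_000)
S_T = S_t * np.exp((r - 0.5 * sigma**2) * tau + sigma * np.sqrt(tau) * Z)

price = np.exp(-r * tau) * np.maximum(S_T - K, 0.0).mean()
print(price)  # Monte Carlo estimate, close to the closed-form value
```

No reference to the real-world drift $\mu$ appears anywhere in the computation.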

From a financial point of view, though, it is convenient not to lose the connection to the dynamic hedging strategy, since the validity of this theory is linked to the validity of the hypotheses made to derive the PDE. If we are interested in introducing more realistic dynamics into the pricing, like non-continuous hedging and transaction costs, we need to start again from the replication portfolio and the hedging strategy. When those hypotheses are relaxed, though, we are back to a probability distribution characterizing the premium of the option instead of a deterministic value, the latter being one of the main remarkable results of the BSM theory.

The computation of the expectation value for the option premium can be reused from the one done in the context of utility indifference pricing, simply by substituting $\mu \rightarrow r$ in the expressions:

$$C(S_t,t)= S_t N(d_1(r))-K e^{-r(T-t)} N(d_2(r))$$

where:

$$d_1(r) = \frac{1}{\sigma \sqrt{T-t}} \left(\log \frac{S_t}{K} + \left(r + \frac{\sigma^2}{2}\right)(T-t)\right)$$
$$d_2(r) = \frac{1}{\sigma \sqrt{T-t}} \left(\log \frac{S_t}{K} + \left(r - \frac{\sigma^2}{2}\right)(T-t)\right) = d_1(r) - \sigma\sqrt{T-t}$$

This is the celebrated Black-Scholes-Merton option pricing formula. It provides dealers with option prices that depend on

It does not depend on the stock drift $\mu$, i.e. the expected trend of the market. This is because in the BSM framework the dealer uses the dynamic hedging strategy to make the portfolio risk-free and therefore shielded from any market trend.

Again, we must emphasize that despite the similarity with the risk-neutral option premium computed in the utility indifference framework, in the BSM framework the dealer is not computing the premium based on real probability scenarios of stock prices. The connection between replicating strategies and expectation values provided by the Feynman-Kac theorem is a useful mathematical result that simplifies the computation of derivatives premia in the BSM framework. But those expectations should not be interpreted in the sense of real probabilities (or probabilities in the real probability measure, as they are also referred to).

The dependencies of the option formula for a European call option with respect to its parameters are similar to the ones seen in the context of utility indifference pricing, with the exception of the dependence on the risk-free interest rate $r$, whose role in the formula now goes beyond cash-flow discounting. In the following picture we compare the premia derived with BSM versus utility indifference pricing:


Figure 3: Option price premium calculated using utility indifference pricing versus the arbitrage-free theory. The results are shown for a European call option with parameters $S_t=100$, $K=100$, $T-t=1$ in years, $r=0.05$, $\mu=0.1$, $\sigma=0.2$.

The first main difference is, as commented, the dependence on the risk-free interest rate: in BSM theory, the premium becomes more valuable as interest rates increase, whereas in utility indifference pricing it was the other way around. In the latter, interest rates enter the formula to discount the value of future cash-flows. As interest rates grow, the opportunity cost of investing the premium in the option's future cash-flows increases, since a risk-free deposit generates a comparatively larger yield. In other words, the option becomes less attractive for the investor, who is willing to pay less for it. However, in the BSM theory used by dealers to price options, the opposite happens: higher interest rates make funding the hedging strategy more costly, therefore requiring a larger compensation in terms of premium to execute the replication strategy.

The second big difference is of course the dependence on the expected market drift, since in BSM theory the premium is independent of it. The resulting plot is interesting because it points to the resolution of the question of why options are traded in practice, assuming both dealers and investors have the same view of the market. As mentioned before, if both were to value the option as an investment, there would be no agreement on the premium to pay, since they hold opposite sides of the trade. However, if the dealer uses BSM theory there is room for agreement, at least for those investors whose expectations on market drift make the premium requested by the dealer attractive. In the picture, such a situation corresponds to those expected drifts that make the premium an investor is willing to pay larger than the one commanded by the dealer.

From a classical Economics point of view, those frameworks fit well together to explain the derivatives market in terms of demand and supply. Demand for options is driven by investors looking to generate returns on investment, whereas supply comes from dealers that “fabricate” those options using replication strategies. The BSM premium is essentially the cost of “fabricating” the option, in analogy to the language used in the production of goods.

An alternative derivation: the market price of risk

An alternative derivation of the BSM equation, which can be helpful to gain intuition on the theory, uses the financial concept of market price of risk. The market price of risk is essentially a Sharpe ratio, commonly used in the theory of investment. The Sharpe ratio computes the excess return of an investment, i.e. the expected return net of the risk-free rate, over its risk, defined as the volatility of the returns. For the stock that is the underlying of the option, and using continuous time, this is:

$$\lambda_{S} = \frac{ \mathbb{E}\left[\frac{dS_t}{S_t}\right]- rdt}{\sqrt{Var\left[\frac{dS_t}{S_t}\right]}} = \frac{\mu_t - r}{\sigma}\sqrt{dt}$$

We can now use Ito’s formula to compute the market price of risk of the option:

$$\lambda_{C} = \frac{ \mathbb{E}\left[\frac{dC}{C}\right]- rdt}{\sqrt{Var\left[\frac{dC}{C}\right]}} = \frac{\frac{\partial C}{\partial t}+ \mu_t S_t\frac{\partial C}{\partial S_t}+\frac{1}{2}\sigma^2 S_t^2 \frac{\partial^2 C}{\partial S_t^2} - rC}{\sigma S_t \frac{\partial C}{\partial S_t}} \sqrt{dt}$$

We can now apply a different version of the arbitrage-free argument. Since the value of the option is entirely derived from the underlying stock, which is the only risk factor affecting the option price in the BSM theory, then as investment opportunities both should have the same Sharpe ratio or market price of risk, i.e. $\lambda_S = \lambda_C$. Otherwise, investors would bid up the price of the one with the largest Sharpe ratio until both of them equalize. Applying this equality, the terms proportional to the drift $\mu_t$ cancel and we get back the BSM differential equation.

One could of course have used the argument in reverse, reorganizing the BSM equation in terms of market prices of risk to prove that their equality is a consequence of the arbitrage-free argument used when building the replication portfolio.

Using the BSM framework in practice

The Black-Scholes-Merton pricing theory represented a change of paradigm for dealers creating liquidity in option markets. The theory allows for a consistent pricing of derivatives beyond options, providing not only a way to calculate the premium but also a hedging strategy that neutralizes the risk of the derivative, or, from a different angle, a recipe to synthesize those derivatives from liquid tradable instruments.

However, the BSM theory is based on multiple hypotheses that are not necessarily realistic, so the dealer needs to assess how relevant the deviations of those hypotheses from reality are for the pricing and hedging of derivatives. For instance:

In general, these deviations from the assumptions make the premium no longer deterministic, since they introduce uncertainty in its estimation. Dealers typically need to estimate how much they have to increase the BSM premium to compensate for those risks. In the following plots we show the histogram of differences between the replication portfolio and the option payoff at maturity when different assumptions of the Black-Scholes-Merton theory are violated, namely:

$$d S_t = \mu S_t dt + \sqrt{V_t} S_t dW_{1,t}$$
$$d V_t = \kappa (\theta - V_t) dt + \xi \sqrt{V_t} dW_{2,t}$$
$$\mathbb{E}\left[dW_{1,t}\,dW_{2,t}\right] = \rho\, dt$$

Figure 4: Histograms showing the difference between the replication portfolio using the BSM dynamic hedging strategy and the actual payoff of a European call option. Each plot shows the impact of violating a different hypothesis of the BSM theory. We run 10000 simulations for each case. We use the parameters $S_t=100$, $K=100$, $T-t=1$ in years, $r=0.05$, $\mu=0.1$, $\sigma=0.2$ for the baseline scenario, where we expect close to perfect replication when re-hedging continuously. To test the effect of different realized volatilities we use $\sigma=0.19$ and $\sigma=0.21$, respectively. To test the effect of transaction costs we add a constant bid-ask half-spread of 0.005%. To test the effect of a different market dynamics we use a Heston model with parameters $\kappa=20$ (mean reversion rate of the squared volatility), $\theta=0.04$ (long-term mean of the squared volatility), $\xi=0.2$ (volatility of the squared volatility), $\rho=-0.7$ (correlation between stock and stochastic volatility risk factors).

In the plots, we can see the different impacts that the violations produce on the distribution of the residuals. Whereas less frequent re-hedging increases the dispersion of the residuals, these remain unbiased and symmetric. Other violations produce skewed distributions with non-zero mean. When introducing transaction costs, residuals have a negative mean, meaning that the BSM premium is insufficient to cover the actual costs of replication, as expected, since the BSM derivation does not take such transaction costs into account. The effect of a different dynamics for the underlying is more nuanced: depending on the actual process, the distribution of residuals might have positive or negative mean, meaning that the BSM premium over-estimates or under-estimates, respectively, the costs of replication. For instance, if the realized volatility is lower than the one used for pricing and hedging in the BSM model, the mean of the residual is positive. This is, again, intuitive, since as we have seen the premium of the option increases with volatility, so overestimating it means charging a larger premium than necessary for replication. Notice that this is not necessarily good for a dealer in competition: if other dealers have a better estimation of market volatilities, they will be able to offer more competitive prices to clients and close more deals.
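The kind of experiment behind these histograms can be reproduced with a short simulation: sell the option at the BSM premium, delta-hedge at discrete dates, and record the terminal residual. A minimal sketch of the baseline case (daily re-hedging, no transaction costs; fewer paths than the figure for speed, and all helper names are ours):

```python
import numpy as np
from math import log, sqrt, exp, erf

def N(x):
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

def bsm_call(S, tau, K, r, sigma):
    d1 = (log(S / K) + (r + 0.5 * sigma**2) * tau) / (sigma * sqrt(tau))
    return S * N(d1) - K * exp(-r * tau) * N(d1 - sigma * sqrt(tau))

def bsm_delta(S, tau, K, r, sigma):
    d1 = (log(S / K) + (r + 0.5 * sigma**2) * tau) / (sigma * sqrt(tau))
    return N(d1)

def hedge_residual(rng, S0=100.0, K=100.0, T=1.0, r=0.05, mu=0.10,
                   sigma=0.20, n_steps=252):
    """Terminal value of (premium + self-financed hedge) minus the payoff."""
    dt = T / n_steps
    S = S0
    cash = bsm_call(S0, T, K, r, sigma)  # premium received at inception
    stock = 0.0                          # shares held
    for i in range(n_steps):
        tau = T - i * dt
        d = bsm_delta(S, tau, K, r, sigma)
        cash -= (d - stock) * S          # rebalance to the BSM delta
        stock = d
        cash *= exp(r * dt)              # cash account accrues interest
        S *= exp((mu - 0.5 * sigma**2) * dt
                 + sigma * sqrt(dt) * rng.standard_normal())
    return cash + stock * S - max(S - K, 0.0)

rng = np.random.default_rng(7)
residuals = [hedge_residual(rng) for _ in range(500)]
print(np.mean(residuals), np.std(residuals))  # near-zero mean, small dispersion
```

Note that the real-world drift $\mu \neq r$ is used to simulate the market, while the hedge uses the risk-neutral formulas; the residual stays centered near zero regardless, which is the whole point of the theory.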

The stochastic discount factor (SDF) pricing framework

This pricing framework stipulates a fundamental pricing equation for any asset (and, in particular, any financial instrument) with a form similar to the naive pricing equation of the present value framework when applied to assets with uncertain future cash-flows:

$$p_t = {\mathbb E}_t \left[\sum_{i=1}^N m_{t_i} C_{t_i}\right]$$

The difference is that now $m_{t_i}$ is a stochastic discount factor that does not necessarily have the form derived using the argument based on the time value of money, namely $e^{-rt_i}$. That such a pricing equation is general enough to price any asset can be derived from two hypotheses:

We can proceed now with the derivation of the pricing equation. If the law of one price holds, we can express the price of any generic instrument as:

$$p(X) = \sum_{i} p(1_{s_i}) C_i$$

If we now multiply and divide by the probabilities of each state $s_i$, denoted $\pi_i$:

$$p(X) = \sum_{i} \frac{p(1_{s_i})}{\pi_i} \pi_i C_i \equiv {\mathbb E}[m X]$$

where we have defined the stochastic discount factor as $m_i \equiv \frac{p(1_{s_i})}{\pi_i}$. Notice that, in order to avoid arbitrage opportunities, this stochastic discount factor has to be strictly positive, $m_i > 0$. To prove it, notice that absence of arbitrage requires any instrument with non-negative cash-flows, strictly positive in at least one state, to have a strictly positive price. Applying the pricing equation to each Arrow-Debreu security:

$$p(1_{s_i}) = m_i \pi_i > 0 \rightarrow m_i > 0, \quad \forall i$$

since $\pi_i$ is strictly positive (states with zero probability can be discarded from the outset). The existence of a strictly positive stochastic discount factor consistent with all prices is the content of the fundamental theorem of asset pricing Cochrane, 2005.
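To make this concrete, here is a minimal numerical sketch (all state prices and probabilities are made up for illustration) that builds the SDF from Arrow-Debreu prices, checks its positivity, and verifies that pricing via ${\mathbb E}[mX]$ coincides with the direct state-price sum:

```python
import numpy as np

# Hypothetical one-period market with three states: Arrow-Debreu prices
# p(1_{s_i}) and physical probabilities pi_i (illustrative numbers only).
state_prices = np.array([0.30, 0.45, 0.20])   # p(1_{s_i})
probs        = np.array([0.25, 0.50, 0.25])   # pi_i

# Stochastic discount factor in each state: m_i = p(1_{s_i}) / pi_i
m = state_prices / probs
assert np.all(m > 0)  # strictly positive SDF <=> no arbitrage

# Price a payoff X = sum_i C_i 1_{s_i} as E[m X] ...
payoff = np.array([100.0, 105.0, 110.0])      # C_i
price_sdf = np.sum(probs * m * payoff)

# ... which must coincide with the direct sum p(X) = sum_i p(1_{s_i}) C_i
price_direct = np.sum(state_prices * payoff)
print(price_sdf, price_direct)  # both 99.25
```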

Given the role that time has in structuring the cash-flows of financial instruments, it makes sense to include the time dimension explicitly in the pricing equation. Let us add a time index to the market states, $s_{t,i} \in S_t$, so that we have a complete set of possible market states for each time $t$, which for the moment we take to belong to a discrete time grid. A generic instrument is now expressed as:

$$X = \sum_{t,i} 1_{s_{t,i}} C_{t,i}$$

Let us focus for the moment on a single Arrow-Debreu security paying $1_{s_t,i}$. If we simply extend our pricing function to add the explicit time component, we would have $p_{t_0}(1_{s_t,i}) = {\mathbb E}_{t_0}[m_t 1_{s_t,i}]$. However, let us consider an intermediate time $t_0 < t' < t$. According to this definition, the price at time $t'$ of the same instrument would be $p_{t'}(1_{s_t,i}) = {\mathbb E}_{t'}[m_t 1_{s_t,i}]$. If this is the case, nothing prevents us from defining a financial instrument that simply pays $p_{t'}(1_{s_t,i})$ at time $t'$. Using our pricing formula, the price of this instrument at time $t_0$ is ${\mathbb E}_{t_0}[m_{t'} p_{t'}(1_{s_t,i})]$. But this should be consistent with simply applying the tower law to our original pricing formula:

$$p_{t_0}(1_{s_t,i}) = {\mathbb E}_{t_0}\left[{\mathbb E}_{t'}[m_t 1_{s_t,i}]\right] = {\mathbb E}_{t_0}\left[p_{t'}(1_{s_t,i})\right] \neq {\mathbb E}_{t_0}\left[m_{t'} p_{t'}(1_{s_t,i})\right]$$

The issue can be overcome by deflating our pricing formula by the discount factor at the pricing time, so we have:

$$p_{t_0}(1_{s_t,i}) = {\mathbb E}_{t_0}\left[\frac{m_t}{m_{t_0}} 1_{s_t,i}\right]$$

Using this corrected formula:

$$p_{t_0}(1_{s_t,i}) = {\mathbb E}_{t_0}\left[{\mathbb E}_{t'}\left[\frac{m_t}{m_{t_0}} 1_{s_t,i}\right]\right] = {\mathbb E}_{t_0}\left[\frac{m_{t'}}{m_{t_0}} p_{t'}(1_{s_t,i})\right]$$

which is now consistent. The intuition behind this adjustment is that the stochastic discount factor implicitly defines the numeraire of the economy - that is, a reference asset used as a unit of account that ensures prices are consistent across time. Deflating by the discount factor at the pricing date ensures that all values are measured in the same unit of account, so that prices observed at different times can be consistently compared and aggregated.

We can now extend the pricing formula to our generic instrument $X$:

$$p_{t_0}(X) = {\mathbb E}_{t_0}\left[\sum_{t > t_0}\frac{m_t}{m_{t_0}} C_{t}\right]$$

Notice that this equation implies a recursive pricing equation:

$$p_{t_0}(X) = {\mathbb E}_{t_0}\left[\frac{m_{t_0+1}}{m_{t_0}}\left(p_{t_0+1}(X') + C_{t_0+1}\right)\right]$$

where $X'$ denotes the instrument stripped of the cash-flow paid at $t_0+1$, i.e. the claim to the remaining cash-flows $\{C_t\}_{t > t_0+1}$.

When using the temporal representation, it is mathematically convenient to derive a continuous-time version of the pricing formula. For that we introduce a time step $\Delta t$, so that $t_k = t_0 + k \Delta t$, $k = 1, ..., N$, and a cash-flow rate $c_{t_k}$ defined by $C_{t_k} = c_{t_k} \Delta t$. Our pricing equation becomes:

$$m_{t_0} p_{t_0}(X) = {\mathbb E}_{t_0}\left[\sum_{k = 1}^N m_{t_k} c_{t_k} \Delta t\right]$$

Taking the continuous limit $\Delta t \rightarrow 0$ we get:

$$m_{t_0} p_{t_0}(X) = {\mathbb E}_{t_0}\left[\int_{t_0}^T m_{t} c_{t} dt\right]$$

where $T \equiv t_0 + N \Delta t$. The one-period recursive equation now becomes:

$$0 = m_t c_t dt + {\mathbb E}_{t}[d(m_t p_t)]$$

which in the absence of cash-flows between $t$ and $t+dt$ becomes simply:

$$0 = {\mathbb E}_{t}[d(m_t p_t)]$$

or, equivalently:

$$0 = m_t {\mathbb E}_{t}[d p_t] + p_t {\mathbb E}_{t}[d m_t] + {\mathbb E}_{t}[d m_t \, d p_t]$$

Practical applications of the SDF pricing framework

As we have discussed extensively in this chapter, we need fundamental pricing frameworks when we do not have access to liquid market prices of financial instruments; otherwise, we can just extract fair values using filtering techniques. If markets are complete, the general idea is to compute the stochastic discount factor from the available prices of liquid instruments, and then use it to price illiquid instruments that share the same risk factors as the tradable ones.

Notice that if the market is not complete, we can still use this framework to find a projection of the discount factor on the subspace of payoffs spanned by instruments with available market prices. This projected discount factor can be used to find a consistent price for instruments whose risk factors lie partially outside this subspace, by decomposing the general discount factor as:

$$m = m^* + \epsilon, \quad {\mathbb E}[\epsilon X] = 0 \text{ for every traded payoff } X$$

where $m^*$ is the projection of $m$ on the space of traded payoffs and $\epsilon$ is orthogonal to it. This will provide us with a price that has the minimum uncertainty given the available prices.

In complete markets, we have the guarantee that a stochastic discount factor exists. Let us compute it for some representative financial instruments.

Bond pricing

We consider a standard bond paying a fixed coupon rate $c$ at periodic times $t_i$, $i = 1, ..., N$. The day-count fraction $\gamma_i$ is the annualized fraction of days between consecutive coupon payments. The bond references a notional $M$, so the coupon cash-flows are $C_i = \gamma_i c M$ and the principal paid at maturity $T = t_N$ is $M$.

The payoff is therefore:

$$X = \sum_{i=1}^{N} \gamma_i c M 1_{t_i} + M 1_T$$

The price at time $t$ is therefore given by:

$$B_t = {\mathbb E}_t\left[ \sum_{i=1}^{N} \frac{m_{t_i}}{m_t}\gamma_i c M 1_{t_i} + \frac{m_{T}}{m_t} M 1_T \right] = \sum_{i=1}^{N} {\mathbb E}_t\left[ \frac{m_{t_i}}{m_t}\right] \gamma_i c M + {\mathbb E}_t\left[\frac{m_{T}}{m_t}\right] M$$

We define the discount factor as $D(t, t_i) \equiv {\mathbb E}_t\left[ \frac{m_{t_i}}{m_t}\right]$, so we have:

$$B_t = \sum_{i=1}^{N} D(t, t_i) \gamma_i c M + D(t, T) M$$
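The pricing formula above is straightforward to evaluate once the discount factors are known. A minimal sketch, with made-up discount factors and an annual coupon schedule:

```python
import numpy as np

def bond_price(discount_factors, day_count_fractions, coupon_rate, notional):
    """B_t = sum_i D(t, t_i) gamma_i c M + D(t, T) M, where the last entry
    of discount_factors corresponds to the maturity T = t_N."""
    D = np.asarray(discount_factors, dtype=float)
    gamma = np.asarray(day_count_fractions, dtype=float)
    return np.sum(D * gamma * coupon_rate * notional) + D[-1] * notional

# Illustrative inputs: 3 annual coupons of 5% on a notional of 100
D = [0.97, 0.94, 0.90]      # D(t, t_i), hypothetical values
gamma = [1.0, 1.0, 1.0]     # annual day-count fractions
print(bond_price(D, gamma, coupon_rate=0.05, notional=100.0))  # 104.05
```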

How to proceed from here depends on our modelling choices regarding the risk factors that are relevant for pricing, as well as on the set of liquid instruments with available prices. For example, if we have a set of $N$ bonds from the same issuer paying coupons at the same dates $t_i$ but with different maturities, we could simply write the $N$ pricing equations and solve for the discount factors $D(t, t_i)$, without having to compute the SDF explicitly. This could be used to price non-standard bonds (e.g. with different deterministic coupons or day-count conventions) as long as they pay on the same time grid. If they pay at different times, we need to make some theoretical hypotheses in order to interpolate the discount factors, or directly build a model of the SDF.
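This bootstrapping idea amounts to solving a linear system: with $N$ bonds paying on $N$ common dates, the cash-flow matrix is triangular and the discount factors follow directly. A sketch with invented market prices:

```python
import numpy as np

# Hypothetical issuer: 3 bonds with annual coupons on the common grid
# t_i = 1, 2, 3 years, maturities of 1, 2 and 3 years, notional 100.
coupon_rates = np.array([0.04, 0.05, 0.06])
notional = 100.0

# Cash-flow matrix C[k, i]: cash flow of bond k at date t_i (gamma_i = 1)
C = np.zeros((3, 3))
for k, c in enumerate(coupon_rates):
    C[k, :k + 1] = c * notional     # coupons up to and including maturity
    C[k, k] += notional             # principal repaid at maturity

market_prices = np.array([101.0, 103.0, 105.0])   # made-up observed prices

# The N pricing equations B_k = sum_i C[k, i] D(t, t_i) form a linear system
D = np.linalg.solve(C, market_prices)
print(D)  # implied discount factors D(t, t_i) on the common grid
```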

A first simple model assumes that bonds depend on a single risk factor, an overall macroeconomic interest rate $r_t$, for example a short-term interbank rate (e.g. one linked to collateralized contracts like overnight index swaps, see chapter {ref}`intro_financial_instruments`). For the moment, we consider it deterministic and constant: $r_t = r$. Let us assume that in this market we have access to a money-market account that accrues interest continuously. The payoff at time $T$ of the money-market account is $\beta_T = \beta_t e^{r(T-t)}$, for an initial investment $\beta_t$, which is also naturally the price of this instrument at time $t$. Therefore, the pricing equation is given by:

$$m_t \beta_t = {\mathbb E}_t\left[ m_T \beta_T \right] = m_T \beta_T$$

where, in the second step, we have used the fact that interest rates are deterministic and also the only risk factor in our model, so the SDF becomes deterministic as well. Therefore, the SDF is given by:

$$m_T = m_t e^{-r(T-t)}$$

whose dynamics is $dm_t = - r m_t dt$. We can then simply compute the discount factors at any arbitrary time as $D(t, t_i) = e^{-r(t_i - t)}$, and the price of a standard bond simplifies to:

$$B_t = \sum_{i=1}^{N} e^{-r(t_i-t)} \gamma_i c M + e^{-r(T-t)} M$$

Notice that we have recovered the pricing equation derived in the present value pricing framework. As already anticipated, the SDF framework is general enough to incorporate this pricing framework, which corresponds to a simple market with only one tradable instrument, the money-market account, and the hypothesis that interest rates are deterministic and constant. It is actually not difficult to generalize this result to time-dependent deterministic interest rates $r_t$. Using $dm_t = -r_t m_t dt$, we have $m_T = m_t e^{-\int_t^T r_s ds}$, i.e. $D(t, t_i) = e^{-\int_t^{t_i} r_s ds}$.

In practice, though, it is too simplistic to assume that the price of bonds, even those issued by governments with sound finances, carries no idiosyncratic country risk factor. This can be seen empirically, since the prices of traded bonds do not usually match the discounting of their future cash-flows using interbank rates. The standard practice is to introduce issuer-specific interest rate risk factors, defined by the so-called yield curve $y(t, T_k)$, which by definition is the interest rate that matches the market prices of standard bonds with maturities $T_k$:

$$B_{k,t}^{mkt} = \sum_{i=1}^{N} e^{-y(t, T_k)(t_i-t)} \gamma_i c_k M_k + e^{-y(t, T_k)(T_k-t)} M_k$$
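Given a market price, the yield $y(t, T_k)$ is obtained by inverting this formula numerically. Since the price is strictly decreasing in the yield, a simple bisection suffices; the inputs below are illustrative:

```python
import numpy as np

def bond_price_from_yield(y, times, coupon_rate, notional, gamma=1.0):
    """B = sum_i exp(-y (t_i - t)) gamma c M + exp(-y (T - t)) M, with t = 0."""
    times = np.asarray(times, dtype=float)
    cfs = np.full_like(times, gamma * coupon_rate * notional)
    cfs[-1] += notional    # principal at maturity
    return np.sum(np.exp(-y * times) * cfs)

def implied_yield(target_price, times, coupon_rate, notional,
                  lo=-0.05, hi=0.50):
    """Invert the price-yield relation by bisection: price is monotonically
    decreasing in y, so [lo, hi] brackets the root."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if bond_price_from_yield(mid, times, coupon_rate, notional) > target_price:
            lo = mid   # price too high -> yield must be higher
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Round-trip check with made-up inputs: recover y = 3%
times = [1.0, 2.0, 3.0]
p = bond_price_from_yield(0.03, times, 0.05, 100.0)
print(implied_yield(p, times, 0.05, 100.0))  # ~0.03
```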

Again, in order to extend this pricing framework to other instruments without liquid prices, we need to be able to interpolate the yield curve to other maturities. Market practitioners might directly use interpolation schemes that ensure the yield curve is well behaved, e.g. that it does not produce arbitrageable prices. There is also a large literature on term-structure interest rate models from which consistent parametric yield curve functions can be derived and then fitted to market prices. We refer the reader to Brigo & Mercurio, 2006; Andersen & Piterbarg, 2010; Andersen & Piterbarg, 2010 for more details.

For the purpose of our discussion on how to build stochastic discount factor models, let us consider one of the simplest instances of such term-structure models, the Vasicek model Vasicek, 1977. This model assumes that the entire yield curve is driven by a single risk factor, an instantaneous continuously compounded short rate $r_t$ that drives the movements of the full yield curve $y(t, T)$. The short rate $r_t$ is modeled as a stochastic process following an Ornstein–Uhlenbeck mean-reverting dynamics, as discussed in Stochastic Calculus:

$$dr_t = \kappa (\theta - r_t) dt + \sigma dW_t$$

where $\kappa > 0$ is the speed of mean reversion, $\theta$ the long-run mean level, $\sigma > 0$ the volatility and $W_t$ a Wiener process. Notice that this short rate is no longer an interbank reference rate, but a funding rate linked to the issuer. As mentioned above, an alternative model could keep an explicit decomposition $r_t = r_t^{ois} + s_t$, where $r_t^{ois}$ is the interbank rate and $s_t$ the spread associated with the specific issuer, linked to its specific funding, credit and liquidity characteristics. But we will not follow this path in this section.
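The mean-reverting behaviour is easy to visualize by simulation. A sketch with hypothetical parameters, using the exact Gaussian transition of the Ornstein–Uhlenbeck process rather than an Euler scheme, so there is no discretization bias:

```python
import numpy as np

def simulate_vasicek(r0, kappa, theta, sigma, dt, n_steps, n_paths, seed=0):
    """Simulate dr = kappa (theta - r) dt + sigma dW using the exact
    one-step OU transition: r' = theta + (r - theta) e^{-kappa dt} + noise."""
    rng = np.random.default_rng(seed)
    decay = np.exp(-kappa * dt)
    step_std = sigma * np.sqrt((1.0 - decay**2) / (2.0 * kappa))
    r = np.full(n_paths, r0, dtype=float)
    paths = [r.copy()]
    for _ in range(n_steps):
        r = theta + (r - theta) * decay + step_std * rng.standard_normal(n_paths)
        paths.append(r.copy())
    return np.array(paths)   # shape (n_steps + 1, n_paths)

# Illustrative (made-up) annualized parameters, daily steps over 5 years
paths = simulate_vasicek(r0=0.01, kappa=0.8, theta=0.03, sigma=0.01,
                         dt=1 / 252, n_steps=252 * 5, n_paths=10_000)
print(paths[-1].mean())  # close to theta + (r0 - theta) e^{-kappa * 5}
```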

In order to find the SDF, we still assume there is a money-market account $\beta_t$, now linked to the funding short rate $r_t$ of the issuer. Additionally, we define so-called zero-coupon bonds (ZCBs) that pay only a principal of 1 at maturity $T$, whose prices are directly the discount factors, since:

$$P_t(1_{T}) = {\mathbb E}_t \left[\frac{m_T}{m_t} \right] = D(t, T)$$

We propose the simplest ansatz for the SDF that preserves its positivity, and hence the absence of arbitrage discussed above: as in chapter Stochastic Calculus, a log-normal process whose stochastic differential equation is given by:

$$\frac{d m_t}{m_t} = \mu_t(r_t) dt - \lambda_t(r_t) dW_t$$

where $\mu$ and $\lambda$ are, for the moment, generic functions of time and the short rate; the minus sign in front of $\lambda$ is a convention, chosen so that $\lambda$ can be interpreted as the market price of risk.

Our SDF has to price all the instruments in our market: the money-market account and the zero-coupon bonds. We first apply the continuous-time version of our pricing equation to the money-market account, namely:

$${\mathbb E}[d(m_t \beta_t)] = 0$$

where, recall, $d\beta_t = r_t \beta_t dt$. Applying Ito’s lemma:

$$d(m_t \beta_t) = dm_t \beta_t + m_t d\beta_t + dm_t d\beta_t = \beta_t m_t (\mu_t(r_t) + r_t) dt - \beta_t m_t \lambda_t(r_t) dW_t$$

Applying the pricing equation, we get a condition on the SDF:

$${\mathbb E}[d(m_t \beta_t)] = \beta_t m_t (\mu_t(r_t) + r_t) dt = 0 \rightarrow \mu_t(r_t) = - r_t$$

Let us apply it now to the ZCBs. We make the ansatz $D(t, T) = f(t, r_t)$, given that the SDF itself is Markovian in $r_t$. Applying Ito’s lemma to this expression, we get:

$$d f(t, r_t) = \frac{\partial f}{\partial t} dt + \frac{\partial f}{\partial r_t} dr_t + \frac{1}{2} \frac{\partial^2 f}{\partial r_t^2} \sigma^2 dt$$

where we have used the SDE for $r_t$ given by the Vasicek model. Now we apply Ito’s lemma to $d(m_t f(t, r_t))$:

$$d (m_t f(t, r_t)) = m_t \left(\frac{\partial f}{\partial t} + \kappa (\theta - r_t) \frac{\partial f}{\partial r_t} + \frac{1}{2} \sigma^2 \frac{\partial^2 f}{\partial r_t^2}\right) dt + m_t \sigma \frac{\partial f}{\partial r_t} dW_t + f m_t (-r_t dt - \lambda_t dW_t) - \lambda_t m_t \sigma \frac{\partial f}{\partial r_t} dt$$

We use now the pricing equation:

$${\mathbb E}[d(m_t D(t, T))] = 0$$

to get the following partial differential equation for $f(t, r_t)$:

$$\frac{\partial f}{\partial t} + \kappa (\theta - r_t) \frac{\partial f}{\partial r_t} + \frac{1}{2} \sigma^2 \frac{\partial^2 f}{\partial r_t^2} - r_t f - \lambda_t \sigma \frac{\partial f}{\partial r_t} = 0$$

with terminal condition $f(T, r_T) = 1$. To solve this equation we make a further simplification and consider $\lambda_t(r_t)$ to be affine in $r_t$:

$$\lambda_t(r_t) = \lambda_0 + \lambda_1 r_t$$

In this case, the exponential ansatz $f(t, r_t) = A(t, T) e^{-B(t, T)r_t}$ transforms the problem into the following two ordinary differential equations:

$$\dot{B}(t, T) = (\kappa + \lambda_1 \sigma) B(t, T) - 1$$
$$\dot{A}(t, T) = A(t, T)\left[\kappa \theta B(t, T) - \frac{1}{2}\sigma^2 B(t, T)^2 - \lambda_0 \sigma B(t, T)\right]$$

with terminal conditions $B(T, T) = 0$ and $A(T, T) = 1$. The solution reads:

$$B(t, T) = \frac{1- e^{-(\kappa + \sigma \lambda_1)(T-t)}}{\kappa + \sigma \lambda_1}$$
$$A(t, T) = \exp\left[\left(\frac{\kappa \theta - \sigma \lambda_0}{\kappa + \sigma \lambda_1} - \frac{\sigma^2}{2 (\kappa + \sigma \lambda_1)^2}\right)\left(B(t, T) - (T-t)\right) - \frac{\sigma^2}{4(\kappa + \sigma \lambda_1)}B(t, T)^2\right]$$

With this solution, we can now fit the parameters $\lambda_0$ and $\lambda_1$ to the prices of zero-coupon bonds, which can themselves be extracted from liquid bond prices. As expected, though, with only two parameters we will be able to fit this term structure only approximately. In order to fit the prices of an arbitrary set of liquid bonds from a given issuer, we need a model that allows for further flexibility, such as the Hull & White model Hull & White, 1990.
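To illustrate, the sketch below implements the affine closed form $D(t, T) = A(t, T) e^{-B(t, T) r_t}$ and fits $\lambda_0$, $\lambda_1$ by a crude grid search against a handful of made-up market discount factors; all parameter values are hypothetical, and a real calibration would use a proper optimizer:

```python
import numpy as np

def vasicek_zcb(r, tau, kappa, theta, sigma, lam0=0.0, lam1=0.0):
    """Zero-coupon bond price D = exp(log_A - B r) under the Vasicek model
    with an affine market price of risk lambda_t = lam0 + lam1 r."""
    k = kappa + sigma * lam1                      # risk-adjusted mean reversion
    B = (1.0 - np.exp(-k * tau)) / k
    log_A = ((kappa * theta - sigma * lam0) / k - sigma**2 / (2.0 * k**2)) \
            * (B - tau) - sigma**2 * B**2 / (4.0 * k)
    return np.exp(log_A - B * r)

# Crude calibration sketch: grid search over (lam0, lam1) against
# hypothetical market discount factors extracted from liquid bond prices.
r0, kappa, theta, sigma = 0.02, 0.8, 0.03, 0.01
taus = np.array([1.0, 2.0, 5.0, 10.0])
D_mkt = np.array([0.978, 0.954, 0.880, 0.750])   # made-up market data

best = None
for lam0 in np.linspace(-1.0, 1.0, 81):
    for lam1 in np.linspace(-5.0, 5.0, 81):
        err = np.sum((vasicek_zcb(r0, taus, kappa, theta, sigma,
                                  lam0, lam1) - D_mkt) ** 2)
        if best is None or err < best[0]:
            best = (err, lam0, lam1)
print(best)  # residual error and fitted (lam0, lam1): the fit is approximate
```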

Stock pricing

Option pricing

Connection to previous pricing frameworks

Exercises

Footnotes
  1. We consider non-dividend paying stocks for simplicity; the extension of the theory to dividend-paying stocks is relatively straightforward

  2. A well-known historical shortcoming of the BSM framework is the implication that European options on the same underlying with different strikes and maturities should have the same implied volatility, equal to the expected volatility of the stock. The implied volatility is the one obtained by inverting the BSM formula given prices observed in the market, assuming for instance that there is a set of standard options traded on an exchange. In the first years of application of the BSM theory to price options, around the 1980s, this had the consequence of producing very small premiums for options, particularly put options, with strikes very deep out-of-the-money (i.e. far from the underlying price at the time of quoting). This was a consequence of the Gaussian assumption on price returns, which predicted a very low probability of such options being exercised. In 1987, this prediction was dramatically contradicted when the market crashed, forcing dealers to readjust prices with respect to the BSM formula. Multiple models, such as local or stochastic volatility models, or models with jumps in the dynamics, have since been proposed to address these issues. One point to bear in mind is that the logic applied in the BSM framework can still be applied when introducing these more complex dynamics, and deterministic premiums can be derived as long as we add extra instruments to the hedging portfolio that allow the dealer to neutralize those risks (stochastic volatility, jumps, etc.)

References
  1. Cochrane, J. H. (2005). Asset Pricing: Revised Edition. Princeton University Press.
  2. Guéant, O., & Pu, J. (2018). Mid-price estimation for European corporate bonds: a particle filtering approach (Papers 1810.05884). arXiv.org.
  3. Sinclair, E. (2010). Option Trading: Pricing and Volatility Strategies and Techniques (1st ed.). Wiley.
  4. Guo, X., Lai, T. L., Shek, H., & Wong, S. P.-S. (2017). Quantitative Trading: Algorithms, Analytics, Data, Models, Optimization. Chapman. 10.1201/9781315371580
  5. Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654.
  6. Merton, R. C. (1973). Theory of rational option pricing. Bell Journal of Economics and Management Science, 4(1), 141–183.
  7. Joshi, M. S. (2003). The Concepts and Practice of Mathematical Finance. Cambridge University Press.
  8. Wilmott, P. (2007). Paul Wilmott Introduces Quantitative Finance (2nd ed.). John Wiley & Sons.
  9. Evans, L. C. (2010). Partial Differential Equations (2nd ed., Vol. 19). American Mathematical Society.
  10. Brigo, D., & Mercurio, F. (2006). Interest Rate Models: Theory and Practice: With Smile, Inflation and Credit (2nd ed.). Springer.
  11. Andersen, L. B. G., & Piterbarg, V. V. (2010). Interest Rate Modeling. Volume 1: Foundations and Vanilla Models. Atlantic Financial Press.
  12. Andersen, L. B. G., & Piterbarg, V. V. (2010). Interest Rate Modeling. Volume 2: Term Structure Models. Atlantic Financial Press.
  13. Vasicek, O. A. (1977). An equilibrium characterization of the term structure. Journal of Financial Economics, 5(2), 177–188. 10.1016/0304-405X(77)90016-2
  14. Hull, J., & White, A. (1990). Pricing interest-rate-derivative securities. The Review of Financial Studies, 3(4), 573–592.