Processing math: 31%
World Scientific
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at customercare@wspc.com for any enquiries.
https://doi.org/10.1142/S0219024924500031Cited by:5 (Source: Crossref)

Abstract

In this paper, we introduce the log-normal stochastic volatility (SV) model with a quadratic drift to allow arbitrage-free valuation of options on assets under money-market account and inverse martingale measures. We show that the proposed volatility process has a unique strong solution, despite non-Lipschitz quadratic drift, and we establish the corresponding Feynman–Kac partial differential equation (PDE) for computation of conditional expectations under this SV model. We derive conditions for arbitrage-free valuations when return–volatility correlation is positive to preclude the “loss of martingality”, which occurs in many traditional SV models. Importantly, we develop an analytic approach to compute an affine expansion for the moment generating function of the log-price, its quadratic variance (QV) and the instantaneous volatility. Our solution allows for semi-analytic valuation of vanilla options under log-normal SV models closing a gap in existing studies.

We apply our approach for solving the joint valuation problem of vanilla and inverse options, which are popular in the cryptocurrency option markets. We demonstrate the accuracy of our solution for valuation of vanilla and inverse options.a By calibrating the model to time series of options on Bitcoin over the past four years, we show that the log-normal SV model can work efficiently in different market regimes. Our model can be well applied for modeling of implied volatilities of assets with positive return–volatility correlation.

1. Introduction

Empirical studies strongly support evidence that the volatility of price returns is itself a stochastic process (see, for example, Shephard (2005) for a comprehensive survey). It is accepted that the celebrated model by Black & Scholes (1973) and Merton (1973), which assumes constant volatility, cannot explain implied volatility surfaces observed in option markets, which are inhomogeneous in strike and maturity dimensions. In contrast, stochastic volatility (SV) models are able to fit to implied volatility surfaces and their dynamics.

1.1. Evidence of log-normality of implied and realized volatilities

Applications of log-normal SV models, which are based either on the log-normal dynamics for the instantaneous volatility or on the Gaussian dynamics for the logarithm of the volatility, are widespread in practice. Predominantly, SABR model by Hagan et al. (2002) is widely used among practitioners for fitting implied volatility surfaces. However, for modeling the dynamics of volatility surfaces, we need to account for the mean-reversion of the volatility process like in conventional SV models. The mean-reversion ensures that the distribution of the volatility has a stationary long-run distribution and volatility paths simulated in Monte Carlo (MC) do not explode. In another widely used specification, the so-called Exp-OU specification, the SV process is defined for the logarithm of the volatility.b

In an excellent study Christoffersen et al. (2010) examine the empirical performance of Heston (see Heston (1993)), log-normal and 3/2 SV models using three sources of market data: the VIX index, the implied volatility for options of the S&P500 index, and the realized volatility of returns on the S&P500 index. They found that, for all three sources of data, the log-normal SV model outperforms its alternatives. Tegner & Poulsen (2018) report similar finding using realized volatilities.

1.2. Analytical tractability

Log-normal SV models cannot be handled by semi-analytic Fourier methods available for affine SV models. The analytical intractability may have lead to a slow adaptation of log-normal SV models in spite of their empirical support. On the one hand, the log-normal SV model with the linear drift does not belong to the family of affine diffusions because the so-called admissibility conditions are not satisfied for the variance term, see Duffie et al. (2003), Filipović & Mayerhofer (2009). On the other hand, the log-normal SV model with the quadratic drift does not belong to a general family of polynomial diffusions, see Filipović & Larsson (2016), Cuchiero et al. (2012). Carr & Willems (2019) introduce measure change to relate the model to a class of polynomial diffusions, which comes at cost of introducing an additional state variable. Authors then approximate the payoff function using an orthogonal polynomial expansion, see Ackerer & Filipović (2020), for option pricing applications. A related approach for computing moments under SV models using expansions is developed by Alòs et al. (2020).

We contribute to these studies by developing a closed-form approximate and very accurate solution for the log-normal SV model which enables analytical intractability of this model. We develop an affine expansion to derive closed-form solution for the moment generating function (MGF) of the log-price process, volatility and its quadratic variance (QV). We show that our solution to the MGF produces valid probability density functions of the state variables, which are matched with MC simulations.

1.3. Properties of log-normal SV model

In a related research, Lewis (2018) introduces a log-normal SV model with a quadratic drift without constant term. Lewis (2018) suggests to apply the model only when return–volatility correlation is negative because, otherwise, the model leads to the loss of martingality. Carr & Willems (2019) introduce the log-normal SV model with the quadratic drift, similar to our model. In our paper we augment the log-normal SV model with the linear drift term introduced in Karasinski & Sepp (2012) to include the quadratic drift term. We contribute to studies of Lewis (2018) and Carr & Willems (2019) by defining the martingale conditions under both MMA and inverse martingale measures. We also formulate the Feynman–Kac theorem linking the model dynamics with the quadratic drift to the partial differential equation (PDE) approach, because the classic Feynman–Kac does not apply to non-Lipschitz coefficients. We then introduce the MGF as the solution to the model PDE and develop an analytic transform-based approach for the valuation of vanilla and inverse options.

1.4. Assets with positive return–volatility correlation

Carr & Wu (2017) propose the following three economic factors behind the negative return–volatility correlation, which most typically observed in the dynamics of stock prices and stock indices: aggregate financial leverage, systematic risk, positive auto-correlation of negative shocks. However, in many markets we actually observe price dynamics with positive return–volatility correlation either on a permanent basis (for VIX index, short and leveraged short exchange-traded funds (ETFs)) or on a transitory regime-based basis (for some commodities, foreign currencies, and cryptocurrencies). In fact, exp-ou and log-normal SV models with the linear drift lead to the “loss of martingality” of the price process when the return–volatility correlation is positive (see, for example, Lewis (20002018) and Lions & Musiela (2007)). We contribute by establishing conditions on model parameters of log-normal SV model which produce true martingale dynamics.

1.5. Inverse martingale measure

For inverse options, both option premium and final payoff are quoted in the units of the underlying asset. In particular, for options on cryptocurrencies, the advantage of inverse payoffs is that all option-related transactions can be handled using units of underlying cryptocurrencies, such as Bitcoin or Ether, without using fiat currencies (see Lucic (2021) and Lucic & Sepp (2023) for a connection between the valuation of the inverse cryptocurrency options and FX options). In case of cryptocurrencies, cash settled options would require using a stablecoin as a tradable equivalent of United States Dollar (USD) or any other fiat currency. Given inherent and frequent depegging risk of stablecoins, Alexander et al. (2023) argue for wider adoptation of inverse and quanto options. We also suggest that similar payoff can be advantageous for some commodity markets (such as gold) where option holders may opt for physical settlement of positive option premium through the delivery of the underlying commodity.

We contribute to the literature by incorporating the inverse measure in our valuation framework using log-normal SV model. We formulate and solve the joint valuation problem of vanilla and inverse options using Fourier transform methods.

1.6. Organization of the paper

In Sec. 2, we provide the framework for valuation under money-market account (MMA) and inverse measures. In Sec. 3, we introduce the log-normal beta SV model with quadratic drift and establish important properties of the volatility process. We further characterize martingale property for the drift-adjusted price process and its inverse under MMA and inverse measures, respectively. We also study moments of the volatility process by presenting an approximate solution of the finite-dimensional linear system of ordinary differential equations (ODEs). Finally, we present drift-implicit Euler scheme for MC simulation that preserves the positivity of the volatility process and study its convergence properties. In Sec. 4, we present first- and second-order affine expansion for the MGF of the log-price process, the QV and the volatility. In Sec. 5, we apply affine expansion to derive closed-form solutions for option valuation problem under MMA and inverse measures. In Sec. 6, we apply our method for model calibration to market data and demonstrate the accuracy of both the model specification and our proposed solution methods. We conclude in Sec. 7. Technical proofs are provided in Appendix A.

2. Framework and Definitions

2.1. Vanilla and inverse payoffs

The payoff of (cash-settled) vanilla options is settled in the units of cash. In contrast, the payoff of an inverse option is settled in the units of the underlying asset using the spot price observed at the option maturity. We denote the spot price at time T by ST and option strike price by K, with both quoted in United States Dollars (USD). We consider vanilla call and put payoffs settled in USD cash at maturity time T and denote them by

ucall(ST)=max{STK,0},uput(ST)=max{KST,0}.(2.1)
Payoffs of inverse call and put options are converted to units of the underlying asset at maturity T as follows :
rcall(ST)=1STmax{STK,0},rput(ST)=1STmax{KST,0}.(2.2)

2.2. Valuation under equivalent MMA and inverse measures

We consider a continuous-time market with a fixed horizon date T>0 and uncertainty modeled on probability space (Ω,𝔽,) equipped with filtration 𝔽={t}0tT. We assume that 𝔽 is right-continuous and satisfies usual conditions.

2.2.1. Spot markets

Assumption 2.1. Risk-free rate r(t) is deterministic with the value of one unit of the MMA M(T) given by M(T)=eˉr(0,T), ˉr(t,T)=Ttr(s)ds.

We assume that the market is complete and satisfies the so-called No Free Lunch with Vanishing Risk (NFLVR) and No Dominance (ND) conditions (see Jarrow & Larsson (2012)). By choosing M as a numéraire, we consider an equivalent martingale measure , induced by M. Accordingly, the time-t value of an option, denoted by U(t,S), with payoff function u(ST) at time T, equals

U(t,S)=M(t)𝔼[1M(T)u(ST)|t],(2.3)
where expectation 𝔼 is taken under the MMA martingale measure .

Next, by choosing spot price S as a numéraire, we consider an inverse martingale measure ˜, induced by S. The option value function, denoted by Ũ(t,S), with the payoff paid in units of S, equals

Ũ(t,S)=St̃𝔼[1STu(ST)|t],(2.4)
where expectation ̃𝔼 is taken under the inverse martingale measure ˜.

Theorem 2.1. We assume that market is complete and satisfies NFLVR and ND conditions. We further assume that both MMA measure  and inverse measure ˜ are equivalent martingale measures. Then the values of options under MMA measure in Eq. (2.3) and under inverse measure in Eq. (2.4) are unique and satisfy

M(t)𝔼[1M(T)u(ST)|t]=St̃𝔼[1STu(ST)|t].(2.5)

Proof. As shown in Theorem 3.2 of Jarrow & Larsson (2012) and Theorem 2.17 of Herdegen & Schweizer (2018), the market satisfies NFLVR and ND conditions if and only if is a true martingale measure. Hence, asset prices, discounted by numéraire M are true -martingales. Consequently, due to the uniqueness of price, the equality in Eq. (2.5) holds. To justify the equality of both sides, we apply the existence and uniqueness of equivalent inverse (true) martingale measure ˜ for the numéraire S, see Theorem 4.4 and Corollary 3.11 in Herdegen & Schweizer (2018). □

2.2.2. Futures markets

Since most option markets for commodities and cryptocurrencies are based on futures, we also adopt our methodology for futures options. We consider a set of futures contracts settled at times {Tk}k=1,,K with prices {FTkt}k=1,,K.

Assumption 2.2. The convenience yield q(t) of futures prices is deterministic with the integrated convenience yield given by

ˉq(t,T)=Ttq(s)ds.(2.6)

Definition 2.1. Measure ˜T induced by price FTt with futures maturity at T, T{Tk}, chosen as a numéraire is called the inverse T-futures measure.

Corollary 2.1. We assume that market is complete and satisfies NFLVR and ND conditions. We further assume that both MMA measure  and inverse futures measure ˜T are equivalent martingale measures. Then option values under MMA and inverse measures are unique and satisfy

M(t)𝔼[1M(T)u(FT)|t]=Ft̃𝔼T[1FTu(FT)|t].(2.7)

For markets with both vanilla and inverse options being tradable, such as cryptocurrency markets, any SV model applied for joint valuation of options across several exchanges and contract specifications must satisfy the conditions (2.5) and (2.7). For the proposed log-normal SV dynamics in Eq. (3.12), we must impose certain conditions on model parameters, which we address in Theorem 3.7.

2.2.3. Put–call parity

We will discuss an important topic of the put–call parity for inverse options under a generic SV model with SV driver σt. We establish that the put–call parity is valid if volatility σt is not explosive.

Lemma 2.1. We assume that volatility process σt is nonexplosive under . Then put–call parity for vanilla call option, C(t,S,K), and put option, P(t,S,K), defined by

C(t,S,K)P(t,S,K)=SKeˉr(t,T)(2.8)
holds irrespective whether the price process is a true martingale under .

We assume that volatility process σt is nonexplosive under ˜. Put–call parity for inverse call option, ˜C(t,S,K), and inverse put option, ˜P(t,S,K), defined by

˜C(t,S,K)˜P(t,S,K)=1S1Keˉr(t,T)(2.9)
holds irrespective whether (drift-adjusted) inverse price process is a true martingale under ˜.

Proof. We only provide a sketch of the proof and refer to Theorems 9.2 and 9.3 in Lewis (2000) and to Sin (1998) for details. As vanilla call option payoff is unbounded, the price of the call option is only local martingale under . Taking into account “correction” term (so-called martingale defect) reflecting the difference between true martingale and strict local martingale properties, yields Eq. (2.8). Relationship in Eq. (2.9) follows from the same argument, but applied to ˜. □

In Theorem 3.2, we show that volatility process σt does not explode under the proposed log-normal SV dynamics in Eq. (3.12).

3. Log-Normal Beta SV Model

3.1. Dynamics under measure

We start with statistical measure and we consider the log-normal beta SV model introduced by Karasinski & Sepp (2012). We augment the dynamics with the quadratic drift in the volatility process following Lewis (2018) and Carr & Willems (2019). We first introduce the dynamics of price process St and volatility process σt under statistical measure as follows :

dSt=μtStdt+σtStdW(0)t,dσt=(κ1+κ2σt)(𝜃σt)dt+βσtdW(0)t+𝜀σtdW(1)t,(3.1)
where μt is statistical drift, W(0), W(1) are uncorrelated Brownian motions under , κ1>0 and κ20 are linear and quadratic mean-reversion rates, respectively, 𝜃>0 is the mean of the volatility, β is the volatility beta which measures the sensitivity of the volatility to changes in the spot price, and 𝜀>0 is the volatility of residual volatility.

Empirically, Bakshi et al. (2006) find that the presence of nonlinear terms, including the quadratic term, in the volatility drift is significant when using econometric estimation of the dynamics of the VIX index.

We assume that statistical measure and MMA measure are related by

ζ(t):=dd|t=[t0λ0(u)dW(0)u+λ1(u)dW(1)u],(3.2)
where () denotes Itô exponential, and λ0(t) and λ1(t) are equity and volatility risk-premias, respectively.

Assuming that process ζ(t) is a true martingale, the following processes

Ŵ(0)t=W(0)t+t0λ0(u)du,Ŵ(1)t=W(1)t+t0λ1(u)du(3.3)
define independent Brownian motions under by Girsanov theorem (we establish necessary and sufficient conditions for equivalence of measures in Theorem 3.1). Thus, the dynamics of price process in Eq. (3.1) under MMA measure becomes
dSt=(μtλ0(t)σt)Stdt+σtStdŴ(0)t.
As discounted price process must be a martingale under , we need to require that
μtλ0(t)σt=rtλ0(t)=μtrtσt.(3.4)

As a result, the dynamics of volatility process under MMA measure become

dσt=[κ1𝜃(κ1κ2𝜃)σtκ2σ2t(βλ0(t)+𝜀λ1(t))σt]dt+βσtdŴ(0)t+𝜀σtdŴ(1)t,(3.5)
where Ŵ(0), Ŵ(1) are independent Brownian motions under .

As customary,c we assume that the market price of risk is proportional to the volatility as follows :

λ0(t)=ˉλ0σt,λ1(t)=ˉλ1σt,(3.6)
where ˉλ0 and ˉλ1 are constants. As a result, we rewrite Eq. (3.5) as follows :
dσt=[κ1𝜃(κ1κ2𝜃)σt(κ2+βˉλ0+𝜀ˉλ1)σ2t]dt+βσtdŴ(0)t+𝜀σtdŴ(1)t.(3.7)

To retain the functional form of the volatility process, we rescale model parameters ̂𝜃, ˆκ1, ˆκ2 under as follows :

κ1𝜃=ˆκ1̂𝜃,κ1κ2𝜃=ˆκ1ˆκ2̂𝜃,κ2+βˉλ0+𝜀ˉλ1=ˆκ2.(3.8)
Analytic inversion of Eq. (3.8) yields risk-transformed model parameters under :
ˆκ2=κ2+βˉλ0+𝜀ˉλ1,̂𝜃=12ˆκ2((κ1κ2𝜃)+(κ1κ2𝜃)2+4𝜃κ1ˆκ2),ˆκ1=12((κ1κ2𝜃)+(κ1κ2𝜃)2+4𝜃κ1ˆκ2).(3.9)

Thus, under martingale measure , the dynamics of price and volatility processes become

dSt=rtStdt+σtStdŴ(0)t,dσt=(ˆκ1+ˆκ2σt)(̂𝜃σt)dt+βσtdŴ(0)t+𝜀σtdŴ(1)t.(3.10)

Theorem 3.1. We assume that κ10, 𝜃>0. Then measures  and  are equivalent, iff

κ2max{βˉλ0𝜀ˉλ1,0}.(3.11)

Proof. By analogy to the proof of Theorem 3.6. □

As a result, we obtain that the functional form of log-normal SV model with the quadratic drift is invariant under the change of measure from to . We note that the log-normal volatility model with linear drift does not share such invariance property due to a quadratic term arising under . The model invariance under the change of measure is an important feature shared by the log-normal SV model with quadratic drift. In contrast, Sepp & Rakhmonov (2023) show that most conventional SV models do not satisfy the invariance of -model under the change of measure to . In the following, we will consider model dynamics under and drop the hat notation in Eq. (3.10).

3.2. Dynamics under MMA measure

We consider price dynamics with the log-normal beta SV model in Eq. (3.1) under the MMA measure . The joint dynamics are specified for the spot price St, the instantaneous volatility σt and the QV It under the MMA measure as follows :

dSt=r(t)Stdt+σtStdW(0)t,S0=S,dσt=(κ1+κ2σt)(𝜃σt)dt+βσtdW(0)t+𝜀σtdW(1)t,σ0=σ,dIt=σ2tdt,I0=I,(3.12)
where r(t) is the deterministic risk-free rate; W(0)t and W(1)t are uncorrelated Brownian motions, and the model parameters are the same as defined in Eq. (3.1). Using volatility beta β and volatility-of-volatility 𝜀, we introduce the total instantaneous variance of the volatility process 𝜗2 as follows :
𝜗2=β2+𝜀2.(3.13)

The mean-reversion of the volatility process is specified using the mean level of the volatility 𝜃, 𝜃>0, and the mean-reversion speed which is the linear function of the volatility κ1+κ2σt, κ10,κ20. As a result, the quadratic drift of the volatility process is given by

μ(σ)(κ1+κ2σ)(𝜃σ)=κ1𝜃(κ1κ2𝜃)σκ2σ2.(3.14)

We introduce the zero-drift stochastic driver Zt, Zt=et0r(t)dtSt and its inverse Rt, Rt=Z1t, with the following dynamics :

dZt=ZtσtdW(0)t,Z0=S0,dRt=σ2tRtdtRtσtdW(0)t,R0=1/Z0.(3.15)

Next, we introduce the log of the zero-drift price process Xt=lnZt :

dXt=12σ2tdt+σtdW(0)t,X0=lnS0.(3.16)

Using XT under Assumption 2.1, we represent the spot price ST by

ST=eˉr(0,T)eXT.(3.17)
Similarly, under Assumption 2.2, using Eq. (3.17), we represent the futures settled at time T in as
fT(T)=eˉr(0,T)ˉq(0,T)eXT,(3.18)
where the integrated convenience rate ˉq(0,T) is defined in Eq. (2.6). Accordingly, for modeling purposes, we focus on the zero-drift price dynamics Zt and its log-price process Xt defined in Eq. (3.16).

3.2.1. Properties of log-normal SV process

We establish important properties related to the regularity of the volatility process. We consider the volatility process in model dynamics (3.12) represented as follows :

dσt=μ(σt)dt+v(σt)dW()t,σ0=σ,μ(σ)=(κ1+κ2σ)(𝜃σ),v(σ)=𝜗σ,(3.19)
where W()t is a standard Brownian motion and 𝜗 is the total volatility of volatility.

We notice that the process σt is not a polynomial diffusion, see Filipović & Larsson (2016), Lemma 2.2, because the drift coefficient is a second-order polynomial in σt. Since the process in Eq. (3.19) has super-linearly growing drift, standard results for the uniqueness of strong solution are no longer applicable, see Sec. 5 of Karatzas & Shreve (1991) for details. We will establish the existence of the unique strong solution in Theorem 3.3. We first provide a boundary classification for the volatility process to show that log-normal volatility process σt in dynamics (3.19) cannot reach 0 or explode to +, which are important properties of the volatility dynamics. Our proof is based on existence of solutions for general stochastic differential equations (SDEs) with so-called monotonic coefficients, so that it is potentially applicable to wider class of diffusion processes.

Assumption 3.1. We assume that model parameters κ1, κ2, 𝜃, 𝜗 satisfy

κ10,κ20,𝜃>0,𝜗0.(3.20)

Theorem 3.2 (Regularity). Under Assumption 3.1, the boundary points {0,+} are unattainable for process σt in SDE (3.19).

Proof. Our proof is based on Feller boundary classification for one-dimensional diffusion processes with technical details given in Sec. A.1. □

Theorem 3.3 (Existence and uniqueness of solution). Under Assumption 3.1, SDE in Eq. (3.19) has the unique strong solution.

Proof. We will refer to following theorem that guarantees the existence and uniqueness of general SDEs with monotonic coefficients.

Theorem 3.4 (Gyöngy & Krylov (1980)). Let at(x):[0,T]×, bt(x):[0,T]× be functions, continuous in x, satisfying following conditions:

(1)

T0sup|x|R(|at(x)|+|bt(x)|2)dt< for any T0, R0,

(2)

for x,y,z such that |x|R,|y|R,|z|R following conditions hold:

(a)

(monotonicity condition) :

2(xy)(at(x)at(y))+(bt(x)bt(y))2Kt(R)(xy)2,(3.21)

(b)

(growth condition): 2zat(z)+(bt(z))2Kt(1)(1+z)2,

where function Kt(R):[0,T]×++ satisfies t0Ks(R)ds< for any T0, R0.

Then there exists unique solution of the SDE

xt=x0+t0as(xs)ds+t0bs(xs)dWs,tT.(3.22)

We show that SDE (3.19), where κ1,κ2,𝜃,𝜗 satisfy (3.20), does indeed satisfy conditions 1–3. We emphasize that the positivity of the process demonstrated in Theorem 3.2 is important. Since in our case at(x)=(κ1+κ2x)(𝜃x), bt(x)=𝜗x, we get

2(xy)(at(x)at(y))+(bt(x)bt(y))2=(𝜗22(κ1κ2𝜃))(xy)22κ2(x+y)(xy)2(𝜗22(κ1κ2𝜃)+2κ2R)(xy)2
when |x|R,|y|R. As for any R>0 we have t0[𝜗22(κ1κ2𝜃)+2κ2R]ds<, we conclude that 2a is satisfied. For the growth condition, we have
2zat(z)+bt(z)2=2κ1𝜃z+[𝜗22(κ1κ2𝜃)]z22κ2z3.(3.23)

Using that zero is unattainable boundary and simple inequalities z1+z2, z3<0 valid for z>0, we can estimate the right hand side of (3.23) as follows :

2κ1𝜃(z2+1)+[𝜗22(κ1κ2𝜃)]z22κ1𝜃+[2κ1𝜃+𝜗22(κ1κ2𝜃)]z2.
We conclude that the growth condition 2b is also satisfied. □

Proposition 3.1 (Bounded moments). Assume that κ1𝜃>0, κ2>0 and remaining parameters κ2, 𝜃, 𝜗 satisfy conditions (3.20). Let |p|1 and T>0. Then we have

𝔼[supt[0,T]σpt]<.

Proof. First we prove a slightly weaker result. □

Proposition 3.2. Assume that κ1𝜃>0, κ2>0 and κ2, 𝜃, 𝜗 satisfy conditions (3.20). Then, for every |p|1: supt[0,T]𝔼σpt< for any T>0.

Proof. Let us first consider the case p1. We use same approach as in Lions & Musiela (2007). Applying Itô’s lemma to σpt, we have

dσpt=pσp1t[κ1𝜃(κ1κ2𝜃𝜗22p(p1))σtκ2σ2t]dt+p𝜗σptdWt.
The function U(τ)=𝔼(σpt) then solves the problem
Uτ+(σ)U=0,U(0,σ)=σp,(σ)=(κ1𝜃(κ1κ2𝜃)σκ2σ2)σ+12𝜗2σ22σ2.(3.24)
Let us find supersolution Ū to (3.24) using ansatz Ū(σ)=σpexp(aσ+bτ), a>0, b>0. Inserting it into (3.24), we see that its left-hand side equals
σpexp(aσ+b𝜃)[(b+aκ1𝜃p(κ1κ2𝜃)+𝜗22p(p1))+pκ1𝜃σ1+(p𝜗2a(κ1κ2𝜃)κ2p)σ+(κ2a+a2𝜗22)σ2].
Since the term in the brackets is a second-order polynomial in σ, Ū is a supersolution as soon as κ2>0.

The case p1 is treated similarly: we denote q:=p1 and introduce ˜σt:=σ1t which is well-defined due to Theorem 3.2. We find that Ũ(τ):=𝔼(˜σqt) solves the problem

Ũτ+˜(σ)Ũ=0,Ũ(0,σ)=˜σq,˜(σ)=(κ2+(κ1κ2𝜃+𝜗2)˜σκ1𝜃˜σ2)˜σ+12𝜗2˜σ22˜σ2.(3.25)
Inserting ansatz into (3.25), we see that its left-hand side equals
˜σq1exp(a˜σ+b𝜃)[b+()+()˜σ+(2aκ1𝜃+a2𝜗2)˜σ],
where we omitted the terms not depending on b and ˜σ for brevity. Since the term in square brackets is a second-order polynomial in ˜σ, Ũ is a supersolution if κ1𝜃>0. □

Similarly to the proof of Proposition 3.2, we apply Itô’s formula to σpt

σpt=σp0+t0σp1u((κ1+κ2σu)(𝜃σu)+𝜗22p(p1)σpu)dt+p𝜗t0σpudW()u.

We obtain

𝔼supt[0,T]σptσp0+𝔼supt[0,T]t0|pσp1u(k0+k1σu)(𝜃σu)+𝜗22p(p1)σpu|dt+p𝜗𝔼[supt[0,T]t0σpudW()u].
Using Burkhölder–Davis–Gundy inequality and Proposition 3.2, we establish
𝔼[supt[0,T]u0σpudW()u]C𝔼(T0σptdt)1/2C(T0𝔼σ2pt)1/2.

3.3. Dynamics under inverse measure

First we define the model dynamics (3.12) under the inverse measure ˜. As a by-product, we show that in SDE (3.26) the quadratic term is important to preserve the functional specification of volatility drift under ˜.

Theorem 3.5. The dynamics of the volatility σt in (3.12) under ˜ are given by

dσt=(κ1𝜃(κ1κ2𝜃)σt(κ2β)σ2t)dt+βσtd˜W(0)t+𝜀σtd˜W(1)t.(3.26)

MMA measure  and inverse measure ˜ are related by the density process

Λt=𝔼t[d˜/d]=Zt/Z0,Λ0=1,(3.27)
where zero-drift stochastic driver Zt is defined in Eq. (3.15).

Proof. We consider Rt=Z1t. Applying Itô’s lemma, we see that Rt satisfies

dRt=σ2tRtdtσtRtdW(0)t=σtRt(dW(0)tσtdt).

Using Girsanov’s theorem, we switch to the measure ˜ under which the processes ˜W(0)t and ˜W(1)t defined by ˜W(0)t=W(0)tt0σsds, ˜W(1)t=W(1)t are uncorrelated Brownian motions. Then the dynamics of volatility process σt under ˜ become

dσt=(κ1+κ2σt)(𝜃σt)dt+βσt(d˜W(0)t+σtdt)+𝜀σtd˜W(1)t(3.28)
and Eq. (3.26) follows. We note that ˜ and are related by the density Λt as follows :
dΛt/Λt=σtdWtΛt:=exp(t0σsdWs12t0σ2sds)=Zt/Z0.(3.29)
 □

We highlight that the idea of using change of measure to study the martingale property is widely used in the literature, see Ruf (2015) and references therein. We now derive sufficient conditions for the equivalence of measures ˜ which is required for the application of Girsanov theorem in the proof of Theorem 3.5.

Theorem 3.6. Under Assumption 3.1, measures  and ˜ are equivalent if and only if κ2β.

Proof. By Theorem 3.5, the density 𝔼t[d˜/d]=Zt/Z0. Therefore, the martingale property for Zt under guarantees equivalence ˜. Using Theorem 3.7(1) concludes the proof. □

Theorem 3.7 (Martingality of Log-normal SV model with quadratic drift). Under Assumption 3.1, the price processes Zt and Rt in Eq. (3.15) for model dynamics (3.12) have following properties:

(1)

The process Zt is a martingale under the MMA measure  iff κ2β.

(2)

The process Rt is a martingale under the inverse measure ˜ iff κ22β.

Proof. (1) Theorem 9.2 in Lewis (2000) or Sin (1998) shows that Zt is -martingale if and only if the process

dσt=((κ0+κ1σt)(𝜃σt)+βσ2t)dt+𝜗σtd˜W()t
has unique strong solution under ˜. Using Theorem 3.3, we immediately obtain that the uniqueness holds iff κ2β.

(2) According to Theorem 3.5, the dynamics of {σt,Rt} under ˜ become

dσt=((κ1+κ2σt)(𝜃σt)+βσ2t)dt+βσtd˜W(0)t+𝜀σtd˜W(1)t,dRt=(σt)Rtd˜W(0)t.(3.30)

Using Theorems 2.4(i), 2.4(ii) of Lions & Musiela (2007), Rt is a martingale if

lim supξ+(βξ2+(κ1+κ2ξ)(𝜃ξ)+βξ2)ξ1<,
which holds if κ22β. Rt is not a martingale if for some smooth, positive, increasing function φ we have
lim infξ+(βξ2+(κ1+κ2ξ)(𝜃ξ)+βξ2)φ(ξ)1>0,(3.31)
where +𝜀φ(ξ)1dξ<, 𝜀>0. Choosing φ(ξ)=ξ2, (3.31) holds if κ<2β. □

Corollary 3.1 (Martingality of Log-normal SV model with linear drift). For dynamics (3.12) with κ2=0 we obtain the following:

(1)

The process Zt is martingale under  iff β0.

(2)

The process Rt is martingale under ˜ iff β0.

Proof. Follows by setting κ2=0 in Theorem 3.7. □

Importantly we conclude that the log-normal SV model with linear drift leads to arbitrages when the return–volatility correlation is positive.

3.4. Joint dynamics of state variables under and ˜

We introduce the mean-adjusted volatility process Yt

dYt=(κYtκ2Y2t)dt+𝜗(Yt+𝜃)dW()t,Y0Y=σ0𝜃,κ=κ1+κ2𝜃(3.32)
on the domain (𝜃,). The marginal dynamics of Yt using SDE (3.19) become
dYt=(κYtκ2Y2t)dt+𝜗(Yt+𝜃)dW()t,Y0Y=σ0𝜃,(3.33)
where W()t is a standard Brownian motion.

Corollary 3.2 (Dynamics under the MMA measure ℚ). The joint dynamics of log-price Xt, the mean-adjusted volatility process Yt, and the QV It under the MMA measure  are driven by

dXt=12(Yt+𝜃)2dt+(Yt+𝜃)dW(0)t,X0=X,dYt=(κYtκ2Y2t)dt+β(Yt+𝜃)dW(0)t+𝜀(Yt+𝜃)dW(1)t,Y0=Y,dIt=(Yt+𝜃)2dt,I0=I.(3.34)

Itô’s lemma, applied to Eq. (3.33), yields marginal dynamics of Y under ˜ :

dYt=(˜λ˜κYt˜κ2Y2t)dt+𝜗(Yt+𝜃)d˜W()t,Y0=Y,˜λ=β𝜃2,˜κ=κ1κ2𝜃+2(κ2β)𝜃,˜κ2=κ2β,(3.35)
where ˜W()t is a standard Brownian motion under ˜.

Corollary 3.3 (Dynamics under the inverse measure ℚ̃). The joint dynamics of log-price Xt, the mean-adjusted volatility process Yt, and the QV It under the inverse measure ˜ are driven by

dXt=12(Yt+𝜃)2dt+(Yt+𝜃)d˜W(0)t,X0=X,dYt=(˜λ˜κYt˜κ2Y2t)dt+β(Yt+𝜃)d˜W(0)t+𝜀(Yt+𝜃)d˜W(1)t,Y0=Y,dIt=(Yt+𝜃)2dt,I0=I.(3.36)

3.5. Steady-state distribution of volatility process

It is important to establish that the steady-state density (SSD) of the volatility process σt in SDE (3.19) exists. The SSD, denoted by G(σ), solves the following ODE :

12𝜗22σ2(σ2G)σ((κ1+κ2σ)(𝜃σ)G)=0.(3.37)
The solution is Generalized Inverse Gaussian distribution (Jørgensen (1982)) :
G(σ)=cση1exp{(qσ+bσ)},σ>0,q=2κ1𝜃𝜗2,b=2κ2𝜗2,η=2κ2𝜃κ1𝜗21,c(b)=(b/q)η/22Kη(2qb),(3.38)
where b>0, q0, η and Kη is a modified Bessel function of the second kind. For κ2=0 we use c(0)=qηΓ(η), where Γ(x) is the gamma function, and the steady state PDF is the inverse gamma function. The rth moment m(r) associated to the density in Eq. (3.38) is given by
m(r)={(qb)r/2Kη+r(2qb)Kη(2qb),b>0,r+,qrΓ(ηr)Γ(η),b=0,r<η,,b=0,rη.(3.39)
We see that when κ2=0, the volatility moments exist only to the order r less than r<1+2κ1/𝜗2.

In Fig. 1(a), we show the SSD for the three sets of model parameters. The case (κ1=4,κ2=0) corresponds to the linear mean-reversion; the case (κ1=4,κ2=4) (generally the case with κ2=κ1/𝜃) corresponds to the pure quadratic drift; the case (κ1=4,κ2=8) corresponds to the quadratic drift. The linear model with κ2=0 implies the heavy right tail for the distribution of the volatility. In Fig. 1(b), we show the skewness of the volatility as function of κ2. As κ2 increases, the skewness of the volatility declines.

Fig. 1.

Fig. 1. (a) The steady state PDF of the volatility computed using Eq. (3.38) with fixed κ1=4 and κ2={0,4,8}. (b) and (c) The skewness of the volatility computed using Eq. (3.39) and the excess kurtosis of the unconditional returns distribution computed using Eq. (3.44), respectively, both as functions of κ2, for the three choices of κ1=1,4,8. Other model parameters are fixed to 𝜃=1 and 𝜗=1.5.

3.5.1. Unconditional distribution of returns

We make an interesting connection between the skewness of the volatility process and the kurtosis of returns distribution. Following Chap. 9 in Barndorff-Nielsen & Shiryaev (2015), we assume that the distribution of returns over period δt is Gaussian conditional on σ :

q(x|σ)=12πσ2δtexp(x22σδt).(3.40)

The unconditional distribution of returns under the SDD of the volatility is

p(x)=0q(x|σ)G(σ)dσ.(3.41)
The above integral cannot be computed explicitly. Instead, we compute the central moments of returns under the unconditional distribution
m(n)=xnp(x)dx=0[xnq(x|σ)dx]G(σ)dσ,(3.42)
where we exchange the integration order. It is clear that for odd n, m(n)=0. For the second and the fourth moments, we have
m(2)=δt0σ2G(σ)dσ,m(4)=3(δt)20σ4G(σ)dσ.(3.43)

We compute the excess kurtosis of returns distribution using k=3m(4)/m2(2)3 with Eq. (3.39) as follows :

k=3Kη+4(2qb)Kη(2qb)(Kη+2(2qb))23,κ2>0,3q2η23,κ2=0,κ1/𝜗2>3/2,,κ2=0,κ1/𝜗23/2.(3.44)

It follows that the kurtosis of the unconditional distribution in Eq. (3.44) is maximal when b=0 (κ2=0). The kurtosis is finite only if liner mean-reversion parameter κ1 is higher than the total volatility of variance. The kurtosis declines when rate b (κ2) increases. In Fig. 1(c), we show the kurtosis for the three choices of κ1=1,4,8. We see that for the each choice, the parameter κ2 can be thought as a dampening parameter to reduce both the skewness of the steady-state PDF of the volatility and the kurtosis of the unconditional PDF of returns.

3.6. Moments of volatility process

Computation of moments for the log-normal SV with the quadratic drift is an open problem with no exact solution. We show that there exists a closed-form solution based on a truncation method, which we further adopt for the affine expansion of the MGF. We consider the mean-adjusted volatility process defined in Eq. (3.32). We define its power function mt(n), mt(n)=Ytn, and the mth moment, m¯τ(n), n=0,1,2,, as follows :

m¯t(n)(τ)=𝔼[mτ(n)].(3.45)

Applying Itô’s lemma to mt(n) under the dynamics in Eq. (3.33), we obtain

dmt(n)=(κYtκ2Yt2)nYtn1dt+c(n)(Yt+𝜃)2Ytn2dt+𝜗(Yt+𝜃)nYn1dWt()(3.46)
with m0(n)=Y0n and c(n)=12𝜗2n(n1). We notice a pattern of the powers of n which allows for a recursive solution as follows.

Proposition 3.3 (Moments of volatility process). The solution to moments in Eq. (3.45) can be presented as a matrix equation for an infinite-dimensional vector

τM(0,)(τ)=Λ(0,)M(0,)(τ),(3.47)
where τ is the derivative w.r.t. τ and
M(0,)(τ)=(m¯(0)(τ),m¯(1)(τ),m¯(2)(τ),m¯(3)(τ),)T,M(0,)(0)=(1,Y0,Y02,Y03,)T,Λ(0,)=0000000κκ2000c(2)𝜃22c(2)𝜃(c(2)2κ)2κ2000c(3)𝜃22c(3)𝜃(c(3)3κ)3κ2000c(4)𝜃22c(4)𝜃(c(4)4κ)4κ2.

An approximate solution to ODEs (3.47) is obtained by truncating the number of terms to k and defining a finite dimensional vector of mth moments M(1,k), m=1,,k, as follows:

τM(1,k)=Λ(1,k)M(1,k)+C(1,k),M(1,k)(0)=(Y0,Y02,,Y0k)T,C(1,k)=(0,c(2)𝜃2,0,0,,kκ2Y0k+1)T,Λ(1,k)=κκ20002c(2)𝜃(c(2)2κ)2κ200c(3)𝜃22c(3)𝜃(c(3)3κ)3κ200c(4)𝜃22c(4)𝜃(c(4)4κ)4κ200c(k)𝜃22c(k)𝜃(c(k)kκ).(3.48)

The analytic solution to ODE system (3.48) is given by

M(1,k)(τ)=expm{Λ(1,k)t}M(1,k)(0)+(Λ(1,k))1(expm{Λ(1,k)t}I(k))C(1,k),(3.49)
where expm() is the matrix exponent, ⋅ and −1are the matrix product and inverse, respectively, and I(k) is k×k identity matrix.

Proof. We first present Eq. (3.46) as follows :

dmt(n)=(nκYtnnκ2Ytn+1)dt+c(n)(Ytn+2𝜃Yn1+𝜃2Yn2)dt+𝜗(Yt+𝜃)Yn1dWt()=[c(n)𝜃2Yn2+2c(n)𝜃Yn1+(c(n)nκ)Ytnnκ2Ytn+1]dt+𝜗(Yt+𝜃)mYn1dWt().

Thus, moment m¯τ(n) solves recursive equation (omitting subscript t)

τm¯(n)=c(n)𝜃2m¯(n2)+2c(n)𝜃m¯(n1)+(c(n)nκ)m¯(n)+(2𝜃c(n)nκ2)m¯(n+1).(3.50)

Using the fact that m¯(0)=1 and c(1)=0 we can present Eq. (3.50) for n=0,1, as a matrix equation for infinite-dimensional column vector in Eq. (3.47). Clearly, that under regularity conditions (real parts of eigenvalues of Λ(0,) are negative), we must have limkτm¯(k)=0. Thus, to make a finite-dimensional approximation of Eq. (3.47), we fix k and assume

τm¯(k+1)=0m¯(k+1)=Y0k+1.(3.51)

Finally, by substituting fixed values for m¯(0) and m¯(k+1), we present (3.47) for finite-dimensional vector of mth moments, m=1,,k, in Eq. (3.48). Solution in Eq. (3.49) is obtained by integration. □

In Fig. 2, we plot first four moments of the volatility process computed using Eq. (3.49) with truncation order k=4 (Fig. 2(a)) and k=8 (Fig. 2(b)). For comparison, we show estimates and 95% confidence intervals obtained by MC simulations. We see that the low-order truncation with k=4 is consistent with MC estimates for the first and second moments. The high-order truncation with k=8 is consistent with the first four moments compared to MC estimates.

Fig. 2.

Fig. 2. The first four moments of mean-adjusted volatility computed using Eq. (3.49) with the truncation order k=4 (a) and k=8 (b) as functions of τ. The model parameters σ0=1.5, 𝜃=1.0, κ1=κ2=4, 𝜗=1.5. Dots and error bars denote the estimate and 95% confidence interval, respectively, computed using MC simulations of model dynamics using scheme in Eq. (3.59) with number of path equal 100000 and daily time steps.

Remark 3.1. We note that if κ2=0, the system (3.48) becomes lower tridiagonal so that it can be solved exactly for all moments up to k by the sequential integration from n=1 to n=k.

3.7. Properties of quadratic variance process

We consider the QV process It defined by dIt=dσt2dt. The QV is an important quantity for option pricing. In particular, the expected QV equals model-based fair value of the continuous-time variance swap, which could be used for model calibration.

Proposition 3.4 (Bounded moments). Under Assumption 3.1 and κ1𝜃>0, the QV is well defined under the MMA measure and 𝔼[(It)p]< for any p, |p|1. If κ2β, then the QV is well defined under the inverse measure and [(It)p]< for any p, |p|1.

Proof. As 𝔼[(It)p]=𝔼[(0tσs)p]tp𝔼[sups[0,t]σsp]<, the first statement follows from Proposition 3.1. The second statement follows using the volatility dynamics under ̃ in Eq. (3.28). □

We define the annualized expected QV as follows :

Îτ=1τ𝔼0τσt2dt=1τ𝔼0τ(Yt+𝜃)2dt,Î0=0.(3.52)

Corollary 3.4 (Expected value). The expected value of the QV using kth order truncation in Proposition 3.3 is given by

Îτ1τ0τ(m¯0(2)(τ)+2𝜃m¯0(1)(τ))dτ+𝜃2=1τ(m¯̂(2)(τ)+2𝜃m¯̂(1)(τ))+𝜃2+O(k),(3.53)
where m¯̂(1) and m¯̂(2) are the first and second elements in vector M̂(1,k)(τ) computed using Eq. (3.54), and O(k) is the truncation error. The integrated moments of the volatility with the k-order truncation are computed by
M̂(1,k)(τ)0τM(1,k)(τ)=(Λ(1,k))1(expm{Λ(1,k)t}I)M(1,k)(0)+(Λ(1,k))1((Λ(1,k))1(expm{Λ(1,k)t}I)τI)C(1,k).(3.54)

The approximation term O(k) includes only the truncation error O(k) because Eq. (3.54) is computed analytically using matrix exponentiation. In Fig. 3, we present comparison of the analytical solution for the expected QV using Eq. (3.53) to the estimate and 95% confidence interval based on MC simulations. The truncation-based analytic solution is consistent with MC estimates.

Fig. 3.

Fig. 3. The expected QV computed using Eq. (3.53) with k=4. The top and bottom three lines correspond to σ0=1.5 and σ0=0.5, respectively. The dot and the error bars denote the estimate and 95% confidence interval computed using MC simulations of model dynamics with scheme in SDE (3.59) with number of path equal 100000 and daily time steps. The model cases of mean-reversion correspond to κ1=4 and three choices κ2={0,4,8} with other model parameters set to 𝜃=1.0, 𝜗=1.5.

3.8. Monte Carlo discretization

We consider the SDE (3.19) for the volatility process σt and introduce the log-volatility process Lt=lnσt. Applying Itô’s formula, we obtain the following dynamics for Lt :

dLt=ζ(Lt)dt+𝜗dWt(),L0=lnσ0,ζ(L)=κ1+κ2𝜃12𝜗2+κ1𝜃eLκ2eL.(3.55)

We consider a time horizon T>0 and an equidistant time grid tk=kΔ, Δ=Tn, 0kn. Backward Euler–Maruyama (BEM) scheme for Lt is defined by

L̂tk+1=L̂tk+ζ(L̂tk+1)(tk+1tk)+𝜗(Wtk+1()Wtk()).(3.56)
BEM scheme (also known as drift implicit Euler–Maruyama scheme) is based on Lamperti transform where original SDE (3.19) is transformed into an SDE (3.55) with constant diffusion coefficient and nonlinearity in the diffusion coefficient shifted into the drift of the process. We refer to Neuenkirch & Szpruch (2014), Alfonsi (2013), and Chassagneux et al. (2016) for further details.

We prove that SDE (3.56) has a unique solution in Ltk+1 in order to make BEM scheme well-defined. Although BEM scheme in Eq. (3.56) cannot be solved analytically in L̂tk+1, we still can solve Eq. (3.56) numerically due to the smoothness and monotonicity of function G(L):=Lζ(L)Δ, using just few iterations of Newton algorithm, see Proposition 3.5.

Proposition 3.5. We introduce G(l):=lζ(l)Δ. Then, for every c, the equation G(l)=c has a unique solution in l.

Proof. As G(l) is continuous, G(l)>0 and liml±G(l)=±, the result follows. □

Theorem 3.8. Backward Euler–Maruyama scheme for log-volatility process Lt in SDE (3.55) has a strong convergence rate of 1.

Proof. The proof of the theorem is based on Proposition 3 of Alfonsi (2013). We present it in Proposition 3.6.

Proposition 3.6. We fix p1 and assume that

𝔼0Tζ(Lu)ζ(Lu)+𝜗22ζ(Lu)dup+𝔼0Tζ(Lu)2dup/2<.(3.57)
Then, there exists a constant Kp>0 such that (𝔼supt[0,T]|L̂tLt|p)1/pKpΔ.

Instead of Eq. (3.57), we prove slightly stronger estimate in Eq. (3.58). The latter follows from the combination of Hölder inequality and Proposition 3.1. We omit straightforward details to obtain

𝔼supu[0,T]ζ(Lu)ζ(Lu)+𝜗22ζ(Lu)p+𝔼supu[0,T]|ζ(Lu)|p<.(3.58)
 □

Corollary 3.5. MC discretization scheme of the model dynamics (3.12) under the MMA measure using log-price Xt in SDE (3.16) is given by

Xtk+1=Xtk12σtk2(tk+1tk)+σtk(Wtk+1(0)Wtk(0)),Ltk+1=Ltk+ζ(Ltk+1)(tk+1tk)+β(Wtk+1(0)Wtk(0))+𝜀(Wtk+1(1)Wtk(1)),Itk+1=Itk+σtk2(tk+1tk),σtk+1=exp(Ltk+1)(3.59)
with Xt0=lnS0, Lt0=lnσ0, It0=I0, and where W(0) and W(1) are independent Brownian motions.

We note that the first two dynamics are defined in (,+), which does not require extra handling of boundary conditions, in contrast to some affine models for the variance process.

4. Affine Expansion of Moment Generating Function

We consider the valuation problem of a derivative security with payoff function u(X,I) settled at maturity time T using log-price process XT defined in Eq. (3.16). We denote current time by t and introduce time-to-maturity variable τ=Tt. We denote the undiscounted value function of this derivative security by U(τ,X,I,Y).

To solve the general valuation problem under both and ̃, we parametrize the value function using binary parameter p, p{1,1}, respectively, as follows :

U(τ,X,I,Y;p)=𝔼[u(XT,IT)|Xt=X,It=I,Yt=Y],p=1,𝔼̃[u(XT,IT)|Xt=X,It=I,Yt=Y],p=1,(4.1)
where the expectation 𝔼 under the MMA measure is computed using dynamics (3.34) and the expectation 𝔼̃ under the inverse measure ̃ is computed using dynamics (3.36). Here we apply the fact that the dynamics are Markovian under the both measures.

Theorem 4.1. The value function U(τ,X,I,Y;p) solves the following PDE on the domain [0,T]××+×(𝜃,)

Uτ+((Y;p)+(X;p)+(I;p))U=0,U(0,X,I,Y)=u(X,I),(4.2)
where diffusive operators (Y;p), (X;p), and (I;p) are defined as follows:
(Y;p)U=12𝜗2(Y+𝜃)2UYY+(λ(p)κ(p)Yκ2(p)Y2)UY,(X;p)U=(Y+𝜃)212(UXXpUX)+βUXY,(I;p)U=(Y+𝜃)2UI,κ(p)=κ1κ2𝜃+2κ2(p)𝜃,λ(p)=(κ2κ2(p))𝜃2,κ2(p)=κ2,p=1,κ2β,p=1.(4.3)

Proof. We note that the classic Feynman–Kac formula (for an example, see Sec. A.17 in Musiela & Rutkowski (2009)) cannot be applied directly to Eq. (4.1) because the model dynamics (3.34) and (3.36) do not satisfy the linear growth condition. In Sec. A.2 we prove that the existence and the uniqueness is guaranteed by Theorem A.1. This theorem states that if the covariance matrix of diffusive operator (Y;p)+(X;p)+(I;p) is nonnegative definite and the state variables neither explode nor leave an open domain, then there exists a unique solution to the PDE representation of the conditional expectation under corresponding model dynamics.

Now using the diffusive operators in the valuation PDE (4.2), we obtain that the diffusion matrix a in Eq. (A.14) equals

a=(Yt+𝜃)2β(Yt+𝜃)20β(Yt+𝜃)2𝜗2(Yt+𝜃)20000(4.4)
with eigenvalues given by
λ1=0,λ2,3=(1+𝜗2±1+4β22𝜗2+𝜗4)(Yt+𝜃)2.
Since β<𝜗, we observe that 0=λ1<λ2<λ3, hence matrix a is only nonnegative definite and its smallest eigenvalue is always zero.

We now set the domain D of state variables {Xt,Yt,It} in PDE (4.2) by D=×(𝜃,+)×+. As we showed in Theorem (3.2), under the MMA measure (3.34) and the inverse measure (3.36), the boundary points {0,+} are unattainable for the volatility process σt, hence solution for mean-adjusted volatility process Yt and QV It cannot explode or leave D before T. Same applies to Xt as {0,+} are unattainable for σt. Thus, condition 1 of Theorem A.1 is satisfied. As boundedness condition A.1 is satisfied by construction, the statement of Theorem 4.1 follows from Theorem A.1 stated in Sec. A.2. □

4.1. Moment generating function

We introduce the MGF of the three model variables: the log-price Xt, the QV It, and the mean-adjusted volatility driver Yt=σt𝜃. We extend our analysis using the transform variable in Yt to generalize our solution method for all the three state variables in the model. We denote the MGF by G(τ,X,I,Y;Φ,Ψ,Θ;p) using respective complex-valued transform variables Φ,Ψ,Θ

G(τ,X,I,Y;Φ,Ψ,Θ;p)=𝔼[eΦXτΨIτΘYτ|X0=X,I0=I,Y0=Y],p=1,𝔼̃[eΦXτΨIτΘYτ|X0=X,I0=I,Y0=Y],p=1.(4.5)

Using model dynamics (3.34) under the MMA measure and model dynamics (3.36) under the inverse measure, respectively, the MGF solves the following PDE on the domain [0,T]××+×(𝜃,) :

Gτ+((Y;p)+(X;p)+(I;p))G=0,G(0,X,I,Y;Φ,Ψ,Θ;p)=eΦXΨIΘY(4.6)
with operators defined in Eq. (4.3). First, we establish a sufficient condition for the existence of the MGF for the three state variables.

Theorem 4.2. Given the transform variable Φ=ΦR+iΦI, the MGF of the log-price Xτ

G(τ,X;Φ;p=1)=𝔼[eΦXτ|X0=X]
exists for ΦR(1,0), if Zτ is a martingale under the MMA measure. By Theorem 3.7(1), the necessary condition is κ2β.

Similarly, the MGF of the log-price Xτ

G(τ,X;Φ;p=1)=𝔼̃[eΦXτ|X0=X]
exists for ΦR(0,1) if Rτ is a martingale under inverse measure. By Theorem 3.7(2), the necessary condition is κ22β.

Proof. As ZT is -martingale, the result immediately follows from Jensen’s inequality, as the function g(y):=y|ΦR|1 is convex for |ΦR|(0,1). Result for the inverse measure follows analogously. □

Theorem 4.3. Given transform variable Ψ=ΨR+iΨI, the MGF of QV Iτ

G(τ,I;Ψ;p=1)=𝔼[eΨIτ]
exists for ΨR<0 if κ2>𝜗2ΨR.

Proof. We denote Kt=eIt which satisfies SDE dKt=KtdIt=Ktσt2dt and set p=ΨR. Feynman–Kac formula allows to recast the solution U(τ,σ,K)=𝔼(KTp|Kt=K,σt=σ) as the solution of the Cauchy problem

Uτ+(σ,K)U=0,U(0,σ,K)=K,(4.7)
where operator (σ,K) is defined by
(σ,K)=(κ1+κ2σ)(𝜃σ)σ+12𝜗2σ22σ2+Kσ2K.

We build a supersolution Ū(τ,σ,K) to (4.7) using the following ansatz :

Ū=Kpu(τ,σ),u(τ,σ)=eaσ+bτ,(4.8)
which yields
uKpb+κ1𝜃a+(κ2𝜃κ1)aσ+12𝜗2a2+pκ2aσ2.

We show there exists a0 such that

limσ+b+κ1𝜃a+(κ2𝜃κ1)aσ+12𝜗2a2κ2a+pσ2<+.(4.9)
We notice that inequality (4.9) holds if there exist a>0 such that 12𝜗2a2κ2a+p<0. As latter defines a parabola in a, we can always find required positive a, if κ2>𝜗2p=𝜗2ΨR. □

Theorem 4.4. Given the transform variable Θ=ΘR+iΘI, the MGF of the mean-adjusted volatility process Yτ

G(τ,Y;Θ;p)=𝔼[eΘYτ|Y0=Y],p=1,𝔼̃[eΘYτ|Y0=Y],p=1
exists for ΘR<0, if κ2>𝜗22|ΘR|, p=1 and κ2β>𝜗22|ΘR|, p=1.

Proof. We set c=ΦR and consider

U(τ,Y;c;p):=𝔼[ecYτ|Y0=Y],p=1,𝔼̃[ecYτ|Y0=Y],p=1.(4.10)

Using Feynman–Kac formula we can rewrite U(τ,Y;c;p) as the solution of the Cauchy problem

Uτ+(Y;p)U=0,U(0,Y;c;p)=ecY,(4.11)
where operator (Y;p) is defined by Eq. (4.3). We build a supersolution Ū(τ,Y;c;p) to (4.11) using ansatz Ū=eaY+bτ,ac,b0. Inserting Ū into Eq. (4.11), we have
Ūτ+(Y;p)Ū=Ūb(λ(p)κ(p)Yκ2(p)Y2)b12𝜗2(Y+𝜃)2b2,(4.12)
where coefficients are defined in (4.3). We show there exists ac such that
limY+b(λ(p)κ(p)Yκ2(p)Y2)a12𝜗2(Y+𝜃)2a2<.
We notice that last inequality is satisfied if there exist ac>0 such that κ2(p)a+12𝜗2a2<0. Dividing by a>0 we conclude that it is satisfied if κ2(p)>𝜗22c. Rewriting the last condition in terms of κ2, β using (4.3), completes the proof. □

4.2. Affine expansion

Given that coefficients in PDE (4.6) are nonaffine in state variable Y, the PDE cannot be solved by assuming an exponential affine-solution with a finite number of coefficients. Instead, we can show that the solution to the PDE (4.6) can be represented by an exponential-affine ansatz E[] using an infinite dimensional vector A(τ)={A(k)(τ;Φ,Ψ,Θ;p)}, k=0,1,,, as

E[](τ,X,I,Y;Φ,Ψ,Θ;p)=expΦXΨI+k=0A(k)(τ;Φ,Ψ,Θ;p)Yk,(4.13)
where vector function A(τ) solve an infinite-dimensional system of quadratic ODEs :
Aτ(k)=AM(k)A+(L(k))A+H(k),k=0,1,,,(4.14)
where M(k), L(k), H(k) are infinite-dimensional sparse matrices and vectors.

We note the similarity between the current ansatz (4.13) for problem (4.6) and the problem of computing the moments of the volatility process addressed in Proposition 3.3. There we first represent the recursive solution to the moments as an infinite dimensional vector in Eq. (3.47). We then obtain an approximate solution by a truncation of infinite series and specifying a boundary condition to reduce the problem to a finite-dimensional vector (3.48), which can be solved using analytical methods. Here we follow this insight for solving PDE (4.6) and term our (truncated) solution as the affine expansion for a given order.

4.2.1. First-order expansion

Theorem 4.5 (First-order affine expansion). The MGF in Eq. (4.5) can be decomposed as follows:

G(τ,X,I,Y;Φ,Ψ,Θ;p)=exp{ΦXΨI}[E[1](τ,Y;Φ,Ψ,Θ;p)+R[1](τ,Y;Φ,Ψ,Θ;p)],(4.15)
where E[1] is the leading first-order term and R[1] is the remainder term, such that E[1](τ=0)=1 and R[1](τ=0)=0. The leading term E[1] is given by the exponential-affine form
E[1](τ,Y;Φ,Ψ,Θ;p)=expk=02A(k)(τ;Φ,Ψ,Θ;p)Yk,(4.16)
where vector function A(τ)={A(k)(τ;Φ,Ψ,Θ;p)}, k=0,1,2, solve the quadratic differential system as a function of τ:
Aτ(k)=AM(k)A+(L(k)(p))A+H(k)(p),M(k)=0000𝜃2𝜗220000,0000𝜃𝜗2𝜃2𝜗20𝜃2𝜗20,0000𝜗222𝜃𝜗202𝜃𝜗22𝜃2𝜗2,L(k)(p)=0λ(p)𝜃2βΦ𝜃2𝜗2,0κ(p)2𝜃βΦ2(λ(p)+𝜃𝜗2𝜃2βΦ),0βΦκ2(p)𝜗2κ(p)4𝜃βΦ,H(k)(p)=12𝜃2(Φ2+pΦ2Ψ),𝜃(Φ2+pΦ2Ψ),12(Φ2+pΦ2Ψ)(4.17)
with the initial condition A(0)=(0,Θ,0).

The remainder term R[1] solves the following PDE (omitting arguments) :

Rτ[1]+(Y;p)R[1]=E[1]F[1](Y,A(1),A(2)),R[1](0,Y;Φ,Ψ;p)=0(4.18)
with operator (Y) defined in Eq. (4.3) and residual F[1] defined by
F[1](Y,A(1),A(2))=n=34C(n)(τ;A(1),A(2))Yn,C(3)(τ)=2A(2)(𝜗2(2𝜃A(2)+A(1))βΦκ2),C(4)(τ)=2𝜗2(A(2))2.(4.19)

Proof. We take as an ansatz the leading term E[1](τ,Y;Φ,Ψ,Θ;p) in decomposition (4.16) and substitute it into the PDE (4.6). To obtain the ODEs (4.17) for A(k), we match terms with Yk, k=0,1,2. To obtain the PDE for the remainder term R[1], we account for higher powers of Yk, k=0,1,2. Finally, initial conditions for ODEs (4.17) and PDE (4.18) are set to reproduce conditions of Eq. (4.6). □

Proposition 4.1. We take Ψ=Θ=0 and Φ=p/2, p{1,1}. Assuming that hypotheses 1, 2, 3 listed in Theorem 4.7 hold, the continuous solution A(τ) in Eq. (4.17) exists on [0,+).

Proof. The proof is identical to that of Theorem 4.7 for the second-order expansion. We skip it for brevity. □

In Fig. 4, we show the real and imaginary parts of ODE solutions A(τ) and E[1]. We see that functions A(1) and A(2) quickly reach an equilibrium point, while A(0) is the nonstationary part which contributes to the leading term E[1].

Fig. 4.

Fig. 4. The solution of ODEs of the first order expansion in Eq. (4.17) as function of τ for Φ=0.5+2i, p=1. We use the model parameters calibrated to Bitcoin options data and reported in Eq. (6.4). (a) The real part of the solution, (b) the imaginary part of the solution, (c) the the real and imaginary parts of the exponential term E[1] in Eq. (4.16).

Corollary 4.1 (Estimate of the first-order remainder term). We obtain the following estimate for the remainder term R[1] in Eq. (4.18) :

|R[1](τ,Y;Φ,Ψ,Θ;p)|n=34C(4)(τ)×Mσ(n)(τ),(4.20)
where Mσ(n)(τ) is the nth central moment of the volatility defined by
Mσ(n)(τ)𝔼[(σ(τ)𝜃)n]=𝔼[Yn(τ)].(4.21)

Proof. The proof is analogous to the second-order expansion in Proposition 4.3. □

Corollary 4.2. First-order affine expansion for the MGF (4.5) is obtained by using the leading term E[1] in Eq. (4.15) :

G(τ,X,I,Y;Φ,Ψ,Θ;p)=exp{ΦXΨI}E[1](τ,Y;Φ,Ψ,Θ;p),(4.22)
where the approximation error arises from the truncation of the infinite dimensional affine anzats in Eq. (4.13) with the error magnitude estimated using Eq. (4.20).

In Fig. 6, we illustrate that the first-order approximation (4.22) for the MGF produces valid and accurate PDFs for all three state variables.

4.2.2. Second-order expansion

Theorem 4.6 (Second-order affine expansion). The MGF in Eq. (4.5) can be decomposed as follows:

G(τ,X,I,Y;Φ,Ψ,Θ;p)=exp{ΦXΨI}[E[2](τ,Y;Φ,Ψ,Θ;p)+R[2](τ,Y;Φ,Ψ,Θ;p)],(4.23)
where E[2] is the leading term such that E[2](τ=0)=1 and R[2](τ=0)=0 is the remainder term.

The leading term E[2](τ,Y;Φ,Ψ;p) is given by the exponential-affine form

E[2](τ,Y;Φ,Ψ,Θ;p)=expk=04A(k)(τ;Φ,Ψ,Θ;p)Yk,(4.24)
where vector function A(τ)={A(k)(τ;Φ,Ψ,Θ;p)}, k=0,,4, solves the system the quadratic differential system as a function of τ:
Aτ(k)=AM(k)A+(L(k)(p))A+H(k)(p),M(k)=000000𝜃2𝜗22000000000000000000,000000𝜃𝜗2𝜃2𝜗2000𝜃2𝜗20000000000000,000000𝜗222𝜃𝜗232𝜃2𝜗2002𝜃𝜗22𝜃2𝜗200032𝜃2𝜗200000000,0000000𝜗23𝜃𝜗22𝜃2𝜗20𝜗24𝜃𝜗23𝜃2𝜗2003𝜃𝜗23𝜃2𝜗20002𝜃2𝜗2000,0000000032𝜗24𝜃𝜗2002𝜗26𝜃𝜗24𝜃2𝜗2032𝜗26𝜃𝜗292𝜃2𝜗2004𝜃𝜗24𝜃2𝜗200,L(k)(p)=0λ(p)𝜃2βΦ𝜃2𝜗200,0κ(p)2𝜃βΦ2(λ(p)+𝜃𝜗2𝜃2βΦ)3𝜃2𝜗20,0βΦκ2(p)𝜗22κ(p)4𝜃βΦ3(2q𝜗2+λ(p)𝜃2βΦ)6𝜃2𝜗2,002(βΦ+κ2(p))3(𝜗2κ(p)2𝜃βΦ)4(3𝜃𝜗2+λ(p)𝜃2βΦ),0003(βΦ+κ2(p))2(3𝜗22κ(p)4𝜃βΦ),H(k)(p)=12𝜃2(Φ2+pΦ2Ψ),𝜃(Φ2+pΦ2Ψ),12(Φ2+pΦ2Ψ),0,0(4.25)
with initial condition A(0)=(0,Θ,0,0,0).

The remainder term R[2] solves the following PDE (omitting arguments)

Rτ[2]+(Y;p)R[2]=E[2]F[2],R[2](0,Y;Φ,Ψ,Θ;p)=0(4.26)
with operator (Y) defined in Eq. (4.3) and function F[2] defined by
F[2](τ,Y;A(1),A(2),A(3),A(4))=k=58C(k)(τ;A(1),A(2),A(3),A(4))Yk,C(5)(τ)=3𝜗2A(3)(3𝜃A(3)+2A(2))+4A(4)(𝜗2(3𝜃2A(3)+4𝜃A(2)+A(1))κ2),C(6)(τ)=12𝜗2(16𝜃2(A(4))2+16A(4)(3𝜃A(3)+A(2))+9(A(3))2),C(7)(τ)=4𝜗2A(4)(4𝜃A(4)+3A(3)),C(8)(τ)=8𝜗2(A(4))2.(4.27)

Proof. The proof follows by analogy to the first-order expansion in Theorem 4.5. □

Remark 4.1. We emphasize that sparse matrices M(k) in the quadratic term do not depend on the measure parameter p, while the linear L(k)(p) and free terms H(k)(p) depend on p in ODEs (4.17), (4.25) for the first-and the second-order expansion.

We now present the result on the existence of continuous solution to Eq. (4.25) in Proposition 4.2 and discuss it further in Sec. 4.2.3.

Proposition 4.2. We take Ψ=Θ=0 and Φ=p/2, p{1,1}. Assuming that hypotheses 1,2,3 listed in Theorem 4.7 hold, the continuous solution A(τ) in Eq. (4.25) exists on [0,+).

Proof. See Theorem 4.7. □

In Fig. 5, we show the real and imaginary parts of A(τ) for the second-order expansion and the leading term E[2]. We see that, similarly to the first order expansion illustrated in Fig. 4, functions A(1),A(2),A(3),A(4) reach an equilibrium point, while A(0) is the nonstationary part which contributes to the leading term E[2]. By comparing the low order terms A(0),A(1),A(2) between the first and the second expansions, we notice no visible difference, which suggest that the expansion is rather recursive and higher order terms make insignificant contribution to the low order terms. We also notice that terms A(3) and A(4) are very small in magnitude compared to the first three terms, which suggest that higher order term only provide a small marginal contribution.

Fig. 5.

Fig. 5. The solution of ODEs in Eq. (4.25) as function of τ for Φ=0.5+2i, p=1. We use the model parameters calibrated to Bitcoin options data and reported in Eq. (6.4). (a) The real part of the solution, (b) the imaginary part of the solution, (c) the real and imaginary parts of the exponential term E[2] in Eq. (4.24).

Corollary 4.3 (Estimate of the second-order remainder term). We obtain the following estimate for the remainder term R[2] which solves Eq. (4.26) :

|R[2](τ,X;Φ,Ψ,Θ;p)|n=58Cn(τ)×Mσ(n)(τ),(4.28)
where Mσ(n) is the nth central moment of the volatility defined by (4.21).

Proof. The formal solution for the remainder term R[2] solving problem (4.26) is obtained by applying the Feynman–Kac formula and is given by

R[2](τ,Y;Φ,Ψ)=0τ𝜃[E[2](τt,Y;Φ,Ψ)F[2](τt,Y)]P(t,Y;Y)dtdY,(4.29)
where P(t,Y;Y) is PDF of Y=Y(t) conditional on Y. P solves PDE (4.2), subject to initial condition P(0,Y;Y)=δ(YY). Using Eq. (4.29), we obtain
|R[2](τ,Y;Φ,Ψ)|0τ𝜃|E[2](τt,Y;Φ,Ψ)F[2](τt,Y)P(t,Y;Y)|dtdY0τ𝜃|F[2](τt,Y)P(t,Y;Y)|dtdY.(4.30)
Here, we apply the bound on the MGF |E[2](τt,Y;Φ,Ψ)|1 and integrate out the log-return X and the QV I. Next we substitute the polynomial function F[2] in Eq. (4.27). We use the continuity of functions A(k), k=1,2,3,4, and apply the first mean value theorem for definite integrals to estimate the time integral. Finally, we approximate the expected moments of the mean-adjusted volatility by its steady-state nth order central moments Mσ(n) specified in Eq. (4.21). □

Corollary 4.4. Second-order affine expansion for the MGF (4.5) is obtained by using the leading term E[2] in Eq. (4.24) as

G(τ,X,I,Y;Φ,Ψ,Θ;p)=exp{ΦXΨI}E[2](τ,Y;Φ,Ψ,Θ;p),(4.31)
where the approximation error arises from the truncation of the infinite dimensional affine anzats in (4.13) with the error magnitude estimated by Eq. (4.28).

In Fig. 6, we illustrate that the second-order approximation (4.31) for the MGF produces valid and accurate PDFs for all three state variables.

4.2.3. Solution of quadratic differential systems

It is established that the global existence problem for the solution of quadratic differential systems is nontrivial, see Coppel (1966), Dickson & Perko (1970), Jacobson (1977), Baris et al. (2008) among others. We can show that the matrices of quadratic terms in Eqs. (4.17) and (4.25) are indefinite because their eigenvalues are of both signs. Thus, we can provide a result for global existence that is only conditional by nature. We focus on important case of option valuation on underlying assets with Φ=p/2, p{1,1}, and Ψ=Θ=0.

Dickson & Perko (1970) show that complex-valued solution of quadratic ODEs such as A(τ;Φ) in Eq. (4.17) or Eq. (4.25) has maximal interval of existence [0,τ+(Φ;A)), where τ+(Φ;A) denotes its blow-up time.

Fig. 6.

Fig. 6. PDFs computed using the inversion of the first order expansion term E[1] in Eq. (4.16) and the second-order expansion term E[2] in Eq. (4.24) for the state variables under the MMA measure: (a)–(c) PDFs of log-return, Xτ, of QV normalized by τ, Iτ/τ, of volatility στ, στ=Yτ+𝜃, respectively. We use the model parameters for Bitcoin options reported in Eq. (6.4) with time to maturity of one month, τ=1.0/12.0. The blue histogram is computed using realizations from MC simulations of joint model dynamics (3.12) using scheme in Eq. (3.59) with the number of path equal 400,000 and daily time steps.

Theorem 4.7. Assume following conditions are satisfied:

(1)

τ+(Φ;A)τ+(Φ;A), Φ, i.e. real part of A(τ;Φ) cannot blow up before A(τ;Φ).

(2)

k=02mAk(τ;Φ;p)Yk+ as ττ+(Φ), Φ, i.e. leading term of the affine expansion does not vanish as we approach blow-up time when transform variable Φ is real.

(3)

limττ+(Φ)R[n](τ;Φ)>, Φ, i.e. remainder term is uniformly bounded from below.

If MGF in Eq. (4.6) is finite with G(τ0;X,I,Y;Ψ=Φ,Ψ=0,Θ=0)<, then continuous solution A(τ;Φ,Ψ=0,Θ=0) of Eq. (4.17) and of Eq. (4.25) exists on [0,τ0).

Proof. See Sec. A.4. □

By Theorem 4.7, we obtain that continuous solution A(τ) for the first-and second-order affine expansions in Eqs. (4.16) and (4.24), respectively, exists on [0,+) if 1, 2, 3 hold.

4.3. Properties of affine expansions

We now consider the key properties of the affine expansion of the MGF. For brevity, we focus on the properties of the second-order expansion, which is our key development. For the first order expansion, martingale conditions stated in Proposition 4.3 hold, and the consistency with the expected values (but not with their variances) of three state variables is ascertained in the same way as for the second-order expansion.

Proposition 4.3 (Martingale conditions). Assuming that hypotheses 1, 2, 3 in Theorem4.7 hold, the second-order expansion of the MGF in (4.31) with E[2] defined in Eq. (4.24) satisfies the martingale conditions for log-return Xτ:

G[2](τ,Y;Φ=0,Ψ=0,Θ=0;p)E[2](τ,Y;Φ=0,Ψ=0,Θ=0;p)=1,(4.32)
𝔼[eXτ]eX0E[2](τ,Y;Φ=1,Ψ=0,Θ=0;p=1)=eX0,𝔼̃[eXτ]eX0E[2](τ,Y;Φ=1,Ψ=0,Θ=0;p=1)=eX0.(4.33)

Proof. For Φ=Ψ=Θ=0, we obtain H(p)0 in (4.25). Given zero initial conditions, A(k)(τ;Φ=0,Ψ=0,Θ=0;p)0 due to the uniqueness of the solution. Similarly, for Φ=1 and Θ=Ψ=0, we obtain that H(1)0, hence A(k)(τ;Φ=1,Ψ=0,Θ=0;p=1)0. Similarly, for Φ=1 and Ψ=0 we obtain that H(1)0, hence A(k)(τ;Φ=1,Ψ=0,Θ=0;p=1)0. □

Proposition 4.4 (Volatility moments). The MGF in Eq. (4.31) with the second-order affine term E[2] in (4.24) reproduces exactly the expected value and the variance of the volatility for κ2=0.

Proposition 4.5 (Log-price moments). The MGF in Eq. (4.31) with the second-order affine term reproduces exactly the expected value and the variance of the log-price for κ2=0.

Proposition 4.6 (QV moments). The MGF in Eq. (4.31) with the second-order affine term reproduces exactly the expected value and the variance of QV for κ2=0.

Proof. Propositions 4.4 and 4.5 are proved in Secs. A.3.1 and A.3.2, respectively. Proof of Proposition 4.6 is similar to the proof of Proposition 4.5. □

Remark 4.2. In Remark 3.1 we noted that the system of volatility moments (3.48) is solved exactly for κ2=0, otherwise the solution is approximate. Thus, we cannot generalize above statements for κ2>0. However, it is our hypothesis that moments obtained using (4.24) correspond to moments computed in (3.48) using truncation order k=4 and that truncation error as function of κ2 is very small.

In Fig. 6, we show PDFs computed using the inversion of first-order solution E[1] in Eq. (4.16) and second-order solution E[2] in Eq. (4.24) for the three state variables in model dynamics (3.34). The blue histogram is computed using MC simulations of the joint model dynamics. We see that both first-and second-order expansions produce valid PDFs and match consistently histograms produced by the MC simulations. The first-order expansion may not be very accurate for the QV and the volatility variables because it is not consistent with variances of these variables. The second-order expansion is consistent with variances and co-variances of all state variables and it accurately matches PDFs computed using MC simulations.

5. Option Valuation

We apply the zero-drift log-price process Xt in Eq. (3.16) and Eqs. (3.17) and (3.18) for spot and futures prices, respectively, to consider option valuation on both spot and futures underlyings. We denote by PT the price of either spot or futures asset

PT=eμ̄(T)eXT,X0=lnS0,(5.1)
where μ̄(T)=r̄(0,T) for spot price and μ̄(T)=r̄(0,T)q̄(0,T) for futures.

As we conclude in Corollaries 4.2 and 4.4, we apply affine expansion given by either the first order in Eq. (4.16) or the second order in Eq. (4.24) as analytic solutions to the MGF under the dynamics (3.16). Given that the MFG is available analytically, we then apply Lewis–Lipton approach for valuation of vanilla options using Fourier transform, see Lewis (2000), Lipton (20012002).

5.1. Vanilla calls and puts

Using Eq. (5.1), we represent put and call payoff functions for strike price K using capped payoff by

ucall(PT,K)=max{PTK,0}=PTmin{PT,K}=PTmin{eμ¯(T)+XT,K},uput(PT,K)=max{KPT,0}=Kmin{PT,K}=Kmin{eμ¯(T)+XT,K}.(5.2)
Thus, we evaluate capped payoff u(PT)=min{PT,K} under MMA measure by
U(τ,X)=er¯(T)𝔼[min{eμ¯(T)+XT,K}|t].(5.3)

Proposition 5.1. The valuation formula for capped payoff is given by

U(τ,X)=er¯(T)Kπ0e(iy1/2)X×1y2+1/4E[m](τ,Y;Φ=iy1/2,Ψ=0,Θ=0;p=1)dy,(5.4)
where X=ln(S0/K)+μ¯(T) is log-moneyness, E[1] and E[2] are given in Eqs. (4.16) and (4.24), respectively.

Proof. Using affine expansion for MGF G with either first-order in Eq. (4.22) or second-order in Eq. (4.31), the PDF of Xτ is computed by Fourier inversion

PDF(τ,X,Y;X;p=1)=1π0exp{Φ(XX)}E[m](τ,Y;Φ,Ψ=0;p=1)dΦ.(5.5)

For valuation problem in Eq. (5.3), we obtain

U(τ,X)=er¯(T)min{eμ¯(T)+X,K}×1π0eΦ(XX)E[m](τ,Y;Φ,Ψ=0;p=1)dΦdX.

Assuming that inner integrals are finite, we exchange the integration order as

U(τ,X)=1πer¯(T)0û(Φ)E[m](τ,Y;Φ,Ψ=0;p=1)dΦ,(5.6)
where û(Φ) is the transformed payoff function
û(Φ)=eΦXeΦXmin{eX+μ¯(T),K}dX=eΦXeμ¯(T)e(Φ+1)kΦ+1ek+μ¯(T)1ΦeΦk=KeΦ(ln(S/K)+μ¯(T))1(Φ+1)Φ=KeΦX1(Φ+1)Φ,(5.7)
where X=ln(S/K)+μ¯(T) is log-moneyness, k=lnKμ¯(T) with the first integral being finite for [Φ]>1 and the second integral being finite for [Φ]<0. Integral in Eq. (5.7) is finite for 1<[Φ]<0. For Φ=iy1/2, we get Eq. (5.4)
û(Φ=iy1/2)=Ke(iy1/2)X1y2+1/4.(5.8)
 □

As a result, calls and puts on spot underlying are valued using Eqs. (5.2)

Ucall(τ,S,K)=er¯(T)+μ¯(T)SU(τ,X),Uput(τ,S,K)=er¯(T)KU(τ,X).(5.9)
The value of call option on future underlying, in turn, equals
Ucall(τ,F,K)=er¯(T)f0(T)U(τ,X),(5.10)
where f0(T) is the futures and X=ln(f0(T)/K) denotes log-moneyness. Hence, given a set of futures prices quoted in the market, we can evaluate options on this set of futures without need to compute the convenience yield explicitly.

We emphasize that in valuation equations (5.9) we implicitly assume that the price process Zt=exp(Xt), t(0,T], is -martingale under the MMA measure, otherwise formulas (5.9) are not valid. For that we need a restriction κ2β, see Theorem 3.7.

5.2. Inverse calls and puts

Using generic price process of underlying spot and futures given in Eq. (5.1), we represent the payoffs of inverse calls and puts by

ũcall(PT,K)=1PTmax{PTK,0}=1eXTμ¯(T)min{eμ¯(T)+XT,K},ũput(PT,K)=1PTmax{KPT,0}=KPTeXTμ¯(T)min{eμ¯(T)+XT,K}.(5.11)
As a result, we need to evaluate option on the inverse capped payoff under ̃
Ũ(τ,X)=𝔼̃[eXTμ¯(T)min{eμ¯(T)+XT,K}|t].(5.12)

Proposition 5.2. The valuation formula (5.6) for the inverse capped payoff becomes

Ũ(τ,X)=1π0e(iy+1/2)X1y2+1/4E[m]×(τ,Y;Φ=iy+1/2,Ψ=0,Θ=0,p=1)dy,(5.13)
where X=ln(S0/K)+μ¯(T) is log-moneyness, E[1] and E[2] are given in Eqs. (4.16) and (4.24), respectively.

Proof. As in Eq. (5.7), we compute the transform û(Φ) of the inverse capped option

û(Φ)=eΦXeΦXeXμ¯(T)min{eX+μ¯(T),K}dX=eΦXkeΦXdX+KkeΦXXμ¯(T)dX=eΦX1Φ(Φ1),(5.14)
where X=ln(S/K)+μ¯(T) is the log-moneyness and k=lnKμ¯(T), with the first integral being finite for [Φ]>0 and the second integral being finite for [Φ]<1. Integral in Eq. (5.14) is finite for 0<[Φ]<1. We set Φ=iy+1/2, then
û(Φ=iy+1/2)=e(iy+1/2)x1y2+1/4.(5.15)
 □

As a result, calls and puts are valued using capped payoff (5.12) :

Ũcall(τ,S,K)=1Ũ(τ,X),Ũput(τ,S,K)=KeXμ¯(T)Ũ(τ,X).(5.16)

Hereby, we need to compute

f(τ,X)=1PTt=[eXTμ¯(T)|t]=eμ¯(T)eX(5.17)
with the last equality following from Eq. (4.33). Given the term structure of futures prices f0(T) in Eq. (3.18), the put option on the future is
p(τ,F,K)=Kf0(T)Ũ(τ,X)(5.18)
with log-moneyness X=ln(f0(T)/K). We emphasize that Eq. (5.16) holds only if ̃, which requires restriction κ2β, see Theorem 3.6. In addition, we assume that the inverse process Rt=exp(Xt), t(0,T], is a true ̃-martingale. For this we need κ22β by Theorem 3.7.

5.3. Options on quadratic variance

We consider a call option on QV with the payoff function

u(I)=1Tmax{I(T)TK,0},(5.19)
where K is the strike in terms of the annualized variance.

Proposition 5.3. The value of the call option on the QV under  is given by

U(τ,X,Y,I)=er̄(T)𝔼[u(IT)|t]=er̄(T)T1π0+[û(Ψ)E[m](τ,Y;Φ=0,Ψ;Θ=0;p=1)]dΨ,(5.20)
where
û(Ψ)=eΨIΨ2eΨTK(5.21)
provided [Ψ]<0, with E[1] and E[2] given in Eqs. (4.16) and (4.24), respectively.

Proof. Consider an option on QV IT with terminal payoff u(IT). Using Sepp (2008), we find that option value is given by Eq. (5.20), where û(Ψ) is the transformed payoff function

û(Ψ)=0+u(I)eΨ(II)dI.(5.22)

For a call option on QV with strike K, provided [Ψ]<0, Eq. (5.22) becomes

û(Ψ)=eΨI0+u(I)eΨIdI=eΨI0+max{ITK,0}eΨIdI=eΨIΨ2eΨTK.
 □

Sepp (2008) derives transforms û(Ψ) for typical payoff on the QV including puts and calls on the square root of the QV. Assuming that the inverse measure ̃ is a martingale measure for generic spot and futures underlying Pt defined in Eq. (5.1), the inverse option on the QV is given under ̃ by

Ũ(τ,X,Y,I)=𝔼̃u(IT)PTt.(5.23)
Similarly to Eq. (2.5), values Ũ(τ,X,Y,I) and U(τ,X,Y,I) in Eqs. (5.23) and (5.20), respectively, are related by
U(τ,X,Y,I)=PtŨ(τ,X,Y,I).

Proposition 5.4. The value of the call option on the QV under the inverse measure ̃ is given by

Ũ(τ,X,Y,I)=er̄(T)T1π0+[û(Ψ)eXμ¯(T)E[m]×(τ,Y;Φ=1,Ψ,Θ=0;p=1)]dΨ,(5.24)
provided [Ψ]<0, with E[m] and û(Ψ) defined as in Eq. (5.21).

Proof. Using affine expansion for MGF G with either first-order in Eq. (4.22) or second-order in Eq. (4.31), we compute the joint density of {Xτ,Iτ} using inverse Fourier transform by

PDFX,I(τ,X,Y,I;X,I)=14π2++eΦ(XX)+Ψ(II)E[m](τ,Y;Φ,Ψ)dΦdΨ.

As a result, we obtain

Ũ(τ,X,Y,I)=er̄(T)T+0+eXμ̄(T)u(I)×14π2++eΦ(XX)+Ψ(II)E[m](τ,Y;Φ,Ψ)dΦdΨdXdI.

Exchanging the integration order, we have

Ũ(τ,X,Y,I)=er̄(T)T14π2++[E[m](τ,Y;Φ,Ψ)û(Φ,Ψ)]dΦdΨ,(5.25)
where û(Φ,Ψ) is the transformed payoff function
û(Φ,Ψ)=eΦXΨI+0+eXμ̄(T)u(I)eΦX+ΨIdXdI=eΦXΨIeμ̄(T)+e(Φ1)XdX0+u(I)eΨIdI=2πeΦXμ̄(T)δ0(Φ1)û(Ψ),û(Ψ)=eΨI0+u(I)eΨIdI,(5.26)
provided [Ψ]<0. We integrated out the complex exponential in Eq. (5.26) using results of Sepp (2007). Thus, Eq. (5.25) becomes
Ũ(τ,X,Y,I)=er̄(T)T12π+[eXμ̄(T)E[m](τ,Y;Φ=1,Ψ)û(Ψ)]dΨ.(5.27)
 □

6. Model Implementation and Calibration to Options Data

6.1. Implementation

In practical applications, either for model calibration or for options valuation, we need to compute call and put option prices on a grid consisting of several maturities and strikes, also referred to as an option chain. We consider grid with M maturities and J strikes. For option valuation using the MGF, we apply the affine expansion given by either the first order in Eq. (4.16) or the second order in Eq. (4.24).

First, we fix the space grid with size P (typically, P500) of transform variable Φ for the log-price. We solve the system of ODEs numerically for either the first-order expansion in Eq. (4.16) or for the second-order expansion in Eq. (4.24) in time up to the last maturity time. The computational cost of this step is

𝒞(1)=𝒪(P×Nmax),(6.1)
where Nmax is the number of time steps for ODE solver (typically we have one-two steps per one week) and 𝒪 is the number of arithmetic (elementary) operations.

Second, we compute values of the MGF on the grid of Φ for each maturity slice. We then compute values of capped payoffs for the MMA measure in Eq. (5.4) or for the inverse measure in Eq. (5.13) using Simpson rule for numerical integration. We then value all options in a given chain. The computational cost of this step is

𝒞(2)=𝒪(J×M×P).(6.2)

The main difference with numerical implementation of affine models is that the first step can be computed with cost close to 𝒪(P×M) because the solution to the MGF can be computed analytically without solving ODEs numerically. The cost ratio between the implementation of the log-normal SV model and an affine SV model compares favorably when the number of maturities becomes large with M close to Nmax, and when the number of strikes J becomes large as well.

6.2. Model calibration

To illustrate that the log-normal SV model can handle different market regimes, we use time series of option prices on Bitcoin observed on Deribit exchange from April 2019 to October 2023. We focus on short-dated options with expiries including one and two weeks and one month, because these are the most liquid and actively traded expiries. We use time series of options chains traded on Deribit. We fix calibration every week with option prices samples each Friday at 10:00 UTC. We include puts and calls with absolute values of option deltas between [0.10,0.50] and [0.10,0.50], respectively.

For the stability of calibration, we estimate mean-reversion parameters κ1 and κ2 beforehand. The reason is that mean-reversion parameters and volatility-of-volatility with volatility beta are co-dependent so that it is possible to produce similar smiles and skews using different values of these parameters. We follow the procedure in Sepp & Rakhmonov (2023) for estimation of parameters κ1 and κ2 by fitting the model implied auto-correlation of the log-normal volatility process to the empirical auto-correlation. This is efficient because the model auto-correlation function is determined solely by mean-reversion parameters. The estimated values of κ1 and κ2 are the following κ̂1=2.21 and κ̂2=2.18, which are kept fixed through the entire period (in practice we would adjust these estimate regularly using most recent data).

Thus, we need to estimate four model parameters σ0,𝜃,β,𝜀. Parameters σ0 and 𝜃 control the level and the slope of at-the-money (ATM) implied volatilities, and β and 𝜀 control the skewness and the convexity of implied volatilities, respectively. We apply a stronger constraint κ22β that guarantees martingale property for the inverse process Rt under ̃ using Theorem 3.7. To compute model prices we apply the second-order affine expansion in Eq. (5.9). For each set of sampled option prices, we minimize the weighted mean squared error (WMSE) between model implied volatilities, σnmodel(T,K), and market mid-point implied volatilities, σnimplied(T,K), where K and T are option’s strike and maturity, respectively, as followsd :

WMSE=nwn(T,K)(σnmodel(T,K)σnimplied(T,K))2(6.3)
with weight wn(T,K)=𝒱n(T,K) set to option Black–Scholes vega 𝒱n(T,K).

In Fig. 7(a), we show the time series with the quality of the model fit which is computed using the square root of average squared differences between model and market implied volatilities. As a benchmark, we show the average spread between implied volatilities of ask and bid quotes for options in the calibration set. We observe that most of the times, the average model mis-pricing is within the average bid-ask spread. We see that year 2020 of COVID-19 pandemic is characterized by high volatility and high spreads, yet the model is able to fit these markets well. In recent years of 2022 and 2023, bid-ask spreads declined while the model error (the average difference between market and model implied volatility) became less than 1% most of the times.

Fig. 7.

Fig. 7. Time series of the quality of the model calibration and of calibrated model parameters using weekly model calibrations to prices of option on Bitcoin traded on Deribit exchange from April 2019 to October 2023. (a) Time series of the mean squared error (MSE) between model and market implied volatilities and the average bid-ask (Bid/Ask) spread of quoted implied volatilities. (b)–(d) Time series of estimated initial volatility σ0 and mean volatility 𝜃, volatility beta β, and volatility-of-volatility 𝜀, respectively.

In Figs. 7(b)–7(d), we show the time series of calibrated values of initial volatility σ0 and mean volatility 𝜃, volatility beta β, and volatility-of-volatility 𝜀, respectively. We observe downward trend in the level of implied volatility, falling from average levels of 80% in years 2021 and 2022 to most recent averages of about 40%. At the same time, volatility beta and volatility-of-volatility remain range bound suggesting than relative value of calls and puts is intact despite falling level of the overall volatility. The volatility beta fluctuates between positive and negative values most of the times following the market sentiment. When the sentiment is bullish, calls are bid and volatility beta becomes positive and, vice versa, during bearish sentiment, puts are bid and the volatility beta becomes negative. Thus, as we discuss in Sec. 3.3, the conventional SV models may not be arbitrage-free for the valuation of options on cryptocurrencies when return–volatility correlation become positive.

In Fig. 8, we show the quality of model fit to Bitcoin options traded on Deribit exchange on 20 June 2023. On this day, fitted model parameters are the following :

σ̂0=0.41,𝜃̂=0.38,β̂=0.50,𝜀̂=3.06,κ̂1=2.21,κ̂2=2.18.(6.4)

Fig. 8.

Fig. 8. Quality of model fit to Bitcoin options on 20 June 2023 reported in Eq. (6.4) using the three most liquid slices: one week (a), two weeks (b), one month (c). Continuous black line is model implied volatilities computed using the second-order affine expansion in Eqs. (5.9) and (5.16). Bid and ask displays the market bid and ask quotes, the ATM is the ATM mid-point volatility. Model implied volatility is displayed using the continuous black line. MSE is the mean squared error between the model and the mid of market bid-ask volatilities.

In Fig. 8, we show the quality of model fit on this date. We observe positive skewness of implied volatilities, which results in positive volatility beta. Continuous black line shows model implied volatilities computed using the second-order affine expansion in Eq. (5.9). Bid and ask displays the market bid and ask quotes, the ATM is the ATM mid-point volatility. MSE denotes the mean squared error between the model implied volatilities and the mid of market bid-ask implied volatilities. We see that the log-normal SV model calibrated to Bitcoin options is able to capture the market implied skew very well across most liquid maturities with only four parameters. The average MSE is about 0.5% for longer maturities, which is within the bid-ask spread. Calibration to the ATM region can be further improved using a term structure of the mean volatility 𝜃. In a practical setting, the SV model dynamics can be augmented with a local volatility (Lipton (2002)) to accurately fit the implied volatility surface.

6.3. Comparison of valuation under MMA and inverse measures

Since Bitcoin options, which are traded on the Deribit exchange, have inverse payoffs, it is important that there is no-arbitrage between the MMA and the inverse measures. In Fig. 9, we compare the model implied volatilities computed from prices of vanilla and inverse options valued under the MMA and inverse measures, respectively. We use the second-order affine expansion with model parameters reported in Eq. (6.4) for computing model implied volatilities for the same maturities as in Fig. 8. As benchmark, we use the MC simulation of the model dynamics (3.12) using scheme in Eq. (3.59) with the number of paths equal 400,000 and daily time steps. We show the 95% confidence interval of MC estimates as dashed lines like bid/ask lines

Fig. 9.

Fig. 9. Implied volatilities of Bitcoin options which correspond to the slices used in model calibration and a uniform range of strikes and which are computed using model parameters reported in Eq. (6.4) under the MMA measure (MMA) and the inverse measure (Inverse). Continuous aqua and pink lines show model implied volatilities computed using the second-order affine expansion in Eqs. (5.9) and (5.16), respectively. Dashed lines labeled by MC0.95ci and MC+0.95ci are MC 95% confidence intervals computed using scheme in Eq. (3.59) with number of path equal 400,000 and daily time steps. MSE is the mean squared error between BSM volatilities inferred from model values and the MC estimates.

We observe that option values computed using the second order affine expansion under the MMA and inverse measures are very close to each other with the differences being within the numerical accuracy of an ODE solver and Fourier inversion (of order 104). The difference in terms of implied volatilities does not exceed 104.

In Fig. 10, we show model prices of call options on the QV computed using the second-order affine expansion under the MMA and inverse measures, and the comparison with MC estimates. We use the same expiries as for the model calibration and the same parameters reported in Eq. (6.4). We also observe that the solution under the both measures are consistent and agree with MC simulations. Interestingly, we observe that the implied volatility skew of options on the QV is upward sloping, which is observed in market data of options on the QV and listed options on VIX index. It is known that Heston model, even augmented with jumps, produces downward sloping skews on options on the QV (see for an example Drimus (2012) and Sepp (2012)). As a result, the log-normal SV model implies realistic patterns of implied volatility-of-volatility.

Fig. 10.

Fig. 10. Implied volatilities of call options on QV of Bitcoin using model parameters in Eq. (6.4) and for the same maturities as in model calibration in Fig. 8. We use the valuation under the MMA measure (MMA) and the inverse measure (Inverse). BSM volatilities are implied from model prices using expected QV Îτ={0.18,0.20,0.23} computed using Eq. (3.53). Continuous aqua and pink lines show model implied volatilities computed using the second-order affine expansion under MMA measure in Eq. (5.20) and under inverse measure in (5.24), respectively. Dashed lines labeled by MC0.95ci and MC+0.95ci are MC 95% confidence intervals computed using scheme in Eq. (3.59) with number of path equal 400,000 and daily time steps. MSE is the mean squared error between BSM volatilities inferred from model values and the MC estimates.

7. Conclusion

We have considered the log-normal SV model with the quadratic drift which plays an important role for the existence of martingale measures. We have shown that the log-normal volatility process has the strong solution in spite of having quadratic drift. We have derived admissible bounds of model parameters for the existence of both the MMA measure and the inverse measures. Based on that, we applied proposed SV model for modeling implied volatility surfaces of assets with positive return–volatility correlation.

Since the log-normal SV model is not affine, there is no analytical solution for the MGF in this model. To circumvent this, we have developed an affine expansion approach under which the joint MGF of three state variables (log-price, the QV, and the volatility) is decomposed into a leading term, which has exponential-affine form, and a residual term, whose estimate depends on the higher order moments of the volatility process. We have proved that the second-order leading term is consistent with the expected values and the variances of the state variables. We have applied Fourier inversion techniques for valuation of vanilla and inverse options on spot and futures underlying. By comparison of model values computed using our method with estimates obtained from MC simulations, we have shown that the second-order leading term is very accurate for option valuation. We have demonstrated that our model fits implied volatilities of assets with positive return–volatility correlation. We have shown that the log-normal SV model fits well different market regimes by calibrating model parameters to time series of options on Bitcoin for the past four years.

We have derived backward Euler–Maruyama scheme for discretization of the log-volatility process and show that it has a strong convergence rate of 1. Proposed scheme for the simulation of the log-normal SV model is robust, computationally efficient and does not require any special treatment in discretization.

As an extension of the frameworke we shall consider the applications to the rough SV models proposed by Gatheral et al. (2018) using log-normal volatility. We shall also consider a path-dependent model of the log-normal volatility following approaches in Guyon & Lekeufack (2023) and Lipton and Reghai (2023).f

Acknowledgments

The authors are thankful to Vladimir Lucic for valuable insights and comments. We thank Alan Lewis, Johannes Muhle-Karbe, Blanka Horvath, Kalev Pärna and participants at Imperial College Finance and Stochastics Seminar, TU Munich Oberseminar, ETH Seminar, EAJ Conference 2022 in Tartu for useful discussion and comments. We are grateful for anonymous referees for helpful comments.

Notes

a Github project https://github.com/ArturSepp/StochVolModels provides Python code with the implementation of option valuation using the affine expansion and examples of model calibration to implied volatility data applied in this paper.

b Exp-OU SV models are studied in Fouque et al. (2000), Detemple & Osakwe (2000), Masoliver & Perelló (2006), Perelló et al. (2008), Muhle-Karbe et al. (2022), Drimus (2012). Exp-OU SV models are widespread in industry as a basis for local SV models (Lipton (2002)), see Bergomi (2015), Bühler (2006), Ren et al. (2007), Henry-Labordère (2009), Bergomi & Guyon (2012).

c Such linear specification is the simplest functional form that allows to establish simple relationship between gains on a delta-hedged option portfolio and level of the market volatility. As shown in Bakshi & Kapadia (2003), such form allows to get precise information about the sign and strength of the risk premium.

d We apply scipy implementation of Sequential Least Squares Programming (SLSQP) algorithm.

e Sepp & Rakhmonov (2023a) apply the developed solution for modeling of SV in Cheyette-type interest rate model and obtain closed-form solution for valuation of swaptions under this model.

f The volatility beta is an interesting variable to make path-dependent as well. We observe that in many markets the implied options skewness follows most recent price performance. For example, in options on Bitcoin, the skewness implied from option prices becomes positive following strong performance of Bitcoin.

Appendix A. Proofs

A.1. Proof of Theorem 3.2

We consider the interval (l,r) and define the scale function and its density, denoted by S(x) and s(x), respectively, as follows (see Chap. IV.15 in Borodin (2017)) :

s(x)=expcx2μ(σ)v(σ)2dσ,S(x)=cxs(y)dy,(A.1)
where x(l,r) and c is an arbitrary but fixed interior point of (l,r). We define the speed function and its density, denoted by M(x) and m(x), respectively, by
m(x)=2v(x)2expcx2μ(σ)v(σ)2dσ,x(l,r),M(x)=cxm(y)dy.(A.2)

Choosing c such that cx2μ(σ)v(σ)2dσ=0 and setting η=2κ1κ2𝜃𝜗2, we obtain

s(x)=xηexp2κ1𝜃𝜗21x+2κ2𝜗2x,m(x)=2𝜗2xη2exp2κ1𝜃𝜗21x2κ2𝜗2x.S(x)=cxs(y)dycxyηexp2κ1𝜃𝜗21ydy,x0+,S(x)=cxs(y)dycxyηexp2κ2𝜗2ydy,x+.(A.3)
Using upper and lower bounds in (A.3), we find that
S(0+)=limx0+S(x)=,S(+)=limx+S(x)=+.(A.4)
Consequently, boundaries l=0, r=+ are not attractive.

Now we show that boundary r=+ is unattainable. We define Σ(l), Σ(r) by

Σ(l):=l<z<y<xs(z)m(y)dzdy,Σ(r):=x<y<z<rs(z)m(y)dzdy,(A.5)
Σ(r)=xryrs(z)dzm(y)dy=xrxym(z)dzs(y)dy=xrxy2𝜗2zη2exp2κ1𝜃𝜗21z2κ2𝜗2zdzyη×exp2κ1𝜃𝜗21y+2κ2𝜗2ydy.(A.6)

We note that the right hand side of (A.6) is finite if and only if

Σ̂(r):=xrxyzη2exp2κ2𝜗2zdzyηexp2κ2𝜗2ydy(A.7)
is finite. We split inner integral into two integrals, over [x,+) and [y,+)
xyzη2exp2κ2𝜗2zdz=x+dzy+dz=I1I2.(A.8)

We see that I1 is finite for all κ1,κ2>0. Using asymptotics of incomplete gamma function (see Eq. (6.5.32) in Abramowitz & Stegun (1972)), we obtain for y+

I2=y+zη2exp2κ2𝜗2zdz=2κ2𝜗21yη2×exp2κ2𝜗2y1+O1y.(A.9)

Substituting asymptotic expansion (A.9) into Eqs. (A.8) and (A.7), we have

Σ̂(r)=I1xryηexp2κ2𝜗2ydy2κ2𝜗21xry2dy+Oxry3dy,y+.

We conclude that Σ̂(+)=+. Hence, boundary r=+ is unattainable. Next, we show that boundary l=0 is unattainable as well. By definition of Σ(l)

Σ(l)=lxlys(z)dzm(y)dy=lxyxm(z)dzs(y)dy=lxyx2𝜗2zη2exp2κ1𝜃𝜗21z2κ2𝜗2zdzyη×exp2κ1𝜃𝜗21y+2κ2𝜗2ydy.
We make an reciprocal transformation {z1/z,y1/y} to obtain
Σ(l)=1/x1/l1/xy2𝜗2zηexp2κ1𝜃𝜗2z2κ2𝜗21zdzyη2×exp2κ1𝜃𝜗2y+2κ2𝜗21ydy.
The right-hand side of is finite if and only if
Σ̂(l):=1/x1/l1/xyzηexp2κ1𝜃𝜗2zdzyη2exp2κ1𝜃𝜗2ydy(A.10)
is finite. Splitting inner integral into two, over [1/x,+) and [y,+), respectively
1/xyzηexp2κ1𝜃𝜗2zdz=1/x+dzy+dz=I1I2,(A.11)
we see that I1 if finite for all κ1,κ2>0. As before, using asymptotic expansion of incomplete gamma function we have for y+ :
I2=y+zηexp2κ1𝜃𝜗2zdz=𝜗22κ1𝜃yηexp2κ1𝜃𝜗2y1+O1y.(A.12)

Substituting Eq. (A.12) into Eqs. (A.11) and (A.10), we have for y+

Σ̂(l)=I11/x1/lyη2exp2κ1𝜃𝜗2ydy𝜗22κ1𝜃1/x1/ly2dy+O1/x1/ly3dy.
We conclude that Σ̂(+)=+. Hence, boundary l=0 is unattainable.

A.2. Proof of Theorem 4.1

We provide a version of Feynman–Kac formula in general multidimensional setting, building on results in Heath & Schweizer (2000). Let T>0 be a fixed time horizon and D be a domain in d. We consider Cauchy problem

Ut+U=0,(t,x)Q,Q=(0,T)×D,U(T,x)=ϕ(x),xD,(A.13)
where bt:Dd, at:Dd×d are continuous functions and operator is
U=i=1dbti(x)Uxi(t,x)+12i,k=1datik(x)2Uxixk(t,x).(A.14)

We assume that matrix (atij) is nonnegative definite for every t. Under certain restrictions on the coefficients of operator and growth conditions on ϕ, it is possible to represent the solution of (A.13) by means of conditional expectation known as a Feynman–Kac formula. However, standard results require uniform ellipticity of the operator , see Friedman (1975), which is not satisfied in our case, because the QV It is degenerate. Thus, problem (A.13) becomes degenerate parabolic one.

We consider the multi-dimensional SDE

dXs=bs(Xs)ds+σs(Xs)dWs,Xt=xD,(A.15)
where W=(W(1),,W(d)) is d-dimensional Brownian motion and σt(x)=(σtij(x)) denotes square root of matrix at(x): atik(x)=(σt(x)σt(x))ik. We assume that (A.15) admits unique strong solution and define U:[0,T]×D[0,+] by U(τ,x)=𝔼[ϕ(XT)|Xt=x] which is well defined in [0,+] if X does not explode or leave D before T. We give sufficient conditions to ensure that U solves the problem (A.13).

Theorem A.1 (Existence). Suppose that following conditions hold:

(1)

Solution X of (A.15) neither explodes nor leaves D before T.

suptsT|Xs|<=(XsD,s[t,T])=1.

(2)

ϕ(x) is bounded in D.

Let UC(Q¯)C2(Q) be the solution of problem (A.13). Then U(t,x)=U(t,x) for any (t,x)Q¯.

Proof. Consider sequence of bounded domains {Dn}n=1 contained in D such that n=1Dn=D and Dn{xd:x<n}. We fix xD and find n such that xDn. Let η=inf{st:XsD}, ηn=inf{st:XsDn} denote exit time from domains D and Dn. By Itô’s formula

u(ηnT,XηnT)=u(x)+tηnTUsU(s,Xs)ds+tηnTUxiσij(Xs)dWs(j).

As domain Dn is bounded and UC2(Q), we have

𝔼t,xtηnTUxiσij(Xs)dWs(j)=0u(x)=𝔼t,x[u(ηnT,XηnT)].(A.16)

We show that t,x[τnT]c1(1+x2)n2. Using Markov inequality and estimate from Karatzas & Shreve (1991), p. 306 :

t,x[τnT]t,xsups[t,T]Xsn𝔼t,xsups[t,T]|Xs|2n2,𝔼t,xsups[t,T]|Xs|2c(1+x2).

Thus limn+u(ηnT,XηnT)=ϕ(XT) a.s. It implies that u(t,x) is bounded by the maximum principle. Finally, by the dominated convergence theorem :

limn+𝔼[u(ηnT,XηnT)]=𝔼[ϕ(XT)].(A.17)
Combining (A.16) and (A.17) concluded the proof. □

A.3. Consistency of moment for second-order affine expansion

We apply the following lemma relating moments to derivatives of MGF.

Lemma A.1. The MGF defined by the following equation:

G(Φ)=𝔼[eΦZ],(A.18)
where Z is a random variable with finite MGF has the following properties:
G(0)=1,ΦG(Φ)|Φ=0GΦ(0)=𝔼[Z],Φ2G(Φ)|Φ=0GΦΦ(0)=𝔼[Z2].(A.19)

Corollary A.1. Given that MGF assumes an exponential-affine form

G(Φ)=ef(Φ)(A.20)
with f(0)=0, we have
fΦ(0)=𝔼[Z],fΦΦ(0)=𝕍ar[Z].(A.21)

A.3.1. Proof of Proposition 4.4

We set Φ=Ψ=0 in the second-order expansion in Proposition 4.4 and suppress arguments for brevity. We consider R[2](τ,Y;Θ) in Eq. (4.26). Differentiating it w.r.t. Θ, we see that remainder term RΘ[2] solves the following problem :

RΘ,τ[2]+(Y)RΘ[2]=k=58Yk[CΘ(k)(τ;Θ)E[2]+C(k)(τ;Θ)EΘ[2]],(A.22)
subject to boundary condition RΘ[2](0,Y;Θ)=0, where C(k)(τ;Θ) are given by Eq. (4.27). We note that A(k)(τ;Θ) vanish at Θ=0.
A(k)(τ;Θ=0)=0.(A.23)

Calculating CΘ(k)(τ;Θ) and taking into account conditions (A.23) at Θ=0

CΘ(k)(τ;Θ)|Θ=0=4κ2AΘ(4)(τ;Θ)|Θ=0,k=5,0,k6.(A.24)

Using condition C(k)(τ;Θ=0)0, we simplify problem (A.22) at Θ=0 :

RΘ,τ[2]+(Y)RΘ[2]=4κ2AΘ(4)Y5,RΘ[2](0,Y;Θ)=0.(A.25)

We see that RΘ[2](τ,Y;Θ=0)0 when κ2=0. Hence, second order decomposition E[2](τ,Y;Θ) is consistent with expected value of Yt for κ2=0.

To prove the consistency for variance, we first differentiate (4.25) w.r.t. Θ to obtain that AΘ(k)(τ,Θ) solve the system of ODEs (A.26) at Θ=0 as functions of τ.

AΘ,τ(0)=𝜃2𝜗2AΘ(2)+λ(p)AΘ(1),AΘ,τ(1)=κAΘ(1)+2(𝜃𝜗2+λ(p))AΘ(2)+3𝜃2𝜗2AΘ(3),AΘ,τ(2)=κ2AΘ(1)+(𝜗22κ)AΘ(2)+6𝜃𝜗2AΘ(3)+3λ(p)AΘ(3)+6𝜃AΘ(4),AΘ,τ(3)=2κ2AΘ(2)+3(𝜗2κ)AΘ(4)+12𝜃𝜗2AΘ(4)+4λ(p)AΘ(4),AΘ,τ(4)=3κ2AΘ(3)+2(3𝜗22κ)AΘ(4)(A.26)
with boundary condition AΘ(1)(τ;Θ)|Θ=0=1, AΘ(k)(τ;Θ)|Θ=0=0, k1.

We verify that last two functions in solution of Eq. (A.26) are zero for κ2=0 :

A(3)(τ;Θ)A(4)(τ;Θ)0.(A.27)

Now, taking the second derivative w.r.t. Θ in (4.26), we obtain for RΘΘ[2]

RΘΘ,τ[2]+(Y)RΘΘ[2]=k=58{Yk[CΘΘ(k)(τ;Θ)E[2]+2CΘ(k)(τ;Θ)EΘ[2]+C(k)(τ;Θ)EΘΘ[2]]}
with RΘΘ[2](0,Y;Θ)=0. Taking the second derivative of C(k)(τ;Θ) in (4.27) w.r.t. Θ, and accounting for conditions (A.23) and (A.27) at Θ=0, we find that
CΘΘ(k)(τ;Θ)|Θ=0=4κ2AΘ(4)(τ;Θ)|Θ=0,k=5,0,k6.(A.28)

We note that following straightforward relationships are valid :

E[2](τ;Θ)|Θ=0=1,EΘ[2](τ;Θ)|Θ=0=Y.(A.29)

Combining Eqs. (A.28), (A.24), and (A.29), we simplify Eq. (A.25) at Θ=0 :

RΘΘ,τ[2]+(Y)RΘΘ[2]=4κ2(Y5AΘΘ(4)2Y6AΘ(4)),RΘΘ[2](0,Y;Θ)=0.(A.30)
Thus, when κ2=0, RΘΘ[2](τ,Y;Θ=0)0, and E[2](τ;Θ) reproduces the variance of Yt.

A.3.2. Proof of Proposition 4.5

We proceed as in Proposition 4.4. We set Θ=Ψ=0 and consider the remainder term R[2](τ,Y;Φ) in (4.26). Differentiating it w.r.t. Φ, we see that RΦ[2] solves following problem :

RΦ,τ[2]+(Y)RΦ[2]=Φk=58C(k)(τ;Φ)E[2]Yk=k=58Yk[CΦ(k)(τ;Φ)E[2]+C(k)(τ;Φ)EΦ[2]],RΦ[2](0,Y;Φ)=0,(A.31)
where C(k)(τ;Φ) are given by (4.27). We note that A(k)(τ;Φ) vanish at Φ=0 :
A(k)(τ;Φ)|Φ=0=0.(A.32)

Differentiating C(k)(τ;Φ) w.r.t. Φ and considering (A.32) at Φ=0, we find

CΦ(k)(τ;Φ)|Φ=0=4κ2AΦ(4)(τ;Φ)|Φ=0,k=5,0,k6.(A.33)

Due to condition C(k)(τ;Φ=0)=0, we simplify Eq. (A.31) at Φ=0 as

RΦ,τ[2]+(Y)RΦ[2]=4κ2AΦ(4)Y5,RΦ[2](0,Y;Φ)=0.(A.34)

We see that RΦ[2](τ,Y;Φ=0)0 when κ2=0. Hence, the second-order expansion term E[2](τ,Y;Φ) is consistent with the expected value of Xτ for κ2=0.

To show that the second-order expansion reproduces the variance of the log-price, we differentiate Eq. (4.25) with respect to Φ and find that AΦ(k)(τ,Φ) solve the system of ODEs (A.35) at Φ=0 as functions of τ :

AΦ,τ(0)=𝜃2𝜗2AΦ(2)+λ(p)AΦ(1)+12𝜃2,AΦ,τ(1)=κAΦ(1)+2(𝜃𝜗2+λ(p))AΦ(2)+3𝜃2𝜗2AΦ(3)+𝜃,AΦ,τ(2)=κ2AΦ(1)+(𝜗22κ)AΦ(2)+6𝜃𝜗2AΦ(3)+3λ(p)AΦ(3)+6𝜃AΦ(4)+12,AΦ,τ(3)=2κ2AΦ(2)+3(𝜗2κ)AΦ(4)+12𝜃𝜗2AΦ(4)+4λ(p)AΦ(4),AΦ,τ(4)=3κ2AΦ(3)+2(3𝜗22κ)AΦ(4)(A.35)
with boundary condition
AΦ(k)(τ;Φ)|Φ=0=1,k=1,0,k1.(A.36)

Now, taking the second derivative in (4.26) w.r.t. Φ, we obtain for RΦΦ[2]

RΦΦ,τ[2]+(Y)RΦΦ[2]=2Φ2k=58C(k)(τ;Φ)E[2]Yk=k=58{Yk[CΦΦ(k)(τ;Φ)E[2]+2CΦ(k)(τ;Φ)EΦ[2]+C(k)(τ;Φ)EΦΦ[2]]},×RΦΦ[2](0,Y;Φ)=0.(A.37)

Calculating CΦΦ(k)(τ;Φ) in (4.27) and taking into account boundary conditions (A.32) at Φ=0, we find that

CΦΦ(k)(τ;Φ)|Φ=0=4κ2AΦ(4)(τ;Φ)|Φ=0,k=5,0,k6.(A.38)
We note that following straightforward relationships are valid :
E[2](τ;Φ)|Φ=0=1,EΦ[2](τ;Φ)|Φ=0=0.(A.39)

Combining conditions (A.38), (A.33), and (A.39), we are able to simplify problem (A.34) at Φ=0 as follows :

RΦΦ,τ[2]+(Y)RΦΦ[2]=4κ2(Y5AΦΦ(4)2Y6AΦ(4)),RΦΦ[2](0,Y;Φ)=0.(A.40)
Thus, when κ2=0, RΦΦ[2](τ,Y;Φ=0)0, so that E[2](τ,Y;Φ) is consistent with the variance of Xt.

A.4. Proof of Theorem 4.7

We consider following system of d nonlinear ODEs :

τAi(τ;Γ)=A(τ;Γ)MiA(τ;Γ)+LiA(τ;Γ)+Hi,Ai(0,u)=Ai,0(A.41)
for i=1,,d. We assume that Γ3 and each matrix Mi is symmetric real-valued, and vectors Li and Hi can be complex-valued. Hence solutions Ai(τ;Γ), i=1,,d are complex-valued functions. It is well known that complex-valued solution A(τ;Γ) is defined on maximal interval of existence [0,τ+(Γ;A)) such that either τ+(Γ;A)=+ or τ+(Γ;A)<+ and A(τ;Γ)+ as ττ+(Γ;A).

We combine d nonlinear ODEs in (A.41) into single (nonlinear) vector ODE :

τA(τ;Γ)=M(A(τ;Γ))+LA(τ;Γ)+H,A(0,u)=A0,(A.42)
where M:dd is a vector-valued function of argument Ad. For convenience, we denote the right-hand side of (A.42) as follows :
R(A):=M(A)+LA+H.(A.43)

Lemma A.2. There exists finite and nonnegative function g such that

A(τ;Γ)2A02+A020th(s)e0sh(r)drds,h(s)=g((A(τ;Γ)).(A.44)

Proof. The proof is essentially contained in Keller-Ressel & Mayerhofer (2015) and based on the application of Gronwall’s inequality. □

Lemma A.3. Assume that following conditions are satisfied:

(1)

HI=HI(u)=(0,,0)d when Γ is a real vector,

(2)

LI=LI(u) is zero d×d matrix when Γ is a real vector,

(3)

zero initial condition A(0,Γ)=(0,,0)d.

Then solution A(τ;Γ) of system (A.42) remains real for real Γd.

Proof. The proof is available upon request. □

We now consider the problem of continuity of solutions of quadratic differential systems arising in option valuation problem. It is important to highlight that linear L(p) and free terms H(p) satisfy assumptions of Lemma (A.3) when we set p=1 and restrict Φ=1/2, Φ=Ψ=0 for pricing vanilla options under MMA measure. We use same argument for the inverse options when we set p=1 and restrict Φ=1/2, Φ=Ψ=0. We omit straightforward calculations.

We use τ+(Φ;A) and τ+(Φ;AR) to denote blow-up times τ+(Γ;A) and τ+(Γ;AR) when Γ=(Φ,Ψ=0,Θ=0) and Γ=(Φ,Ψ=0,Θ=0).

Theorem 4.7. We assume first that Φ. We denote

D(τ):={Φ:τ<τ+(Φ;A)},M(τ):={Φ:G(τ;X,I,Y;Φ)<}.
We argue by contradiction. We assume that solution A(τ;Φ) blows up before τ0 :
τ+(Γ;A)<τ0.(A.45)

We set α:=sup{α0:αΓD(τ)}. Then assumption (A.45) implies that α<1. On the one hand

G(τ;X,I,Y;αΦ,Ψ=0,Θ=0)G(τ;X,I,Y;αΦ,Ψ=0,Θ=0)α<(A.46)
for all α<α<1 due to Jensen inequality. On the other hand, we have
G(τ;X,I,Y;αΦ,Ψ=0,Θ=0)=expαΦX+k=02nAk(τ;αΦ,Ψ=0,Θ=0)Yk+exp{αΦX}R[2](τ;X,I,Y;αΦ,Ψ=0,Θ=0).

Thus, we obtain

limααG(τ;X,I,Y;αΦ,Ψ=0,Θ=0)=+(A.47)
under assumptions 2 and 3. Comparing (A.46) and (A.47) we get contradiction, so
τ+(Φ;A)τ0.(A.48)

Now, we let Φ be complex. Due to assumption 4.7 and Lemma A.2, we have τ+(Φ;A)τ+(Φ;AR)τ(Φ;A). Repeating the first part of the proof for Φ, leads to (A.48), so that τ+(Φ;A)τ0. Hence, solution A(τ;Φ,Ψ=0,Θ=0) must remain continuous on [0,τ0). □

ORCID

Artur Sepp  https://orcid.org/0000-0002-7038-1748

Parviz Rakhmonov  https://orcid.org/0000-0001-9571-7378