
Extended correlations, i.e., correlations that can take values less than −1 and/or greater than 1, occur naturally in mathematical models of financial processes. They also occur in financial practice, especially in dispersion trading, where they imply arbitrage opportunities. Motivated by this theoretical and practical emergence, we derive a mathematical framework for extended correlations and explain their interpretations and applications. The resulting approach is broader than the conventional one and can model conventional as well as extended correlations.

New discoveries are often met with skepticism and resistance. When negative numbers came to Europe in the books of Eastern mathematicians, critics dismissed them as not sensible. Many well-known European mathematicians, e.g., Jean le Rond d’Alembert (1717-1783) and Augustus De Morgan (1806-1871), rejected the sensibility of negative numbers until the 18th century and referred to them as “absurd” or “meaningless” (Kline, 1980). It was a common practice to criticize any negative results derived from equations, on the assumption that they were meaningless (Martinez, 2006).

Likewise, irrational numbers and later imaginary numbers were at first rejected. Today these concepts are accepted and applied in numerous scientific and practical fields, such as physics, chemistry, biology and finance.

Similarly, probabilities less than 0 and greater than 1 were long considered not sensible. However, these extended probabilities, and especially their important special case, negative probabilities with values between −1 and 1, have been applied in physics for quite a while (cf. Wigner, 1932 [

Mathematical patterns of negative probabilities were studied in Bartlett 1945 [

In this paper, we study correlation coefficients used in mathematical models of financial markets. By their conventional construction, correlation coefficients cannot be larger than 1 or smaller than −1. However, by exploring the theory and practice of financial markets, we have found that correlation coefficients beyond these limits do emerge. We call them extended correlations and develop a mathematical theory for them.

The significance of our research lies in the fact that conventional mathematical theories cannot fully model all existing correlation values and processes in finance. That is why we introduce and study the concept of extended correlations, which is more general than conventional correlations and includes them as a special case. Hence all correlations in finance (and other sciences), i.e., conventional correlations as well as correlations that are smaller than −1 or larger than 1, can be evaluated using the concept of extended correlations.

The rest of the paper is organized as follows. In Section 2, we show that extended correlations can naturally occur in mathematical models of financial processes. In Section 3, we find extended correlations in the practice of financial markets, which imply arbitrage opportunities. In Section 4, we build mathematical foundations for extended correlations. Section 5 addresses the limitations of our concept. Section 6 concludes.

Financial variables such as stocks, bonds, interest rates, commodities, and volatilities are stochastic, i.e. they can be only predicted with a certain probability. Therefore, it is a good idea to model financial variables with stochastic processes as it is done in mathematical finance.

Recent events, such as the global financial crisis of 2007-2009, have highlighted a critical financial variable: correlation. In the crisis, correlations between many financial variables such as stocks, bonds, loans, and especially sub-prime mortgage loans, often securitized in a CDO, increased sharply and led to large unexpected losses. Hence, “Correlation Risk”, the risk of an unfavorable change in correlation, has recently been addressed in financial modeling as well as in risk management and regulation.

It has been suggested (see Emmerich, 2006) to model correlation with a bounded Jacobi process of the form

$$d\rho = a\,(m_\rho - \rho_t)\,dt + \sigma_\rho \sqrt{(h - \rho_t)(\rho_t - f)}\;\varepsilon_t \sqrt{dt} \tag{1}$$

where:

ρ is the Pearson correlation coefficient,

a is the mean reversion parameter (speed, gravity), i.e. the degree with which the correlation at time t, ρ_{t}, is pulled back to its long term mean m_{ρ}; 0 ≤ a ≤ 1,

σ_{ρ} is the volatility of ρ; σ_{ρ} > 0,

m_{ρ} is the long term mean of the correlation ρ,

h is the upper boundary level and f is the lower boundary level, i.e. h ≥ ρ ≥ f,

ε_{t} is a random drawing from a standard normal distribution at time t, ε_{t} ~ N(0,1).

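Applying traditional Pearson correlation values, i.e. −1 ≤ ρ ≤ +1, the upper bound h becomes +1 and the lower bound f becomes −1, and Equation (1) reduces to

$$d\rho = a\,(m_\rho - \rho_t)\,dt + \sigma_\rho \sqrt{1 - \rho_t^2}\;\varepsilon_t \sqrt{dt} \tag{2}$$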

For high values of the correlation volatility σ_{ρ} and low values of the mean reversion parameter a, Equations (1) and (2) can result in correlation values ρ < −1 or ρ > +1, and the equations can then no longer be evaluated, since the square-root terms take negative arguments. The Jacobi process therefore comes with boundary conditions, which are condition (3) for Equation (1) and condition (4) for Equation (2).

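However, there are problematic issues with Equations (1) and (2) and their boundary conditions (3) and (4). When modeling financial correlations in practice, we have to discretize Equations (1) and (2), which then become

$$\rho_t = \rho_{t-1} + a\,(m_\rho - \rho_{t-1})\,\Delta t + \sigma_\rho \sqrt{(h - \rho_{t-1})(\rho_{t-1} - f)}\;\varepsilon_t \sqrt{\Delta t} \tag{5}$$

and

$$\rho_t = \rho_{t-1} + a\,(m_\rho - \rho_{t-1})\,\Delta t + \sigma_\rho \sqrt{1 - \rho_{t-1}^2}\;\varepsilon_t \sqrt{\Delta t} \tag{6}$$

respectively.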

For Equations (5) and (6) the boundary conditions (3) and (4) are invalid, i.e. even if the boundary conditions are met, it can happen that Equations (5) and (6) cannot be evaluated. This is especially the case for high correlation volatility σ_{ρ} and low values of the mean reversion parameter “a”. There are several solutions to this problem:

1) We can introduce limits which the correlation values can take. The limits would be h and f for Equation (5) and −1 and +1 for Equation (6). We could compute Equation (5) as

$$\rho_t = \begin{cases} f & \text{if } \hat{\rho}_t < f, \\ h & \text{if } \hat{\rho}_t > h, \\ \hat{\rho}_t & \text{otherwise,} \end{cases} \tag{7}$$

where $\hat{\rho}_t$ denotes the value simulated with Equation (5). Equation (7) reads: If the simulated correlation coefficient at t is smaller than the lower boundary f, take the value f; if the simulated correlation coefficient at t is greater than the upper boundary h, take the value h; otherwise apply Equation (5). Equation (6) is treated in the same way with the lower boundary f = −1 and the upper boundary h = +1. However, approach (7) is arbitrary and model-inconsistent.

2) We can allow extended correlations by increasing the upper boundary h above +1 and decreasing the lower boundary f below −1 in Equation (5). However, increasing h and decreasing f would effectively increase the volatility of the correlation ρ, since the last terms of Equations (5) and (6) are amplified.

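3) A viable solution to this problem is to allow extended correlations and to describe correlation by means of a standard mean-reverting Vasicek model of the form

$$d\rho = a\,(m_\rho - \rho_t)\,dt + \sigma_\rho\,\varepsilon_t \sqrt{dt} \tag{8}$$

or by its discrete version

$$\rho_t = \rho_{t-1} + a\,(m_\rho - \rho_{t-1})\,\Delta t + \sigma_\rho\,\varepsilon_t \sqrt{\Delta t} \tag{9}$$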

Equations (8) and (9) can be evaluated in every simulation with the standard parameter values 0 ≤ a ≤ 1 and σ_{ρ} > 0.
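The three solutions above can be compared in a short simulation. The following Python sketch is only an illustration under stated assumptions: it uses a plain Euler discretization of a bounded Jacobi process and of a Vasicek process, the function names `simulate_jacobi` and `simulate_vasicek` are hypothetical, and all parameter values in the demo are made up. It shows how the square-root term of the bounded process can become impossible to evaluate, how hard limits avoid this, and why the unbounded Vasicek form never fails.

```python
import numpy as np


def simulate_jacobi(rho0, a, m, sigma, n_steps, dt, rng, f=-1.0, h=1.0, clip=False):
    """Euler scheme for a bounded Jacobi process on [f, h] (assumed discretization).

    Returns an array of length n_steps + 1. Once the square-root argument
    turns negative, the remaining entries stay NaN.
    """
    rho = np.full(n_steps + 1, np.nan)
    rho[0] = rho0
    for t in range(1, n_steps + 1):
        prev = rho[t - 1]
        radicand = (h - prev) * (prev - f)
        if radicand < 0:  # the diffusion term cannot be evaluated any more
            break
        shock = sigma * np.sqrt(radicand) * rng.standard_normal() * np.sqrt(dt)
        nxt = prev + a * (m - prev) * dt + shock
        if clip:  # hard limits in the spirit of solution 1)
            nxt = min(h, max(f, nxt))
        rho[t] = nxt
    return rho


def simulate_vasicek(rho0, a, m, sigma, n_steps, dt, rng):
    """Euler scheme for an unbounded mean-reverting Vasicek process."""
    rho = np.empty(n_steps + 1)
    rho[0] = rho0
    for t in range(1, n_steps + 1):
        prev = rho[t - 1]
        rho[t] = prev + a * (m - prev) * dt + sigma * rng.standard_normal() * np.sqrt(dt)
    return rho


if __name__ == "__main__":
    # Hypothetical parameters: high correlation volatility, low mean reversion.
    params = dict(rho0=0.5, a=0.1, m=0.3, sigma=1.5, n_steps=250, dt=1.0)
    paths = {
        "Jacobi": simulate_jacobi(rng=np.random.default_rng(0), **params),
        "Jacobi, clipped": simulate_jacobi(rng=np.random.default_rng(0), clip=True, **params),
        "Vasicek": simulate_vasicek(rng=np.random.default_rng(0), **params),
    }
    for name, path in paths.items():
        print(name, "| steps not evaluated:", int(np.isnan(path).sum()),
              "| min:", np.nanmin(path), "| max:", np.nanmax(path))
```

The unclipped bounded process stops producing values as soon as one step leaves the interval [f, h], whereas the Vasicek path is always defined but may leave [−1, 1], which is exactly where extended correlations enter.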

In summary, the Jacobi process, which has been suggested to model correlation, can lead to errors when simulating discrete correlations in practice. Increasing the boundaries in a Jacobi process solves the problem, however at the cost of increased correlation volatility. A viable solution is to apply a standard Vasicek model and allow extended correlations. This guarantees that every real-world discrete correlation simulation is executed without producing error values.

Extended Correlations as an Input for Financial Models

In finance, the Pearson correlation coefficient or a Pearson correlation matrix serves as an input for many financial models. For example, copulas typically apply the Pearson correlation coefficient or a Pearson correlation matrix. Due to its convenient properties, the thin-tailed Gaussian copula is often used in finance. For the bivariate Gaussian copula, the density function is

where the superscript −1 denotes the inverse of the standard normal distribution function, and ρ is the Pearson correlation coefficient.

From

To derive the dependency for more than two variables, often a factorization is applied. This OFGC (one-factor Gaussian copula) model is

$$x_i = \rho\,M + \sqrt{1 - \rho^2}\;Z_i \tag{11}$$

where:

M is the systematic factor, which impacts all variables x_{i}. Like ε, M is a random drawing from a standard normal distribution, M ~ N(0,1).

Z_{i} is the idiosyncratic factor of entity i. Just like M, Z_{i} is a random drawing from a standard normal distribution, Z_{i} ~ N(0,1).

x_{i} is the variable of entity i, which results from the systematic factor M and the idiosyncratic factor Z_{i}. For ρ = 1, the variables x_{i} are equal to M in a certain simulation, i.e. all x_{i} are identical. For ρ = 0, all x_{i} are equal to their idiosyncratic factor Z_{i}, i.e. they are independent.

The seminal Heston model applies a model similar to Equation (11). It correlates two Brownian motions dz_{1} and dz_{2} with the Pearson correlation coefficient ρ, where dz = ε_{t}√dt and ε_{t} is defined as in Equation (1). The core equation is

$$dz_1(t) = \rho\,dz_2(t) + \sqrt{1 - \rho^2}\;dz_3(t) \tag{12}$$

where dz_{3} is a Brownian motion independent of dz_{2}.

Numerous extensions of the Heston (1993) model exist, see for example Hagan et al. (2002) [

By construction, Equations (10) to (12) limit the correlation parameter ρ to −1 ≤ ρ ≤ 1. Hence, in order to apply extended correlations, we have to alter the equations. Changing the term √(1 − ρ^{2}) to (1 − ρ) in Equations (11) and (12) allows correlations beyond these limits. For ρ > 1, the dependent variable x_{i} or dz_{1}(t) has a higher than 100% positive dependence on M or dz_{2}(t), respectively, and a negative dependency on Z_{i} or dz_{3}, respectively. Vice versa, for ρ < −1, the dependent variable x_{i} or dz_{1}(t) has a higher than 100% negative dependence on M or dz_{2}(t), respectively, and a positive dependency on Z_{i} or dz_{3}, respectively. The higher flexibility comes at the cost of a loss of standard normality for x_{i} and dz_{1}. While the mean, skewness and excess kurtosis of dz_{1} and x_{i} would still be zero, the variance is unequal to 1 for all ρ outside {0, 1}. Equation (10) is not a good candidate for extended correlations, since they can change the sign of the equation.
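The loss of unit variance can be checked numerically. The sketch below is an illustration only: it assumes that the altered one-factor equation has the form x_{i} = ρM + (1 − ρ)Z_{i}, which is a reading of the text above and not a formula confirmed by the source, and the function names are hypothetical. Under this assumption the variance of x_{i} is ρ^{2} + (1 − ρ)^{2}, which equals 1 only for ρ = 0 and ρ = 1.

```python
import numpy as np


def altered_factor_draws(rho, n_draws, rng):
    """Draw x = rho * M + (1 - rho) * Z with independent standard normal M and Z.

    The functional form is an assumption made for this illustration.
    """
    m = rng.standard_normal(n_draws)  # systematic factor M
    z = rng.standard_normal(n_draws)  # idiosyncratic factor Z
    return rho * m + (1.0 - rho) * z


def altered_factor_variance(rho):
    """Theoretical variance of x under the assumed form: rho^2 + (1 - rho)^2."""
    return rho**2 + (1.0 - rho) ** 2


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    for rho in (-1.2, 0.0, 0.5, 1.0, 1.2):
        x = altered_factor_draws(rho, 200_000, rng)
        print(f"rho={rho:+.1f}  theoretical variance={altered_factor_variance(rho):.3f}  "
              f"sample variance={x.var():.3f}")
```

Because x is a linear combination of two independent normal variables, it remains normally distributed with mean zero; only its variance departs from 1.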

Extended correlations occur in finance if arbitrage opportunities exist. We will show this with the example of dispersion trading.

Dispersion trading emerged in the late 1990s from index arbitrage. In a long index arbitrage trade, the trader buys certain components (e.g. stocks) of an index (e.g. the S&P 500) and shorts the whole index. The index components are expected to outperform the index, so that $\sum_i w_i r_i > r_I$, where w_{i} are the component weights, r_{i} is the return of index component i, and r_{I} is the return of the index.

Dispersion trading applies the same idea, just with respect to component volatility and index volatility. The strategy can be well implemented with options. For details on dispersion trading see Willmott (2009) [

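Let’s briefly derive the core equation of dispersion trading. We start with the variance equation for two assets i and j,

$$\sigma_P^2 = w_i^2 \sigma_i^2 + w_j^2 \sigma_j^2 + 2\, w_i w_j\, \rho_{ij}\, \sigma_i \sigma_j .$$

Generalizing to an index I with N components and variance σ_{I}^{2}, we derive

$$\sigma_I^2 = \sum_{i=1}^{N} w_i^2 \sigma_i^2 + 2 \sum_{i=1}^{N-1} \sum_{j>i} w_i w_j\, \rho_{ij}\, \sigma_i \sigma_j \tag{13}$$

where w_{i} and w_{j} are weighting factors. Solving Equation (13) for the average implied pairwise correlation coefficient between assets i and j, ρ_{ij}, we derive

$$\rho_{ij} = \frac{\sigma_I^2 - \sum_{i=1}^{N} w_i^2 \sigma_i^2}{2 \sum_{i=1}^{N-1} \sum_{j>i} w_i w_j\, \sigma_i \sigma_j} \tag{14}$$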

Equation (14) shows the general concept of dispersion trading. The correlation between the components i and j, ρ_{ij}, is not derived from data points in a two-dimensional coordinate system as in the Pearson model, but from the relationship between the index implied volatility σ_{I} and the component implied volatilities σ_{i}.

The CBOE has disseminated the implied correlation index of the S&P 500 derived by Equation (14) since 2007 (ticker symbols ICJ, JCJ, and KCJ), see http://www.cboe.com/micro/impliedcorrelation. So far the CBOE has reported 3 extended correlations. Not surprisingly, they occurred at the height of the global financial crisis: On November 6, 2008, implied correlation was 100.8%, on November 13, 2008, 105.93% and on November 20, 2008, 103.04% for the KCJ, the January 2009 option maturity. This confirms that extended correlations exist in financial practice.
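To illustrate how an implied correlation above 100% can arise, the following Python sketch computes the average implied pairwise correlation from an index implied volatility and the implied volatilities of its components. It assumes the standard expression (index variance minus the weighted component variances, divided by the sum of the weighted cross terms); the function name `implied_correlation` and all input numbers are hypothetical and are not CBOE data.

```python
import numpy as np


def implied_correlation(index_vol, weights, component_vols):
    """Average implied pairwise correlation backed out of index and component volatilities."""
    w = np.asarray(weights, dtype=float)
    s = np.asarray(component_vols, dtype=float)
    own_variance = np.sum((w * s) ** 2)
    # 2 * sum_{i<j} w_i w_j s_i s_j equals (sum_i w_i s_i)^2 - sum_i (w_i s_i)^2
    cross_terms = np.sum(w * s) ** 2 - own_variance
    return (index_vol**2 - own_variance) / cross_terms


if __name__ == "__main__":
    weights = [0.4, 0.35, 0.25]          # hypothetical index weights
    component_vols = [0.30, 0.25, 0.20]  # hypothetical component implied volatilities
    for index_vol in (0.20, 0.2575, 0.27):
        rho = implied_correlation(index_vol, weights, component_vols)
        print(f"index vol {index_vol:.4f} -> implied correlation {rho:.4f}")
```

Under this expression the implied correlation exceeds 1 exactly when the index implied volatility is larger than the weighted sum of the component implied volatilities, which is the situation observed in November 2008.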

The extended correlations imply arbitrage opportunities: If ρ_{ij} > 1, from Equation (14), we observe that the implied index volatility σ_{I} exceeds the weighted sum of the component implied volatilities σ_{i}. In practice, ρ_{ij} > 1, not ρ_{ij} < −1, is plausible. For instance, in the severe crisis of 2008, traders assumed that many stocks would decline jointly. Hence, they bought puts on the whole index I, driving up the index volatility σ_{I}. As a result, implied correlations reached values ρ_{ij} > 1, i.e., they became not conventional but extended correlations.

In the following section, we derive a mathematical model for extended correlations constructing two basic types of extended correlation coefficients: complete correlation coefficients and total correlation coefficients.

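Let us first review the construction and properties of Pearson’s correlation coefficient, also called the population correlation coefficient if a whole population is modeled. For two variables X and Y, it is defined by

$$\rho_{X,Y} = \frac{\operatorname{cov}(X, Y)}{s_X\, s_Y}$$

where cov(X, Y) is the covariance of X and Y, while s_{X} and s_{Y} are the standard deviations of X and Y, respectively.

If we model random variables, it is possible to express the correlation coefficient in terms of expectations

$$\rho_{X,Y} = \frac{E\left[(X - \mu_X)(Y - \mu_Y)\right]}{s_X\, s_Y}$$

where μ_{X} and μ_{Y} are the expected values of X and Y.

In the case when random variables X and Y are represented by samples, the sample Pearson correlation coefficient, or sample correlation coefficient, r_{X,Y} is defined by

$$r_{X,Y} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{(n - 1)\, s_x\, s_y}$$

where $\bar{x}$ and $\bar{y}$ are the sample means and s_{x} and s_{y} are the sample standard deviations of the n observations x_{i} and y_{i}.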

The properties of the Pearson correlation coefficients are well-known:

1) The population correlation coefficient has the following boundaries

$$-1 \le \rho_{X,Y} \le 1$$

2) The sample correlation coefficient has the same boundaries

$$-1 \le r_{X,Y} \le 1$$

3) The population correlation coefficient is symmetric, i.e., ρ_{X,Y} = ρ_{Y,X}.

4) The sample correlation coefficient is symmetric, i.e., r_{X,Y} = r_{Y,X}.

5) The population correlation coefficient is invariant with respect to linear transformations of the two variables, namely, changing X to a + bX and Y to c + dY, where a, b, c, and d are constants with b, d > 0, does not change the population correlation coefficient ρ_{X,Y}.

6) The sample correlation coefficient is invariant with respect to linear transformations of the measurement scales, i.e., of the x-y-coordinates, namely, changing the coordinate x to a + bx and the coordinate y to c + dy, where a, b, c, and d are constants with b, d > 0, does not change the sample correlation coefficient r_{X,Y}.

7) The equality r_{X,Y} = 1 implies that a linear equation perfectly describes the relationship between X and Y, with all data points lying on a line for which Y increases as X increases.

8) The equality r_{X,Y} = −1 implies that a linear equation perfectly describes the relationship between X and Y, with all data points lying on a line for which Y decreases as X increases.

9) The equality r_{X}_{,Y} = 0 implies that there is no linear dependency between the variables X and Y.
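Several of the listed properties of the sample correlation coefficient can be verified directly. The following Python sketch is a minimal illustration with made-up data; the helper name `pearson_r` is hypothetical. It computes the sample coefficient from centered observations and checks symmetry, invariance under positive linear transformations, and the boundary cases ±1.

```python
import numpy as np


def pearson_r(x, y):
    """Sample Pearson correlation coefficient of two equally long samples."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()  # deviations from the sample mean of x
    dy = y - y.mean()  # deviations from the sample mean of y
    return np.sum(dx * dy) / np.sqrt(np.sum(dx**2) * np.sum(dy**2))


if __name__ == "__main__":
    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])  # made-up observations
    y = np.array([2.0, 1.0, 4.0, 3.0, 6.0])
    print("r(x, y)              :", pearson_r(x, y))
    print("r(y, x)              :", pearson_r(y, x))                      # symmetry
    print("r(3 + 2x, -1 + 0.5y) :", pearson_r(3 + 2 * x, -1 + 0.5 * y))   # invariance
    print("r(x, 2x + 1)         :", pearson_r(x, 2 * x + 1))              # increasing line
    print("r(x, -x)             :", pearson_r(x, -x))                     # decreasing line
```

The (n − 1) factors of the sample covariance and the sample standard deviations cancel, which is why the function works with plain sums of deviations.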

However, as demonstrated above, to better model financial reality, it is necessary to use extended correlation coefficients. At first, we consider population correlation coefficients.

To understand how extended correlation coefficients emerge in a mathematical context, we delineate a set A of aspects of the random variables X and Y assuming that it is possible to numerically represent each aspect A from A, i.e., there are variables X_{A} and Y_{A} that represent the aspect A in the variables X and Y.

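This allows us to define the aspect population correlation coefficient for variables X and Y as

$$\rho_{X,Y}^{A} = \frac{\operatorname{cov}(X_A, Y_A)}{s_{X_A}\, s_{Y_A}}$$

To find an integral correlation characteristic, it is possible to use the complete population correlation coefficient for variables X and Y, which is computed by the following formula:

$$\rho_{X,Y}^{\mathbf{A}} = \sum_{A \in \mathbf{A}} \rho_{X,Y}^{A}$$

When the set A consists of n aspects, the value $\rho_{X,Y}^{\mathbf{A}}$ is also called the n-aspect population correlation coefficient.

It is also possible to use the aspect sample correlation coefficient for variables X and Y, i.e., the sample correlation coefficient $r_{X,Y}^{A}$ of the samples that represent X_{A} and Y_{A}.

Then the complete sample correlation coefficient for variables X and Y is based on the aspect sample correlation coefficients and is computed by the following formula:

$$r_{X,Y}^{\mathbf{A}} = \sum_{A \in \mathbf{A}} r_{X,Y}^{A}$$

When the set A consists of n aspects, the value $r_{X,Y}^{\mathbf{A}}$ is also called the n-aspect sample correlation coefficient.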

Example 1. Let us consider a situation when a company wants to find the correlation between its profit and expenses. The company has three factories and wants to take into account all of them. To find the necessary correlation, a statistician represents the profit by the random variable X and the expenses by the random variable Y. Each factory represents a factor of these variables. Then it’s possible to use the complete correlation coefficient, which better represents relations between profit and expenses than the conventional correlation coefficient.

In this case, the profit of the first factory is represented by the random variable X_{1} and the expenses related to the first factory by the random variable Y_{1}. The profit of the second factory is represented by the random variable X_{2} and the expenses related to the second factory by the random variable Y_{2}. The profit of the third factory is represented by the random variable X_{3} and the expenses related to the third factory by the random variable Y_{3}. Then the complete (or 3-aspect) sample correlation coefficient for variables X and Y is equal to

For instance,

If

If

Example 2. Let us consider a situation when a biologist wants to find the correlation between traits of fathers and sons. Then it is possible to use the complete correlation coefficient, taking such features as the height, weight, color of eyes, IQ and education in the role of aspects of the compared trait.

In this case, the height of the father is represented by the random variable X_{1} and the height of the son by the random variable Y_{1}. The weight of the father is represented by the random variable X_{2} and the weight of the son by the random variable Y_{2}. The IQ of the father is represented by the random variable X_{3} and the IQ of the son by the random variable Y_{3}. In addition, it is possible to assign numerical values to the color of eyes and to education, for example, assigning 0 to the case when a person does not have any education and 10 when a person has a PhD. This makes it possible to represent the father’s color of eyes by the random variable X_{4} and the son’s color of eyes by the random variable Y_{4}. In a similar way, we represent the father’s education by the random variable X_{5} and the son’s education by the random variable Y_{5}.

Then the complete sample correlation coefficient for variables X and Y is equal to

For instance,

Some properties of complete correlation coefficients are the same as properties of conventional correlation coefficients, while other properties are different. Based on the properties 1 - 8 of conventional correlation coefficients considered above, we obtain the following results.

1) The complete (n-aspect) population correlation coefficient lies between −n and n.

2) The complete (n-aspect) sample correlation coefficient has the same boundaries.

3) The complete (n-aspect) population correlation coefficient is symmetric, i.e., it does not change when X and Y are interchanged.

4) The complete (n-aspect) sample correlation coefficient is symmetric, i.e., it does not change when X and Y are interchanged.

5) The complete (n-aspect) population correlation coefficient is invariant with respect to linear transformations of the two variables and their aspects, namely, changing X to a + bX and Y to c + dY, where a, b, c, and d are constants with b, d > 0, does not change the complete population correlation coefficient.

6) The complete (n-aspect) sample correlation coefficient is invariant with respect to linear transformations of the measurement scales, i.e., of the x-y-coordinates, namely, changing the coordinate x to a + bx and the coordinate y to c + dy, where a, b, c, and d are constants with b, d > 0, does not change the complete sample correlation coefficient.

7) The equality

8) The equality

Proofs of these properties are based on the properties 1 - 8 of conventional correlation coefficients.
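A small numerical illustration may help to see how a complete correlation coefficient leaves the interval [−1, 1]. The Python sketch below assumes that the complete sample correlation coefficient is the sum of the aspect sample correlation coefficients, which is consistent with boundaries of −n and +n but is an assumption of this sketch; the function name `complete_correlation` and the factory data are hypothetical.

```python
import numpy as np


def complete_correlation(x_aspects, y_aspects):
    """Sum of the Pearson sample coefficients over all aspects (assumed definition)."""
    if len(x_aspects) != len(y_aspects):
        raise ValueError("X and Y must have the same number of aspects")
    return float(sum(np.corrcoef(x, y)[0, 1] for x, y in zip(x_aspects, y_aspects)))


if __name__ == "__main__":
    # Made-up yearly profit (X) and expenses (Y) for three factories.
    profit = [
        [10.0, 12.0, 15.0, 14.0, 18.0],
        [5.0, 7.0, 6.0, 9.0, 11.0],
        [20.0, 19.0, 23.0, 25.0, 24.0],
    ]
    expenses = [
        [8.0, 9.0, 11.0, 11.0, 13.0],
        [4.0, 5.0, 5.0, 7.0, 8.0],
        [15.0, 15.0, 17.0, 19.0, 18.0],
    ]
    value = complete_correlation(profit, expenses)
    print("3-aspect sample correlation coefficient:", value)
    print("inside [-3, 3]:", -3.0 <= value <= 3.0)
```

With three strongly positively related aspects the sum is close to 3, a value that no conventional correlation coefficient can take.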

Another way to build extended correlation coefficients is based on factors of random variables. Let us assume that we have a set F of factors of the random variables X and Y and that it is possible to numerically represent each factor F from F, i.e., there are variables X_{F} and Y_{F} that represent the factor F in the variables X and Y.

This allows us to define the factor population correlation coefficient for variables X and Y as

To find an integral correlation characteristic, it is possible to use the total population correlation coefficient for variables X and Y, which is computed by the following formula:

When the set F consists of n factors, the value

It is also possible to use the factor sample correlation coefficient for variables X and Y

Then the total sample correlation coefficient for variables X and Y is based on the factor sample correlation coefficients and is computed by the following formula:

Example 3. Let us consider a situation when an investor wants to find the correlation between the prices of stocks of companies A and B. Then it is possible to use the total correlation coefficient, taking into account factors that influence stock prices, which could be the PE (price earnings ratio), EPS (earnings per share), and the dividend yield. This allows achieving a better representation of dependencies than the conventional correlation coefficient.

In this case, the PE of company A is represented by the random variable X_{1} and the PE of company B by the random variable Y_{1}. The EPS of company A is represented by the random variable X_{2} and the EPS of company B by the random variable Y_{2}. The dividend yield of company A is represented by the random variable X_{3} and the dividend yield of company B by the random variable Y_{3}. Then the total (or 3-factor) sample correlation coefficient for variables X and Y is equal to

For instance,

If

If

Some properties of total correlation coefficients are the same as properties of conventional correlation coefficients, while other properties are different. Based on the properties 1 - 8 of conventional correlation coefficients considered above, we obtain the following results.

1) The total (n-factor) population correlation coefficient has the following boundaries

2) The total (n-factor) sample correlation coefficient has the same boundaries

3) The total (n-factor) population correlation coefficient is symmetric, i.e.,

4) The total (n-factor) sample correlation coefficient is symmetric, i.e.,

5) The total (n-factor) population correlation coefficient is invariant with respect to linear transformations of the two variables, namely, changing X to a + bX and Y to c + dY, where a, b, c, and d are constants with b, d > 0, does not change the total population correlation coefficient

6) The total (n-factor) sample correlation coefficient is invariant with respect to linear transformations of the measurement scales, i.e., of the x-y-coordinates, namely, changing the coordinate x to a + bx and the coordinate y to c + dy, where a, b, c, and d are constants with b, d > 0, does not change the total sample correlation coefficient

7) The equality

8) The equality

Proofs of these properties are based on the properties 1 - 8 of conventional correlation coefficients.

The correlations created in this paper are extensions of the Pearson correlation model. While the Pearson correlation model is by far the most applied correlation model in finance, it suffers from several limitations. The main limitations are:

1) The Pearson coefficient can only evaluate linear associations between variables.

2) As a consequence of 1), the Pearson coefficient can only be meaningfully interpreted if the data distribution is approximately elliptical.

3) Outliers can distort the correlation results.

4) Different time frames can lead to very different results.

5) The causality has to be determined exogenously.

6) Spurious correlation can occur.

For a detailed discussion on ten limitations of the Pearson model, see Meissner 2015 [

When modeling financial processes with discrete stochastic processes, extended correlations, i.e. correlations which can be <−1 and >1, naturally occur. Rather than discarding the whole model or applying arbitrary boundaries, extended correlations can be implemented to utilize the model.

Correlations often serve as an input for more complex mathematical models such as copulas or Heston models. Utilization of extended correlations in these models adds flexibility, such as extending the dependencies between variables beyond unity.

Extended correlations occur in financial practice as the 2008 global financial crisis demonstrated. Consequently, there is a need to create a sound mathematical model for extended correlations.

We derive one type of extended correlation coefficients by delineating a set A of aspects of random variables and creating aspect correlation coefficients and combining them into the complete, or n-aspect, correlation coefficient. Some properties of the complete correlation coefficient are different from the traditional Pearson correlation coefficient, for example, boundaries of the coefficient become −n and +n, while other properties, such as symmetry, remain unchanged.

Another way to build extended correlation coefficients is based on factors of random variables. Taking a factor F from a set F, which consists of n factors of considered processes, we build the quantity

In conclusion, conventional mathematical correlation approaches cannot fully model all existing correlation values and processes. Therefore we introduce the model of extended correlations, which is more general than the conventional correlation models. Hence all correlations in finance (and other sciences), i.e. conventional, and correlations that are smaller than −1 or larger than 1, can be represented by extended correlations.

Mark Burgin, Gunter Meissner (2016) Extended Correlations in Finance. Journal of Mathematical Finance, 06, 178-188. doi: 10.4236/jmf.2016.61017