In this paper, we test the null hypothesis that the prices of stock index futures of China follow a random-walk process. Five hypothesis tests are applied to test the random-walk hypothesis (RWH). Each test uses both inter-day and intra-day returns. Compared with inter-day analysis, test results on intra-day data can describe the movements of intra-day markets more effectively, because intra-day analysis eliminates overnight news propagation, thus generating more precise conclusions on the intra-day market for intra-day traders. Three out of the five tests reject the RWH, whereas the other two cannot reject the RWH. Overall, the market is not fully efficient.
It has been studied for several decades that whether there exist laws that capture stock price movements, or the opposite, stock markets are efficient. Fama published his ground-breaking work on efficient market hypothesis (EMH) in 1970 [
The EMH is a key assumption when developing the models and theories of financial markets; it has implications on the profitability of various forms of analysis. Moreover, the EMH has been extensively studied and debated in the academic community. Those who believe in the EMH contend that all of the public information has been reflected in the current stock price. Thus, any attempt to forecast the future stock price based on past information is doomed to fail. However, the financial crises in 2007 to 2012 have induced renewed scrutiny and criticism of the hypothesis.
The possibility of market inefficiency and subsequent profit opportunities appear to portfolio managers and investors. Possibilities exist for investors to determine undervalued stocks at low prices in an inefficient market, such that investors can buy the stocks and make money. From a broader perspective, if a stock market is inefficient, then the pricing mechanism does not efficiently allocate capital within an economy, with negative effects for the overall economy. The issue of how efficiently markets incorporate information into security prices is central to investors and policymakers who deal with equity markets. Specifically, this paper focuses on one form of market efficiency, that is, the empirical validity of random-walk hypothesis (RWH) in security markets. The RWH asserts that prices follow random walks, and price changes can be modeled as independently distributed random variables; in other words, in a given time point, the size and direction of the subsequent price change are random with respect to the current available knowledge.
A large amount of literature has tested the RWH of various markets through various statistical tests. When testing the RWH, all works use the returns (one representation of price changes) calculated from the spreads of every two consecutive closing prices. This type of returns is typically called inter-day returns. This paper inherits the routine of testing inter-day returns; intra-day returns are considered as well. Intra-day returns are calculated as the spreads of closing and opening prices on every day divided by the opening price of that day. Intra-day returns are more suitable for intra-day strategy developers because the effects of rolling days and overnight information are eliminated.
This paper aims to test the null hypothesis that the stock index futures prices of China follow a random-walk process. If the hypothesis cannot be rejected, then no trading strategy can generate the expected positive returns. Otherwise, the inefficiency induces profitability. Five hypothesis tests are applied to test the RWH. Each test uses both inter-day and intra-day returns. The fifth test deploys the idea of variance-ratio test and incorporates high-frequency data into the model. Three out of the five tests reject the RWH, whereas the other two cannot reject the RWH. Overall, the market is not fully efficient.
The contribution of this paper lies in three aspects. First, we use inter-day returns to test the RWH, in order to eliminate the overnight news propagation. Second, we deploy the idea of variance-ratio to test the RWH. Third, we find that the Chinese market is not fully efficient.
The remainder of this paper is organized as follows. Section 2 presents a survey on the random walk tests of worldwide markets. Section 3 performs five regular tests on the RWH of the stock index futures of China. Section 4 discusses the indications of the inefficacy of the Chinese market. Section 5 concludes this paper.
Considerable research has examined the efficiency of stock market price formation. The empirical evidence for the validity of RWH is mixed.
The seminal work of Lo and MacKinlay [
Researchers who support the RWH through statistics include Buguk and Brorsen [
Aside from statistical methods, Gersdorff and Bacon [
A major strand of literature employs a wide array of statistical tools for opposing the EMH. Gallagher and Taylor [
Particularly, Azad et al. [
Another stream of research contradicts random walk theory by predicting the market movements or devising trading strategies that generate excess earnings. Huang et al. [
This study focuses on the normality of the stock index futures of China. The recent introduction of index futures has induced only a few studies. In addition, most studies are based on daily or weekly returns; this research classifies daily returns into intra-day returns and inter-day returns, and subsequently examines the two types of returns. Furthermore, this research proposes a new variant of the variance-ratio test that is based on high-frequency data (obtaining prices every half-second).
This section tests the observed distribution of prices of CSI 300 index futures contracts. The CSI 300 index is a capitalization-weighted stock market index designed to replicate the performance of top 300 stocks traded in the Shanghai and Shenzhen stock exchanges. The nominal value of a contract is derived by multiplying the points of CSI 300 by 300 RMB. The basic information of futures contracts is summarized in
As shown in
Underlying | CSI 300 index |
---|---|
Contract multiplier | 300 RMB |
Unit | Index point |
Tick size | 0.2 point |
Contract months | Monthly: current month, next month, next two calendar quarters (four contracts in total) |
Trading hours | 09:15-11:30, 13:00-15:15; 09:15-11:30, 13:00-15:00 (last trading day) |
Limit up/down | +/−10% of settlement price on the previous trading day |
Margin requirement | 12% of the contract value |
Last trading day | Third Friday of the contract month, postponed to the next business day if it falls on a public holiday |
Delivery day | Third Friday, same as “Last Trading Day” |
Settlement method | cash settlement |
Transaction code | IF |
Exchange | China Financial Futures Exchange |
Inception date | April 16, 2010 |
Capital requirement | 500,000 RMB |
calculation of inter-day returns. Assume that the closing prices of day d and d − 1 are denoted by c d and c d − 1 , respectively, and the daily return of day d are calculated as r d = ( c d − c d − 1 ) / c d − 1 . As the main contract changes at the end of a delivery day, c d and c d − 1 may refer to different contracts; day d in such a case is called a rolling day.
We collected the opening and closing prices of index futures main contracts automatically from a public available platform. The time range is from April 16, 2010 to February 13, 2014. We collected 923 opening and closing prices. In the variant of variance-ratio test, we use tick (500 ms) price data and the data is bought from a company.
Applying statistical models to test the distribution of recorded data is highly common. In various financial theories, price changes (returns) are assumed to follow normal distributions. Therefore, we test the distribution of intra-day and inter-day returns. Intra-day and inter-day returns are calculated by the spread between opening and closing prices within one day and the spread between two closing prices, respectively. Since a total of 923 opening and closing prices are found, we have 923 intra-day returns. Due to the existence of rolling days, we delete 46 rolling days and obtain 877 inter-day returns.
The Jarque-Bera test, which examines how the kurtosis and skewness of tested data effectively match a normal distribution, is used to investigate the validity of a normal assumption. The calculation equation is as follows:
J B = 1 6 [ S 2 + 1 4 ( K − 3 ) 2 ]
K = 1 n ∑ i = 1 n ( x i − x ¯ ) 4 [ 1 n ∑ i = 1 n ( x i − x ¯ ) 2 ] 2
S = 1 n ∑ i = 1 n ( x i − x ¯ ) 3 [ 1 n ∑ i = 1 n ( x i − x ¯ ) 2 ] 3 2
where n is the number of observations in the sample and x ¯ is the sample mean. The observations in this paper are the inter-day and intra-day returns. The null hypothesis is a joint hypothesis of the skewness being zero and the excess kurtosis being zero. Samples from a normal distribution have an expected skewness of zero and an expected excess kurtosis of zero, which is the same as a kurtosis of three.
Code | Return type | x ¯ | variance | K | S | JB statistic | p-value |
---|---|---|---|---|---|---|---|
2*IF | Intra-day | −0.00034 | 0.00021 | 5.60202 | 0.05962 | 261.21300 | <1E−15 |
Inter-day | −0.00041 | 0.00021 | 5.53576 | −0.01441 | 234.99600 | <1E−15 |
This section deploys a runs test to verify the serial independence of the returns. The runs test is a non-parametric test. Run pertains to a series of increasing values or a series of decreasing values, (i.e., a successive sequence with the same sign). The number of increasing (or decreasing) values is the length of the run. For example, in a series of + , + , − , − , − , + , + 4 positive signs and 3 negative signs are present, and the number of runs is 3.
The first step in a runs test is to count the number of runs in the data sequence. The literature provides several means of defining runs; however, the formulation in all of the cases must generate a dichotomous sequence of values. In this section, runs are defined in two forms. The first form defines returns larger than (less than) zero by positive (negative) signs. The second form defines relative returns larger than (less than) zero by positive (negative) signs, where relative returns are real returns minus the average of all returns. The second form of runs removes the effect of trend in the testing period. The first and second forms of runs are called absolute and relative forms, respectively.
In a random data set, the probability that the ( i + 1 ) th value is larger or smaller than the ith value follows a binomial distribution, which forms the basis of the runs test. Under the null hypothesis that a two-valued data sequence is random, the number of runs n r in a sequence of n elements is a random variable whose conditional distribution given the observation of n + positive values and n − negative values is approximately normal, with
mean μ r = 2 n + n − n + 1
variance σ r = 2 n + n − ( 2 n + n − − n ) n 2 ( n − 1 )
Z = n R − μ R σ R ≈ N ( 0 , 1 )
These parameters do not assume that the positive and negative elements have equal probabilities of occurring, but only assume that the elements are independent and identically distributed. If the number of runs is significantly higher or lower than expected, the hypothesis of statistical independence of the elements may be rejected. Theoretically, if the returns of two consecutive trading days have positive (negative) correlation, fewer (more) runs will be observed.
Return type | Runs form | n + | n − | μ r | σ r | n r | z-statistic |
---|---|---|---|---|---|---|---|
Inter-day | Absolute | 396 | 481 | 435.381 | 14.6595 | 441 | 0.383312 |
Inter-day | Relative | 415 | 462 | 438.241 | 14.7561 | 447 | 0.792604 |
Intra-day | Absolute | 421 | 502 | 458.946 | 15.0652 | 467 | 0.567811 |
Intra-day | Relative | 436 | 487 | 461.091 | 15.1358 | 471 | 0.833324 |
test. The test results, to some extent, infer the difficulty in determining a profitable trading strategy for daily returns. In addition, the z-statistics of intra-day type are larger than those of inter-day type, reconfirming that the price change within one day is less random than the overnight change.
This section applies an ADF test to verify the existence of a unit root. If the return of index futures follows a random walk, then the time series should have a unit root. The following regression is estimated for the inter-day return series of index futures:
Δ Y t = α + β t + γ Y t − 1 + ∑ i = 1 k θ i Δ Y t − i + ε t = log c t c t − 1 = α + β t + γ log c t − 1 + ∑ i = 1 k θ i log c t − i c t − i − 1 + ε t
where Y t = log c t is the log of daily closing price c t , α , β , γ and θ i are parameters, Δ represents first difference, k is the number of lagged terms, and ε t is white noise that is normally distributed with a mean of zero. The t-test statistic for the null hypothesis of a random walk is γ = 0 versus the alternative of γ ≠ 0 .
Next, a time series representing intra-day returns is devised for the ADF test. Closing prices contain intra-day and overnight drifts, which are shown as the first and second fractions on the right side of Equation (1).
c d = c d o d o d o d − 1 c d − 1 (1)
where o d and c d are the opening and closing prices on day d.
As the major consideration is the intra-day drift, only prices driven by intra- day drifts are retained. Let c ^ d denotes the price driven by intra-day drifts,
Model | T-statistic | p-value |
---|---|---|
Level | ||
without trend | −2.1157 | 0.2386 |
with trend | −2.8782 | 0.1701 |
First | ||
without trend | −31.1456 | <0.0001 |
with trend | −31.1286 | <0.0001 |
as the closing price of the first trading day is purely driven by the intra-day drift, therefore c ^ 1 = c 1 ; on the second day, the price driven by the intra-day drifts of
the first two days is c ^ 2 = c ^ 1 c 2 o 2 ; Similarly, c ^ 3 driven by intra-day drifts of the first three days is c ^ 3 = c ^ 2 c 3 o 3 . Generally, the closing price c ^ d driven by intra-
day drifts equals
c ^ d = c ^ d − 1 c d o d
In the ADF test, Y t is l o g c ^ d .
This section presents the single variance-ratio test. The idea of this test is that if a series Y t follows a random walk process, then the variance of q-period σ 2 ( q ) would be q times of the variance of 1-period σ 2 ( 1 ) [
σ 2 ( q ) = 1 m ∑ t = q n q ( Y t − Y t − q − q μ ^ ) 2
σ 2 ( 1 ) = 1 n q − 1 ∑ t = 1 n q ( Y t − Y t − 1 − μ ^ ) 2
V R ( q ) = σ 2 ( q ) σ 2 ( 1 )
where m = q ( n q − q + 1 ) ( 1 − q n q ) and μ ^ = 1 n q ( Y n q − Y 0 ) . Y 0 and Y n q are
l o g c 0 and l o g c n q , respectively. V R ( q ) is the variance ratio.
This paper deploys the variance-ratio test presented in the work of Liu and He [
Model | t-statistic | p-value |
---|---|---|
Level | ||
without trend | −1.973771 | 0.2986 |
with trend | −3.41058 | 0.0505 |
First | ||
without trend | −31.63682 | <0.0001 |
with trend | −31.62466 | <0.0001 |
developed under the hypothesis of homoskedasticity, where
ϕ ( q ) = 2 ( 2 q − 1 ) ( q − 1 ) 3 q ( n q ) .
z ( q ) = V R ( q ) − 1 [ ϕ ( q ) ] 1 2 ~ N ( 0,1 )
The second statistic value z * ( q ) is robust to heteroskedasticity:
z * ( q ) = V R ( q ) − 1 [ ϕ * ( q ) ] 1 2 ~ N ( 0,1 )
where ϕ * ( q ) = ∑ j = 1 q − 1 [ 2 ( q − 1 ) q ] 2 δ ^ ( j ) and δ ^ ( j ) = ∑ t = j + 1 n q ( Y t − Y t − 1 − μ ^ ) 2 ( Y t − j − Y t − j − 1 − μ ^ ) 2 [ ∑ t = 1 n q ( Y t − Y t − 1 − μ ^ ) 2 ] 2 .
Lags of q = 2 , q = 4 , q = 8 and q = 16 days are tested in this study.
In the test of intra-day returns, time series Y t takes l o g c ^ d .
This section extends the idea of the variance-ratio test. The volatilities computed from high frequency prices and daily prices are compared. Their features are unique because the high-frequency prices are at a tiny granularity.
q (days) | 2 | 4 | 8 | 16 |
---|---|---|---|---|
V R ( q ) | 0.491 | 0.2422 | 0.1243 | 0.0656 |
ϕ ( q ) | 0.001 | 0.0038 | 0.0095 | 0.021 |
z ( q ) | −15.4721 | −12.3116 | −8.99896 | −6.453 |
ϕ ∗ ( q ) | 0.0003 | 0.0017 | 0.0055 | 0.0134 |
z ∗ ( q ) | −30.9624 | −18.5105 | −11.7548 | −8.0657 |
q (days) | 2 | 4 | 8 | 16 |
---|---|---|---|---|
V R ( q ) | 0.4864 | 0.2389 | 0.1213 | 0.0607 |
ϕ ( q ) | 0.0011 | 0.0038 | 0.0095 | 0.0210 |
z ( q ) | −15.6132 | −12.3666 | −9.0296 | −6.4866 |
ϕ ∗ ( q ) | 0.0002 | 0.0013 | 0.0046 | 0.0115 |
z ∗ ( q ) | −34.6845 | −20.7901 | −12.9067 | −8.7582 |
short period, the bid1 and ask1 prices do not change. In
The covariance of price changes would be
Cov ( Δ p t , Δ p t + 1 ) = 1 8 ( 0 + 0 − s 2 + 0 + 0 − s 2 + 0 + 0 ) = − s 2 4 , which does not equal 0.
However, from a macro perspective, such a covariance should be 0 in a random process. Generally, the bid1- and ask1-prices are random walks, and the series of closing prices at ticks is formed by selecting either bid1- or ask1-price each time. When the tick is short (i.e., a half-second in the China stock market), the series of high-frequency stock prices does not follow a random walk, although daily closing prices might follow a random walk or the deviation from a random walk is small and therefore it is ignored. To conclude, when statistically investigating high-frequency prices, either bid1- or ask1-prices should be used, instead of closing prices. This paper uses bid1-prices.
Originally, the daily volatility σ can be estimated by daily data. For a data set of D ( D > 1 ) days, an estimate of σ would be
σ ^ D = ∑ d = 1 D ( r d − r ¯ ) 2 D − 1
where r d and r ¯ are daily returns on day d and average returns over D days, respectively. The expectation and variance of σ ^ D 2 are as follows:
exp [ σ ^ D 2 ] = σ 2
var ( σ ^ D 2 ) = 2 σ 4 D − 1
The null hypothesis is that the bid1-price follows a random walk. Under this hypothesis, σ can be estimated by high frequency data. The tick and one day are denoted by τ and T = N τ ( T ≫ τ ), respectively; the tick and daily return volatilities δ and σ subsequently satisfy:
σ = T τ δ (2)
The tick returns volatility δ can be estimated by
δ ^ = ∑ i = 1 N ( b i − b ¯ ) 2 N − 1 (3)
where b i pertains to the returns of bid1 price at the ith tick. Consequently, σ estimated by Equation (2) and (3) is as follows:
σ ^ H = T τ δ ^
Based on the preceding equation, it has
exp [ σ ^ H 2 ] = T τ exp [ δ ^ 2 ] = T τ δ 2 = σ 2
var ( σ ^ H 2 ) = T 2 τ 2 var ( δ ^ 2 ) = 2 σ 4 N − 1
In the real market data, τ is only 500 ms, and T is 270 minutes. Thus N = 32400 . D typically takes 30 days or one year (i.e., D = 30 or D = 250 ). Therefore, N ≫ D . If the stock price follows a random walk, then the estimate by high frequency data σ ^ H is better than that by daily data, in that the former has a smaller variance.
Finally, Welch’s t-test is applied. Welch’s test is an improvement of the student’s t-test, and this test is used to verify the hypothesis that two populations have equal means but have possibly unequal variances [
Welch’s t-test is defined by the following formula:
t = X ¯ 1 − X ¯ 2 s 1 2 N 1 + s 2 2 N 2
where X ¯ i , s i 2 and N i are the ith sample mean, sample variance, and sample size, respectively. The degree of freedom is calculated by the Welch-Statterh Waite equation:
ν ≈ ( s 1 2 N 1 + s 2 2 N 2 ) 2 s 1 4 N 1 2 ν 1 + s 2 4 N 2 2 ν 2
where ν i = N i − 1 is the degree of freedom associated with the ith variance estimate.
In this paper, σ ^ D are computed in a rolling manner with D = 30 , and σ ^ H is computed every trading day. The statistical results are summarized in
The five tests investigate the random of returns from different viewpoints. Jarque-Bera examines the kurtosis and skewness of the curve of returns. Runs test focuses on the direction of changes instead of the specific value of changes. ADF test verifies the existence of a unit root from the perspective of time series. Variance-ratio test and its variant check the relationship of variances.
Mean | Standard deviation | Sample size | |
---|---|---|---|
σ ^ D | 0.01405 | 0.004012 | 893 |
σ ^ H | 0.02256 | 0.003728 | 924 |
Difference | −37.72% | ||
t-statistic | −46.8 | ||
ν | 1794 |
Not all of the aforementioned tests can reject the randomness of the index futures market of China. However, given that some tests reject the randomness, the China market does deviate from the normality to a certain extent. The reasons for such a deviation are beyond the scope of this work; nevertheless, the characteristics of the China market provide several indications.
First, the financial system is unsound. Financial activities are frequently led by policies, such as the crash due to the circuit breaker in 2016. Individual investors are the most active players in the market [
Resolving the inefficiency entails four aspects. First, government can eliminate unnecessary control over the market and return the freedom to the market. This process can be conducted in stages. The market can be enlivened by encouraging and helping professional talents with overseas experience to work in China, devising various derivatives, and rationally introducing foreign investors. Second, a sound legal system is required. Government should establish a complete legal system, enhance laws, protect the interests of investors, and severely punish lawbreakers. Third, SEC should improve the supervision level and clear delisting conditions. Any activity of a company during the listing should be monitored. Fourth, the government can devote efforts to educating investors and changing the investor composition. Moreover, it should encourage the development of funds to allow individual investors to identify a better means of investing than managing capital by themselves.
This paper verifies the normality of the index futures market of China through five statistical tests. Both inter-day and intra-day returns are used. In the literature, intra-day returns are seldom used in statistical tests. The five tests are revised to suit the characteristics of intra-day returns. The fifth test, which is newly proposed, is based on the idea that volatilities calculated from high-frequency and daily data should be consistent if the market is random. However, the Welch’s t-test shows that the two volatilities are inconsistent, indicating the abnormality of the market.
In the Jarque-Bera test, single variance-ratio test, and variant of variance-ratio test, the normal hypothesis of the index futures market of China is rejected with 5% confidence; meanwhile, in runs and ADF tests, the randomness cannot be rejected. In particular, the Jarque-Bera statistics for intra-day and inter-day returns are 261.213 and 234.996, respectively. Although both types of returns reject the hypothesis of normal distribution with a confidence of 99.99%, the value of inter-day returns is approximately 10% less than that of intra-day returns, which suggests that the intra-day market is less random than the overnight market. This phenomenon indicates that the news (information) digestion during market closing increases the transparency of the market, and the daily opening price is as important as the daily closing price.
This research was partially supported by the National Natural Science Foundation of China [grant number 71602109]; Shanghai Pujiang Program [grant number 16PJC038]; and Funding Program for Teachers of Shanghai High Education [grant number ZZSD15095].
Wang, N., Chen, Y.B. and Wang, B. (2018) Normality of the Stock Index Futures of China. Journal of Mathematical Finance, 8, 86-101. https://doi.org/10.4236/jmf.2018.81007