**Applied Mathematics**

Vol.08 No.11(2017), Article ID:80735,19 pages

10.4236/am.2017.811122

Analysis of 48 US Industry Portfolios with a New Fama-French 5-Factor Model

Liuling Li^{1}, Xiao Rao^{2}, Wentao Zhou^{3}, Bruce Mizrach^{4 }

^{1}The Institute of Statistics and Econometrics, School of Economics, Nankai University, Tianjin, China

^{2}School of Business, Nankai University, Tianjin, China

^{3}Department of Economics, University of Wisconsin-Madison, Madison, WI, USA

^{4}The Economics Department, Rutgers University, New Brunswick, NJ, USA

Copyright © 2017 by authors and Scientific Research Publishing Inc.

This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).

http://creativecommons.org/licenses/by/4.0/

Received: August 12, 2017; Accepted: November 27, 2017; Published: November 30, 2017

ABSTRACT

In this paper, we analyze the US stock market with the new 5-factor model of Zhou and Li (2016) [1] . The data are 48 industry portfolios (Jul. 1963-Jan. 2017). Parameters are estimated by maximum likelihood estimation (MLE). The likelihood ratio test (LR) and the Kolmogorov-Smirnov test (KS) are used for model diagnostics, and models are compared with the Akaike Information Criterion (AIC). The results show that the Fama-French 5 factors are still alive, and that the new model of Zhou and Li (2016) [1] fits the data better than the one in Fama and French (2015) [2] .

**Keywords:**

Fama-French 5-Factor Model (FF5), Standardized Standard Asymmetric Exponential Power Distribution (SSAEPD), GARCH

1. Introduction

In 2015, Fama and French suggested a 5-factor model (denoted as FF5-Normal)^{1} to capture the market, size, value, profitability and investment patterns in stock returns, and found it better than their 3-factor model in [3] . Since then, many studies of the 5-factor model have appeared (see Table 1). These studies can be divided into the following 2 groups. The 1st group empirically tests the FF5-Normal model using different data. For example, [4] [5] find that the FF5-Normal model works well in India.

The 2nd group extends Fama-French’s 5-factor model. For example, [6] [7] choose Betting against Beta (BaB), Gross Profitability (GP) and 9 other factors to create a 14-factor model and find that the market factor is the most important factor for describing expected returns. [1] [8] [9] add the SSAEPD of [10] ( [11]

Table 1. Researches about the 5-Factor model for stock market.

Based on the new model of [1] , in this paper we test the following hypothesis: if different data such as the 48 industry portfolios^{3} are considered, can the new model of [1] still beat the 5-factor model in [2] ? To answer this question, simulation is first used to check the validity of the MatLab program of [1] ^{4}. Then, the 48 industry portfolios are analyzed. Data are downloaded from French’s Data Library, and the sample period is from Jul. 1963 to Jan. 2017. Parameters are estimated by Maximum Likelihood Estimation (MLE). The Likelihood Ratio test (LR) and the Kolmogorov-Smirnov test (KS) are used for model diagnostics. Model comparison is done with the Akaike Information Criterion (AIC).

Simulation results show that the MatLab program is valid and can be used for empirical analysis. Empirical results show that the 5 factors in [2] are still alive. The GARCH-type volatility and SSAEPD can successfully capture the excess kurtosis. The new model of [1] fits the data well and has a better in-sample fit than the 5-factor model of Fama and French.

The organization of this paper is as follows. Section 2 is the model and methodology. Section 3 presents the empirical results. Section 4 provides the conclusions and future extensions. The appendices contain additional information that may be helpful to understand our paper.

2. Model and Methodology

2.1. The FF5-SSAEPD-GARCH Model

[1] extend Fama-French’s 5-factor model based on the GARCH-type volatility in [13] and non-Normal error distribution of SSAEPD in [10] , and show their new model is better for 25 Fama-French portfolios. This new model in [1] is listed as follows (denoted as FF5-SSAEPD-GARCH).

$\begin{array}{l}{R}_{t}-{R}_{ft}={\beta}_{0}+{\beta}_{1}\left({R}_{mt}-{R}_{ft}\right)+{\beta}_{2}SM{B}_{t}+{\beta}_{3}HML{O}_{t}+{\beta}_{4}RM{W}_{t}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}+{\beta}_{5}CM{A}_{t}+{u}_{t},\text{\hspace{0.17em}}t=1,2,\cdots ,T,\end{array}$ (1)

${u}_{t}={\sigma}_{t}{z}_{t},{z}_{t}~SSAEPD\left(\alpha ,{p}_{1},{p}_{2}\right),$ (2)

${\sigma}_{t}^{2}={a}_{0}+{\displaystyle \sum _{i=1}^{r}}\text{\hspace{0.05em}}\text{\hspace{0.05em}}{a}_{i}{u}_{t-i}^{2}+{\displaystyle \sum _{i=1}^{s}}\text{\hspace{0.05em}}\text{\hspace{0.05em}}{b}_{i}{\sigma}_{t-i}^{2}.$ (3)

${\displaystyle \sum _{i=1}^{\mathrm{max}\left(r,s\right)}}\left({a}_{i}+{b}_{i}\right)<1,{a}_{0}>0,{a}_{i}\ge 0,{b}_{i}\ge 0.$ (4)

where $\theta =\left({\beta}_{0},{\beta}_{1},{\beta}_{2},{\beta}_{3},{\beta}_{4},{\beta}_{5},{a}_{0},{\left\{{a}_{i}\right\}}_{i=1}^{r},{\left\{{b}_{i}\right\}}_{i=1}^{s},\alpha ,{p}_{1},{p}_{2}\right)$ is the parameter vector to be estimated. T is the sample size. The error term ${z}_{t}$ is distributed as the Standardized Standard Asymmetric Exponential Power Distribution (SSAEPD) proposed by Zhu and Zinde-Walsh. ${\sigma}_{t}$ is the conditional standard deviation, i.e., volatility.

${R}_{t}$ is the return on the stock portfolio. ${R}_{ft}$ is the risk-free return. ${R}_{mt}$ is the value-weighted market return. $SM{B}_{t}$ is the return of small minus big. $RM{W}_{t}$ stands for the return of robust minus weak. $CM{A}_{t}$ stands for the return of conservative minus aggressive. $HML{O}_{t}$ is the return of high minus low orthogonalized^{5}, which is the sum of the intercept and the residual from the regression of $HM{L}_{t}$ on ${R}_{mt}-{R}_{ft}\mathrm{,}SM{B}_{t}\mathrm{,}RM{W}_{t}\mathrm{,}CM{A}_{t}$ .

In particular, with ${\left\{{a}_{i}=0\right\}}_{i=1}^{r},{\left\{{b}_{i}=0\right\}}_{i=1}^{s},\alpha =0.5,{p}_{1}={p}_{2}=2$ , the FF5-SSAEPD-GARCH model reduces to Fama-French’s 5-factor model. In the following sections we compare these two models.
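To make Equations (3) and (4) concrete, the following is a minimal Python sketch of the GARCH(1,1) case (r = s = 1). It is an illustration only and not the MatLab program of [1] : the function names, the use of an externally supplied starting variance `sigma2_init`, and the choice of Python are all assumptions made for this example.

```python
def garch_constraints_ok(a0, a, b):
    """Check the restrictions of Equation (4).

    a0 is the constant, a = [a_1, ..., a_r] the ARCH coefficients and
    b = [b_1, ..., b_s] the GARCH coefficients.
    """
    return (a0 > 0
            and all(x >= 0 for x in a)
            and all(x >= 0 for x in b)
            and sum(a) + sum(b) < 1)


def garch11_variances(u, a0, a1, b1, sigma2_init):
    """Conditional variances for residuals u under GARCH(1,1).

    sigma2[0] is set to sigma2_init (an assumed starting value); for t >= 1,
    sigma2[t] = a0 + a1 * u[t-1]**2 + b1 * sigma2[t-1], as in Equation (3).
    """
    sigma2 = [sigma2_init]
    for t in range(1, len(u)):
        sigma2.append(a0 + a1 * u[t - 1] ** 2 + b1 * sigma2[t - 1])
    return sigma2


def standardized_residuals(u, sigma2):
    """z_t = u_t / sigma_t, the quantity assumed to follow SSAEPD in Equation (2)."""
    return [ut / s2 ** 0.5 for ut, s2 in zip(u, sigma2)]
```

Setting `a1 = b1 = 0` gives a constant variance `a0` from the second observation onward, which is the homoskedastic special case mentioned above.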

2.2. MLE

Maximum Likelihood Estimation (MLE) is used to estimate the model above. The likelihood function is

$L\left({\left\{{R}_{t}-{R}_{ft},{R}_{mt}-{R}_{ft},SM{B}_{t},HML{O}_{t},RM{W}_{t},CM{A}_{t}\right\}}_{t=1}^{T};\theta \right)={\displaystyle \prod _{t=1}^{T}}f\left({R}_{t}-{R}_{ft}\right)$ (5)

$={\displaystyle \prod _{t=1}^{T}}\{\begin{array}{ll}\frac{\delta}{\eta}\left(\frac{\alpha}{{\alpha}^{\ast}}\right)K\left({p}_{1}\right)\mathrm{exp}\left(-\frac{1}{{p}_{1}}{\left|\frac{\omega +\delta {z}_{t}}{2{\alpha}^{\ast}}\right|}^{{p}_{1}}\right),\hfill & {z}_{t}\le -\frac{\omega}{\delta},\hfill \\ \frac{\delta}{\eta}\left(\frac{1-\alpha}{1-{\alpha}^{\ast}}\right)K\left({p}_{2}\right)\mathrm{exp}\left(-\frac{1}{{p}_{2}}{\left|\frac{\omega +\delta {z}_{t}}{2\left(1-{\alpha}^{\ast}\right)}\right|}^{{p}_{2}}\right),\hfill & {z}_{t}>-\frac{\omega}{\delta}.\hfill \end{array}$ (6)

where

${z}_{t}=\frac{{R}_{t}-{R}_{ft}-{\beta}_{0}-{\beta}_{1}\left({R}_{mt}-{R}_{ft}\right)-{\beta}_{2}SM{B}_{t}-{\beta}_{3}HML{O}_{t}-{\beta}_{4}RM{W}_{t}-{\beta}_{5}CM{A}_{t}}{{\sigma}_{t}},$ (7)

${\sigma}_{t}^{2}={a}_{0}+{\displaystyle \sum _{i=1}^{r}}\text{\hspace{0.05em}}\text{\hspace{0.05em}}{a}_{i}{u}_{t-i}^{2}+{\displaystyle \sum _{i=1}^{s}}\text{\hspace{0.05em}}\text{\hspace{0.05em}}{b}_{i}{\sigma}_{t-i}^{2}.$ (8)

3. Empirical Analysis

3.1. Data

Different from [1] , the data we analyze are the monthly returns of 48 industry portfolios for the US stock market downloaded from French’s Data Library, which include Agriculture, Food, Real Estate, Finance, etc. The sample period is from 1963:07 to 2017:01. The descriptive statistics of the sample data are calculated by MatLab and listed in Table 2. For each portfolio, the skewness (except for one portfolio, the “Ships’’ industry) is not 0 and the kurtosis is more than 3. The p-value of the Jarque-Bera test for each portfolio is 0, which is smaller than the 5% significance level. Hence, we reject the null hypothesis and conclude that the asset returns do not follow the Normal distribution. Thus, the non-Normal error assumption of SSAEPD might be able to fit the data better.

Table 2. Descriptive Statistics (1963:07-2017:01).

Notes: The sample period of Hlth is 1969:7-2017:01 due to data availability; Mea. = mean, Med. = median, Max. = maximum, Min. = minimum, St Dev. = standard deviation, Ske. = skewness, Kur. = kurtosis, P = P-value of Jarque-Bera Test.
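As an illustration of how the skewness, kurtosis and Jarque-Bera p-value of the kind reported in Table 2 can be computed, here is a minimal Python sketch. It assumes the usual population-moment definitions and the asymptotic chi-square distribution with 2 degrees of freedom; the paper's own numbers come from MatLab, whose conventions may differ slightly. The function name `jarque_bera` is hypothetical.

```python
import math


def jarque_bera(x):
    """Return (skewness, kurtosis, JB statistic, asymptotic p-value).

    Moments use the 1/n (population) convention. Under normality the JB
    statistic is asymptotically chi-square with 2 degrees of freedom, whose
    survival function is exp(-JB / 2).
    """
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    m4 = sum((v - mean) ** 4 for v in x) / n
    skew = m3 / m2 ** 1.5
    kurt = m4 / m2 ** 2
    jb = n / 6.0 * (skew ** 2 + (kurt - 3.0) ** 2 / 4.0)
    p_value = math.exp(-jb / 2.0)
    return skew, kurt, jb, p_value
```

A Normal sample has skewness near 0 and kurtosis near 3, so large departures in either direction inflate the statistic and push the p-value toward 0.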

3.2. Estimation Results

The estimates for our new model are displayed in Table 3. We find that our model can successfully capture the skewness, fat tails and excess kurtosis of the data. More specifically, 46 out of 48 estimates of the skewness parameter $\alpha $ are not equal to 0.5, which captures the skewness in the data. 84 out of 96 estimates for the tail parameters ${p}_{i}\left(i=1,2\right)$ are smaller than 2, which suggests that portfolio returns are fat-tailed. Besides, the estimates of the tail parameters ${p}_{1}$ and ${p}_{2}$ (except for one portfolio, the “Other’’ industry) are not equal to each other, which documents the asymmetric fat-tailedness. And 28 out of 48 portfolios have bigger estimates for the left tail parameter ${p}_{1}$ , which means that these returns tend to have thinner left tails.

3.3. Model Diagnostics

To test the significance of coefficients in FF5-SSAEPD-GARCH, the Likelihood Ratio test (LR) is applied^{6}, which is calculated using Equation (9).

$LR=-2\mathrm{ln}\left(\text{likelihood for null}\right)+2\mathrm{ln}\left(\text{likelihood for alternative}\right)$ (9)
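Equation (9) can be turned into a p-value as in the following Python sketch. It assumes the standard asymptotic result that LR is chi-square distributed with degrees of freedom equal to the number of restrictions; the paper does not state which reference distribution its MatLab code uses, and restrictions on the boundary of the parameter space (for example a GARCH coefficient equal to 0) are known to be a non-standard case. The function names and the log-likelihood values in the usage note are hypothetical.

```python
import math


def chi2_sf(x, df):
    """P(X > x) for a chi-square variable with integer df >= 1.

    Uses the recurrence Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1)
    with y = x / 2, starting from Q(1/2, y) = erfc(sqrt(y)) for odd df
    or Q(1, y) = exp(-y) for even df.
    """
    if x <= 0:
        return 1.0
    y = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-y)
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))
    while a < df / 2.0 - 1e-9:
        q += math.exp(a * math.log(y) - y - math.lgamma(a + 1.0))
        a += 1.0
    return q


def lr_test(loglik_null, loglik_alt, n_restrictions):
    """Return (LR statistic, asymptotic p-value) following Equation (9)."""
    lr = -2.0 * loglik_null + 2.0 * loglik_alt
    return lr, chi2_sf(lr, n_restrictions)
```

For example, `lr_test(-1010.0, -1000.0, 5)` corresponds to a joint test of five slope coefficients with made-up log-likelihoods; the statistic is 20 and the null is rejected at the 5% level.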

3.3.1. Tests for Parameter Restrictions

• Tests for Parameters in the Mean Equation

The P-values of LR are listed in Table 4. The null hypothesis of the joint significance test is ${H}_{0}:{\beta}_{1}={\beta}_{2}={\beta}_{3}={\beta}_{4}={\beta}_{5}=0$ . The P-values of the joint significance test for all the 48 portfolios are 0, which means ${\beta}_{1}\mathrm{,}{\beta}_{2}\mathrm{,}{\beta}_{3}\mathrm{,}{\beta}_{4}$ and ${\beta}_{5}$ are jointly statistically significant at the 5% significance level.

The individual significance tests show that, at the 5% significance level, the coefficient ${\beta}_{1}$ is statistically significant in all 48 portfolios; 40/48, 28/48, 38/48 and 32/48 portfolios have a statistically significant coefficient ${\beta}_{2}\mathrm{,}{\beta}_{3}\mathrm{,}{\beta}_{4}$ and ${\beta}_{5}$ , respectively. As for the coefficient ${\beta}_{0}$ (i.e., the Alpha return), it is statistically significant in 31 out of the 48 portfolios at the 5% significance level. Thus, most of the 48 portfolios seem to be able to earn the $Alpha$ returns.

As a whole, since the 5 factors are significant in most of the 48 portfolios, we conclude that with non-Normal errors such as SSAEPD and GARCH-type volatilities, the Fama-French 5 factors are still alive.

• Tests for Parameters in the GARCH Equation

In this part, some restrictions on the parameters in the GARCH equation are tested with the Likelihood Ratio test (LR), and the results are listed in Table 5. Results show that the GARCH-type volatility should be included in the Fama-French 5-factor model. For instance, we do the joint significance test for the hypothesis ${H}_{0}:b=c=0$ . For 46 out of the 48 portfolios, the p-value of the LR test is smaller than the 5% significance level, which means the GARCH-type volatilities are necessary. As for the individual hypotheses, most P-values of LR are smaller than the 5% significance level. To be specific, the ARCH term ( ${H}_{0}:b=0$ ) is significant in 39 out of 48 portfolios and the GARCH term ( ${H}_{0}:c=0$ ) is significant in 27 out of 48 portfolios.

Table 3. Estimates for FF5-SSAEPD-GARCH (Monthly, 1963:07-2017:01).

Notes: The data period of Hlth is 1969:7-2017:01 due to the data availability.

Table 4. P-values of Likelihood Ratio Test (LR).

Notes: Sample of Hlth is from 1969:7 to 2017:01 due to the availability of data. TJ means test of joint hypothesis of ${H}_{0}:{\beta}_{1}={\beta}_{2}={\beta}_{3}={\beta}_{4}={\beta}_{5}=0$ . T0 means ${H}_{0}:{\beta}_{0}=0$ . T1 means ${H}_{0}:{\beta}_{1}=0$ . T2 means ${H}_{0}:{\beta}_{2}=0$ . T3 means ${H}_{0}:{\beta}_{3}=0$ . T4 means ${H}_{0}:{\beta}_{4}=0$ . T5 means ${H}_{0}:{\beta}_{5}=0$ .

Table 5. P-values of likelihood ratio test (LR).

Notes: The data period of Hlth is 1969:7-2017:01 due to the lack of data from 1963:7-1969:6. T8 means ${H}_{0}:b=c=0$ . T9 means ${H}_{0}:a=0$ . T10 means ${H}_{0}:b=0$ . T11 means ${H}_{0}:c=0$ . T12 means ${H}_{0}:\alpha =0.5,{p}_{1}={p}_{2}=2$ . T13 means ${H}_{0}:\alpha =0.5$ . T14 means ${H}_{0}:{p}_{1}={p}_{2}=2$ . T15 means ${H}_{0}:{p}_{1}=2$ . T16 means ${H}_{0}:{p}_{2}=2$ .

• Tests for Parameters in SSAEPD

We also run significance tests for the parameters in the SSAEPD, and the results show strong non-Normality. For example, for the hypothesis ${H}_{0}:\alpha =0.5,{p}_{1}={p}_{2}=2$ , 39 out of 48 p-values are smaller than the 5% significance level, which means that the Normal error assumption is not supported by most of our data. Besides, asymmetry is documented ( ${H}_{0}:\alpha =0.5$ is rejected by 14 out of 48 portfolios), and non-normality is found ( ${H}_{0}:{p}_{1}=2$ is rejected by 21 out of 48 portfolios and ${H}_{0}:{p}_{2}=2$ is rejected by 29 out of 48 portfolios).

3.3.2. Residual Check

In this subsection, the residuals of the two models are checked with both the Kolmogorov-Smirnov test and graphs. Our results show that 41 out of the 48 portfolios have residuals which do follow SSAEPD. That is, the FF5-SSAEPD-GARCH model is adequate for most of the 48 industry portfolios. But the FF5-Normal model is not adequate for the data, since all the 48 portfolios have residuals which do not follow the Normal error distribution.

• Kolmogorov-Smirnov Test for Residuals

To check the residuals, the Kolmogorov-Smirnov test (KS)^{7} is employed. The p-values of the KS test are displayed in Table 6. The p-values of the KS test^{8} show that the residuals from the new model do follow SSAEPD. For instance, the p-value of the Agriculture portfolio is 0.79, greater than 5%, which means that at the 5% significance level the null hypothesis is not rejected and the residuals from FF5-SSAEPD-GARCH do follow the SSAEPD. Similarly, the null hypothesis cannot be rejected for 40 other portfolios.

Then, we apply the KS test to the residuals from the FF5-Normal model^{9}. The p-values of the KS test are also listed in Table 6. All of the 48 portfolios have p-values smaller than 0.05, which means these 48 industry portfolios reject the null. Hence, the error terms of the portfolios do not follow the Normal distribution, and the FF5-Normal model is not adequate for the data.
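The one-sample KS check described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the procedure actually coded in MatLab: the p-value uses Kolmogorov's asymptotic series, which ignores the fact that the reference distribution's parameters were estimated from the same data, and the helper names are hypothetical. Any CDF can be passed in; `normal_cdf` is included only as an example.

```python
import math


def ks_statistic(sample, cdf):
    """D = sup |F_n(x) - F(x)| for the empirical CDF F_n of `sample`."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        # Compare F with the empirical CDF just after and just before x.
        d = max(d, (i + 1) / n - f, f - i / n)
    return d


def ks_pvalue_asymptotic(d, n, terms=100):
    """Asymptotic p-value 2 * sum_{k>=1} (-1)**(k-1) * exp(-2 k^2 n d^2)."""
    lam = math.sqrt(n) * d
    if lam < 0.2:
        return 1.0  # the series converges slowly here; the limit is 1
    s = sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
            for k in range(1, terms + 1))
    return min(1.0, max(0.0, 2.0 * s))


def normal_cdf(x, mu=0.0, sigma=1.0):
    """CDF of Normal(mu, sigma^2), usable as the `cdf` argument."""
    return 0.5 * math.erfc(-(x - mu) / (sigma * math.sqrt(2.0)))
```

A p-value above 0.05 means the null hypothesis that the residuals follow the specified distribution is not rejected, which is the decision rule given in the notes.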

• PDFs of Residuals

By visual inspection, the PDF of the residuals is compared with the theoretical PDFs. Taking the Agriculture portfolio as an example, Figure 1(a) plots the probability density function (PDF) of the estimated residuals ${\widehat{z}}_{t}$ in FF5-SSAEPD-GARCH and that of $SSAEPD\left(\widehat{\alpha},{\widehat{p}}_{1},{\widehat{p}}_{2}\right)$ . These curves are very close to each other, which suggests the residuals are distributed as SSAEPD. Hence, the FF5-SSAEPD-GARCH model fits the data well.

Similarly, the probability density function (PDF) of the estimated residuals ${\widehat{u}}_{t}$ in FF5-Normal and that of $Normal\left(\widehat{\mu},{\widehat{\sigma}}^{2}\right)$ are shown in Figure 1(b). There are big differences between these two curves, which suggests the residuals do not follow the Normal distribution.

Table 6. P-values of KS Test for Residuals.

Notes: 1. The data period of Hlth is 1969:7-2017:01 due to the data availability. 2. * means the data do not follow the specified distribution at the 5% significance level. M1 = FF5-SSAEPD-GARCH, M2 = FF5-Normal.

3.4. Model Comparison

In this subsection, we compare the model in [1] with the 5-factor model of Fama and French. The Akaike Information Criterion (AIC) is used as the model selection criterion. Table 7 lists the AIC values. We find that 47 out of 48 AIC values of the FF5-SSAEPD-GARCH model are smaller than those of the FF5-Normal model. Hence, we conclude that the new model we used (FF5-SSAEPD-GARCH) is better than the 5-factor model of Fama and French.
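The AIC comparison can be sketched as below, assuming the standard definition AIC = 2k - 2 ln L (the paper does not spell out its formula, so any scaling by the sample size is not covered here). The parameter counts follow the parameter vectors given in the text: 12 for FF5-SSAEPD-GARCH(1,1) (6 betas, 3 GARCH parameters, 3 SSAEPD parameters) and 8 for FF5-Normal (6 betas, $\mu$ and $\sigma$). The log-likelihood values in the usage note are made up for illustration.

```python
def aic(loglik, n_params):
    """Akaike Information Criterion, AIC = 2k - 2 ln L; smaller is better."""
    return 2 * n_params - 2 * loglik


def preferred_model(loglik_m1, k_m1, loglik_m2, k_m2):
    """Return 'M1' if M1 has the strictly smaller AIC, otherwise 'M2'."""
    return "M1" if aic(loglik_m1, k_m1) < aic(loglik_m2, k_m2) else "M2"
```

With hypothetical log-likelihoods of -1000 for M1 and -1010 for M2, `preferred_model(-1000.0, 12, -1010.0, 8)` selects M1: the four extra parameters cost 8 AIC points, less than the 20-point gain in 2 ln L.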

4. Conclusions

In this paper, we empirically test the new 5-factor model suggested in [1] . Their new model generalizes the 5-factor model in [2] by introducing a non-normal error term and time-varying volatilities. The non-normal error assumption they use is the SSAEPD of [10] and the time-varying volatility is the GARCH model of [13] . For comparison, monthly US stock returns of 48 industry portfolios (1963:07-2017:01) are analyzed.

Figure 1. Comparison of PDFs. (a) PDFs of the Residuals (FF5-SSAEPD-GARCH) and $SSAEPD\left(\widehat{\alpha},{\widehat{p}}_{1},{\widehat{p}}_{2}\right)$ ; (b) PDFs of the Residuals (FF5-Normal) and $Normal\left(\widehat{\mu},{\widehat{\sigma}}^{2}\right)$ .

Table 7. AIC Values (Monthly, 1963:07-2017:01).

Notes: 1. The data period of Hlth is 1969:7-2017:01 due to the data availability. 2. Numbers with * are smaller AIC values. M1 = FF5-SSAEPD-GARCH, M2 = FF5-Normal.

The method of Maximum Likelihood Estimation (MLE) is used for parameter estimation. The Likelihood Ratio test (LR) is used to test the hypotheses of parameter restrictions. The Kolmogorov-Smirnov test (KS) is used to check residuals. The Akaike Information Criterion (AIC) is used to compare models.

Simulation results show that the MatLab program for the new 5-factor model in [1] is valid. Empirical results show that 1) this new model can capture the skewness, fat tails and asymmetric fat-tailedness in the data; 2) the Fama-French 5 factors are still alive even if non-normal errors and GARCH-type volatilities are considered, since the 5 factors are statistically significant in most of the 48 portfolios; and 3) the FF5-SSAEPD-GARCH model fits the data much better than the 5-factor model in [2] .

Future extensions include, but are not limited to, the following. First, we can examine our results with different data. Second, we can compare our results with those from other models such as the ARIMA model. Last but not least, other factors can be introduced into this model to explain the $Alpha$ returns of industry portfolios.

Cite this paper

Li, L.L., Rao, X., Zhou, W.T. and Mizrach, B. (2017) Analysis of 48 US Industry Portfolios with a New Fama-French 5-Factor Model. Applied Mathematics, 8, 1684-1702. https://doi.org/10.4236/am.2017.811122

References

- 1. Zhou, W. and Li, L. (2016) A New Fama-French 5-Factor Model Based on SSAEPD Error and GARCH-Type Volatility. Journal of Mathematical Finance, 6, 711-727. https://doi.org/10.4236/jmf.2016.65050
- 2. Fama, E.F. and French, K.R. (2015) A Five-Factor Asset Pricing Model. Journal of Financial Economics, 116, 1-22. https://doi.org/10.1016/j.jfineco.2014.10.010
- 3. Fama, E.F. and French, K.R. (1993) Common Risk Factors in Returns on Stocks and Bonds. Journal of Financial Economics, 33, 3-56. https://doi.org/10.1016/0304-405X(93)90023-5
- 4. Harshita, S.S. and Yadav, S.S. (2015) Indian Stock Market and the Asset Pricing Models. Procedia Economics & Finance, 30, 294-304. https://doi.org/10.1016/S2212-5671(15)01297-6
- 5. Gharghori, P., Chan, H. and Faff, R. (2007) Are the Fama-French Factors Proxying Default Risk? Australian Journal of Management, 32, 223-249. https://doi.org/10.1177/031289620703200204
- 6. Hou, K., Xue, C. and Zhang, L. (2015) Digesting Anomalies: An Investment Approach. Review of Financial Studies, 28, 650-705.
- 7. Harvey, C.R. and Liu, Y. (2015) Lucky Factors. Social Science Electronic Publishing, San Francisco.
- 8. Li, L., Zhu, Q.Y. and Mu, Y. (2017) Analysis of the Sector of Software & Computer Services with a New Carhart 4-Factor Model. Journal of Mathematical Finance, 7, 65-82.
- 9. Li, L., Zhu, Q. and Yang, Y. (2017) A New 5-factor Model Based on the EGARCH-Typed Volatilities and the SSAEPD Errors. Journal of Multidisciplinary, 4, 83-98.
- 10. Zhu, D. and Zinde-Walsh, V. (2009) Properties and Estimation of Asymmetric Exponential Power Distribution. Journal of Econometrics, 148, 86-99. https://doi.org/10.1016/j.jeconom.2008.09.038
- 11. Subbotin, M.T. (1923) On the Law of Frequency of Error. Матем. сб., 31, 296-301.
- 12. Azzalini, A. (1985) A Class of Distributions Which Includes the Normal Ones. Scandinavian Journal of Statistics, 12, 171-178.
- 13. Engle, R.F. and Bollerslev, T. (1986) Modeling the Persistence of Conditional Variances. Econometric Reviews, 5, 1-50. https://doi.org/10.1080/07474938608800095

Appendix 1.

Four-digit SIC codes are used to assign firms to 48 industries. The variables defined in the 1st column in Table 8 are used as the dependent variables in this paper.

Appendix 2. Fama-French 5-Factor Model (FF5-Normal)

Equation (10) is the new 5-factor model (denoted as FF5-Normal) suggested by Fama and French (2015). And they show this model empirically outperforms Fama-French (1993)’s 3-factor model.

$\begin{array}{l}{R}_{t}-{R}_{ft}\mathrm{=}{\beta}_{0}+{\beta}_{1}\ast \left({R}_{mt}-{R}_{ft}\right)+{\beta}_{2}\ast SM{B}_{t}+{\beta}_{3}\ast HML{O}_{t}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}+{\beta}_{4}\ast RM{W}_{t}+{\beta}_{5}\ast CM{A}_{t}+{u}_{t}\mathrm{,}\text{\hspace{0.17em}}{u}_{t}~Normal\left(\mu \mathrm{,}{\sigma}^{2}\right)\mathrm{.}\end{array}$ (10)

where $\theta =\left({\beta}_{0},{\beta}_{1},{\beta}_{2},{\beta}_{3},{\beta}_{4},{\beta}_{5},\mu ,\sigma \right)$ are the parameters in this model and $t=1,2,\cdots ,T$ . ${R}_{t}$ is the return on the stock portfolio. ${R}_{ft}$ is the risk-free return. ${R}_{mt}$ is the value-weighted market return. $SM{B}_{t}$ is the return of small minus big. $RM{W}_{t}$ stands for the return of robust minus weak. $CM{A}_{t}$ stands for the return of conservative minus aggressive.

$HML{O}_{t}$ is the return of high minus low orthogonalized, which is the sum of the intercept and the residual from the regression of $HM{L}_{t}$ on ${R}_{mt}-{R}_{ft}\mathrm{,}SM{B}_{t}\mathrm{,}RM{W}_{t}\mathrm{,}CM{A}_{t}$ . The reason for using $HML{O}_{t}$ instead of $HM{L}_{t}$ is that Fama and French (2015) show $HM{L}_{t}$ (the high minus low book-to-market ratio) is redundant in the following 5-factor model.

$\begin{array}{c}{R}_{t}-{R}_{ft}={\beta}_{0}+{\beta}_{1}\ast \left({R}_{mt}-{R}_{ft}\right)+{\beta}_{2}\ast SM{B}_{t}+{\beta}_{3}\ast HM{L}_{t}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}+{\beta}_{4}\ast RM{W}_{t}+{\beta}_{5}\mathrm{*}CM{A}_{t}+{u}_{t}\mathrm{,}\end{array}$ (11)

${u}_{t}~Normal\left(\mu ,{\sigma}^{2}\right),t=1,2,\cdots ,T.$

Appendix 3. SSAEPD

If a random variable X is distributed as AEPD, we denote $X~AEPD\left(\mu \mathrm{,}\sigma \mathrm{,}\alpha \mathrm{,}{p}_{1}\mathrm{,}{p}_{2}\right)$ . If a random variable X is distributed as standard AEPD, we denote $X~SAEPD\left(\mu =0,\sigma =1,\alpha ,{p}_{1},{p}_{2}\right)$ or, in short, $X~SAEPD\left(\alpha \mathrm{,}{p}_{1}\mathrm{,}{p}_{2}\right)$ . If a random variable Z is distributed as standardized standard AEPD, we denote $Z~SSAEPD\left(\mu =0,\sigma =1,\alpha ,{p}_{1},{p}_{2}\right)$ or $Z~SSAEPD\left(\alpha \mathrm{,}{p}_{1}\mathrm{,}{p}_{2}\right)$ , with mean zero and variance 1. That is, $E\left(Z\right)=0$ , $Var\left(Z\right)=1$ . The brief history of SSAEPD is listed in Table 9.

The probability density function (PDF) of the Standardized Standard AEPD (SSAEPD) proposed by Zhu and Zinde-Walsh (2009)^{10} is

$f\left({z}_{t}|\beta \right)=\left\{\begin{array}{ll}\delta \left(\frac{\alpha}{{\alpha}^{*}}\right)K\left({p}_{1}\right)\mathrm{exp}\left(-\frac{1}{{p}_{1}}{\left|\frac{w+{z}_{t}\delta}{2{\alpha}^{*}}\right|}^{{p}_{1}}\right),\hfill & \text{if}\text{\hspace{0.17em}}{z}_{t}\le -\frac{w}{\delta},\hfill \\ \delta \left(\frac{1-\alpha}{1-{\alpha}^{*}}\right)K\left({p}_{2}\right)\mathrm{exp}\left(-\frac{1}{{p}_{2}}{\left|\frac{w+{z}_{t}\delta}{2\left(1-{\alpha}^{*}\right)}\right|}^{{p}_{2}}\right),\hfill & \text{if}\text{\hspace{0.17em}}{z}_{t}>-\frac{w}{\delta}.\hfill \end{array}\right.$ (12)

Table 8. Variable definitions for 48 industries.

Table 9. The History of the SSAEPD distribution.

Notes: EPD = Exponential Power Distribution; SEPD = Skewed Exponential Power Distribution; SSAEPD = Standardized Standard Asymmetric Exponential Power Distribution. This table is a revision of the one in Jin (2011).

where

${\alpha}^{\mathrm{*}}=\frac{\alpha K\left({p}_{1}\right)}{\alpha K\left({p}_{1}\right)+\left(1-\alpha \right)K\left({p}_{2}\right)}\mathrm{,}$ (13)

$K\left(p\right)=\frac{1}{2{p}^{1/p}\Gamma \left(1+1/p\right)},$ (14)

$\Gamma \left(x\right)={\displaystyle {\int}_{0}^{\infty}}{y}^{x-1}{\text{e}}^{-y}\text{d}y\mathrm{,}$ (15)

$w=\frac{1}{B}\left[{\left(1-\alpha \right)}^{2}\frac{{p}_{2}\Gamma \left(2/{p}_{2}\right)}{{\Gamma}^{2}\left(1/{p}_{2}\right)}-{\alpha}^{2}\frac{{p}_{1}\Gamma \left(2/{p}_{1}\right)}{{\Gamma}^{2}\left(1/{p}_{1}\right)}\right],$ (16)

$\begin{array}{c}{\delta}^{2}=\frac{1}{{B}^{2}}\{{\left(1-\alpha \right)}^{3}\frac{{p}_{2}^{2}\Gamma \left(3/{p}_{2}\right)}{{\Gamma}^{3}\left(1/{p}_{2}\right)}+{\alpha}^{3}\frac{{p}_{1}^{2}\Gamma \left(3/{p}_{1}\right)}{{\Gamma}^{3}\left(1/{p}_{1}\right)}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}-{\left[{\left(1-\alpha \right)}^{2}\frac{{p}_{2}\Gamma \left(2/{p}_{2}\right)}{{\Gamma}^{2}\left(1/{p}_{2}\right)}-{\alpha}^{2}\frac{{p}_{1}\Gamma \left(2/{p}_{1}\right)}{{\Gamma}^{2}\left(1/{p}_{1}\right)}\right]}^{2}\},\end{array}$ (17)

$B=\alpha K\left({p}_{1}\right)+\left(1-\alpha \right)K\left({p}_{2}\right).$ (18)

$\mu \in R$ , $\sigma >0$ , ${p}_{1}>0$ , ${p}_{2}>0$ , $\alpha \in \left(\mathrm{0,1}\right)$ . ${p}_{1}$ (or ${p}_{2}$ ) is the parameter controlling the left (or right) tail. $\alpha $ controls the skewness. The mean of ${z}_{t}$ is zero and its variance is 1. When $\alpha =0.5$ , ${p}_{1}={p}_{2}=2$ , SSAEPD can be reduced to Normal (0, 1).
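Equations (12)-(18) translate directly into code. The following Python sketch evaluates the SSAEPD density using only the standard library; it is an independent illustration written from the formulas above, not the authors' MatLab program, and the function names are hypothetical. The variable `omega` corresponds to $w$ in Equation (16).

```python
import math


def K(p):
    """Normalizing constant of Equation (14)."""
    return 1.0 / (2.0 * p ** (1.0 / p) * math.gamma(1.0 + 1.0 / p))


def ssaepd_constants(alpha, p1, p2):
    """Return (alpha_star, omega, delta) from Equations (13) and (16)-(18)."""
    B = alpha * K(p1) + (1.0 - alpha) * K(p2)
    alpha_star = alpha * K(p1) / B
    left = alpha ** 2 * p1 * math.gamma(2.0 / p1) / math.gamma(1.0 / p1) ** 2
    right = (1.0 - alpha) ** 2 * p2 * math.gamma(2.0 / p2) / math.gamma(1.0 / p2) ** 2
    omega = (right - left) / B
    second = ((1.0 - alpha) ** 3 * p2 ** 2 * math.gamma(3.0 / p2) / math.gamma(1.0 / p2) ** 3
              + alpha ** 3 * p1 ** 2 * math.gamma(3.0 / p1) / math.gamma(1.0 / p1) ** 3)
    delta = math.sqrt(second - (right - left) ** 2) / B
    return alpha_star, omega, delta


def ssaepd_pdf(z, alpha, p1, p2):
    """Density of SSAEPD(alpha, p1, p2) at z, following Equation (12)."""
    a_star, omega, delta = ssaepd_constants(alpha, p1, p2)
    x = omega + delta * z
    if x <= 0.0:  # equivalent to z <= -omega / delta because delta > 0
        scale = 2.0 * a_star
        return delta * (alpha / a_star) * K(p1) * math.exp(-abs(x / scale) ** p1 / p1)
    scale = 2.0 * (1.0 - a_star)
    return delta * ((1.0 - alpha) / (1.0 - a_star)) * K(p2) * math.exp(-abs(x / scale) ** p2 / p2)
```

With `alpha = 0.5` and `p1 = p2 = 2` the function reproduces the standard Normal density, which is the reduction stated above; for other parameter values, numerical integration of the density gives total mass 1, mean 0 and variance 1 up to discretization error.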

Appendix 4. Simulation Results

We check the MatLab program written by Zhou and Li (2016) with the following simulation and find that the program is valid and can be used to analyze our empirical data. The FF5-SSAEPD-GARCH(1,1) is simulated as follows.

${R}_{t}-{R}_{ft}={\beta}_{0}+{\beta}_{1}\left({R}_{mt}-{R}_{ft}\right)+{\beta}_{2}SM{B}_{t}+{\beta}_{3}HML{O}_{t}$ (19)

$+{\beta}_{4}RM{W}_{t}+{\beta}_{5}CM{A}_{t}+{u}_{t},\text{\hspace{0.17em}}t=1,2,\cdots ,T,$ (20)

${u}_{t}={\sigma}_{t}{z}_{t},\text{\hspace{0.17em}}{z}_{t}~SSAEPD\left(\alpha ,{p}_{1},{p}_{2}\right),$

${\sigma}_{t}^{2}={a}_{0}+{a}_{1}{u}_{t-1}^{2}+{b}_{1}{\sigma}_{t-1}^{2}.$

The data generation process is as follows:

1) Given $\alpha =0.5,{p}_{1}={p}_{2}=2$ , generate SSAEPD random numbers ${\left\{{z}_{t}\right\}}_{t=1}^{T}$ ^{11}.

2) Set ${\sigma}_{0}^{2}=1,{\epsilon}_{0}=0,a=0.3,b=0.5,c=0.4$ , generate ${\left\{{\sigma}_{t}^{2}\right\}}_{t=1}^{T}$ and ${\left\{{u}_{t}\right\}}_{t=1}^{T}$ with the following formulas:

${\sigma}_{1}^{2}={a}_{0}+{a}_{1}{\sigma}_{0}^{2}{z}_{t}^{2}+{b}_{1}{\sigma}_{0}^{2},$

${u}_{1}={z}_{1}{\sigma}_{1}.$

Table 10. Simulation results.

Notes: T means the true value of parameters. E means the estimates. P means the error in percentage.

3) Generate ${\left\{{X}_{1t}\right\}}_{t=1}^{T}$ , ${\left\{{X}_{2t}\right\}}_{t=1}^{T}$ , ${\left\{{X}_{3t}\right\}}_{t=1}^{T}$ , ${\left\{{X}_{4t}\right\}}_{t=1}^{T}$ , ${\left\{{X}_{5t}\right\}}_{t=1}^{T}$ from Uniform (0,1)^{12}.

4) Set ${\beta}_{0}=0.2,{\beta}_{1}=1,{\beta}_{2}=0.5,{\beta}_{3}=0.5,{\beta}_{4}=0.5,{\beta}_{5}=0.5$ , and we can get ${\left\{{Y}_{t}\right\}}_{t=1}^{T}$ .

${Y}_{t}={\beta}_{0}+{\beta}_{1}{X}_{1t}+{\beta}_{2}{X}_{2t}+{\beta}_{3}{X}_{3t}+{\beta}_{4}{X}_{4t}+{\beta}_{5}{X}_{5t}+{u}_{t},\text{\hspace{0.17em}}t=1,2,\cdots ,T.$

^{12}For simplicity, we use Xs to represent the 5 factors in simulation.

After getting the simulated data ${\left\{{X}_{1t},{X}_{2t},{X}_{3t},{X}_{4t},{X}_{5t},{Y}_{t}\right\}}_{t=1}^{T}$ , we can use them to estimate the parameters in the FF5-SSAEPD-GARCH model. The simulation results are reported in Table 10. Almost all the estimates are close to the true values of the parameters. Hence, we conclude that this MatLab program is valid for empirical analysis.
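The data generation steps 1) to 4) can be sketched in Python as follows. Several points are assumptions made for this illustration and are not confirmed by the text: the values a = 0.3, b = 0.5, c = 0.4 are read as ${a}_{0}$ , ${a}_{1}$ , ${b}_{1}$ of the GARCH(1,1) equation; ${\epsilon}_{0}=0$ is read as the initial residual ${u}_{0}$ ; and, because SSAEPD with $\alpha =0.5,{p}_{1}={p}_{2}=2$ reduces to Normal (0, 1), the innovations are drawn with a Normal generator instead of a general SSAEPD sampler. The function name is hypothetical.

```python
import math
import random


def simulate_ff5_garch(T, beta, a0, a1, b1, sigma2_0=1.0, u_0=0.0, seed=0):
    """Simulate (X, Y, U, S2) from the FF5-GARCH(1,1) design of this appendix.

    beta = [beta_0, ..., beta_5]; X[t] holds the five Uniform(0,1) regressors,
    U[t] the residual u_t and S2[t] the conditional variance sigma_t^2.
    """
    rng = random.Random(seed)
    sigma2_prev, u_prev = sigma2_0, u_0
    X, Y, U, S2 = [], [], [], []
    for _ in range(T):
        z = rng.gauss(0.0, 1.0)                      # step 1: N(0,1) innovation
        sigma2 = a0 + a1 * u_prev ** 2 + b1 * sigma2_prev  # step 2: GARCH(1,1)
        u = z * math.sqrt(sigma2)
        x = [rng.random() for _ in range(5)]         # step 3: Uniform(0,1) factors
        y = beta[0] + sum(b * v for b, v in zip(beta[1:], x)) + u  # step 4
        X.append(x)
        Y.append(y)
        U.append(u)
        S2.append(sigma2)
        sigma2_prev, u_prev = sigma2, u
    return X, Y, U, S2
```

Calling `simulate_ff5_garch(1000, [0.2, 1.0, 0.5, 0.5, 0.5, 0.5], 0.3, 0.5, 0.4)` produces one artificial sample with the true values listed in steps 2) and 4); under the assumed mapping, ${a}_{1}+{b}_{1}=0.9<1$ , so restriction (4) holds.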

NOTES

^{1}The FF5-Normal model of [2] is in Appendix 2.

^{2}The history of SSAEPD is displayed in Appendix 3.

^{3} [1] analyze 25 Fama-French portfolios, which is different from the dataset we use.

^{4}Simulation results are listed in Appendix 4.

^{5}The reason for using $HML{O}_{t}$ instead of $HM{L}_{t}$ can be found in Appendix 2.

^{6}The LR formula is from Neyman and Pearson (1933).

^{7}The null hypothesis of the KS test is ${H}_{0}$ : Data follow a specified distribution. If the P-value of the KS test is bigger than the 5% significance level, the null hypothesis is not rejected. Otherwise, the null hypothesis is rejected.

^{8}The null hypothesis is ${H}_{0}$ : FF5-SSAEPD-GARCH residuals are distributed as $SSAEPD\left(\widehat{\alpha},{\widehat{p}}_{1},{\widehat{p}}_{2}\right)$ .

^{9}The FF5-Normal model is listed in Appendix 2. The null hypothesis is ${H}_{0}$ : FF5-Normal residuals are distributed as $Normal\left(\widehat{\mu},{\widehat{\sigma}}^{2}\right)$ .

^{10}For more information about SSAEPD, one can refer to Appendix 3.