Open Journal of Statistics
Vol.08 No.01(2018), Article ID:82245,13 pages
10.4236/ojs.2018.81003

A New Stochastic Restricted Liu Estimator for the Logistic Regression Model

Weibing Zuo, Yingli Li

College of Mathematics and Statistics, North China University of Water Resources and Electric Power, Zhengzhou, China

Copyright © 2018 by authors and Scientific Research Publishing Inc.

This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).

http://creativecommons.org/licenses/by/4.0/

Received: November 28, 2017; Accepted: January 29, 2018; Published: February 1, 2018

ABSTRACT

In order to overcome the well-known multicollinearity problem, we propose a new Stochastic Restricted Liu Estimator for the logistic regression model. In the mean square error matrix sense, the new estimator is compared with the Maximum Likelihood Estimator, the Liu Estimator, the Stochastic Restricted Maximum Likelihood Estimator, etc. Finally, a numerical example and a Monte Carlo simulation are given to illustrate some of the theoretical results.

Keywords:

Multicollinearity, Liu Estimator, Stochastic Restricted Liu Estimator, Scalar Mean Squared Error Matrix

1. Introduction

Consider the following multiple logistic regression model:

$$ y_i = \pi_i + \varepsilon_i, \quad i = 1, \ldots, n, \tag{1.1} $$

where $y_i$ follows a Bernoulli distribution with parameter $\pi_i$ given by

$$ \pi_i = \frac{\exp(x_i \beta)}{1 + \exp(x_i \beta)}, \tag{1.2} $$

where $\beta$ is a $(p+1) \times 1$ vector of coefficients, $x_i$ is the $i$th row of $X$, an $n \times (p+1)$ data matrix with $p$ explanatory variables, and the $\varepsilon_i$ are independent with mean zero and variance $\pi_i(1-\pi_i)$. Maximum likelihood is the most commonly used method of estimating the parameters, and the Maximum Likelihood Estimator (MLE) is defined as

$$ \hat{\beta}_{MLE} = C^{-1} X' \hat{W} Z, \tag{1.3} $$

where $C = X' \hat{W} X$, $\hat{W} = \operatorname{diag}[\hat{\pi}_i (1 - \hat{\pi}_i)]$ and $Z$ is the column vector whose $i$th element equals $\log(\hat{\pi}_i) + \dfrac{y_i - \hat{\pi}_i}{\hat{\pi}_i (1 - \hat{\pi}_i)}$. The MLE is an asymptotically unbiased estimator of $\beta$. The covariance matrix of $\hat{\beta}_{MLE}$ is

$$ \operatorname{Cov}(\hat{\beta}_{MLE}) = (X' \hat{W} X)^{-1} = C^{-1}. \tag{1.4} $$
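As a rough illustration of (1.3) and (1.4), the sketch below fits the MLE by iteratively reweighted least squares in NumPy. The function names, the synthetic data and the tolerance are assumptions made for this example only. The working response is written with the log-odds $x_i \hat{\beta}$ as its first term, which is the usual IRLS form and is likewise an assumption here.

```python
import numpy as np

def irls_quantities(X, y, beta):
    """Return C = X'WX, the diagonal w of W, and the working response Z at beta."""
    eta = X @ beta                            # linear predictor x_i beta
    pi = 1.0 / (1.0 + np.exp(-eta))           # fitted probabilities, as in (1.2)
    w = pi * (1.0 - pi)                       # diagonal of W_hat
    Z = eta + (y - pi) / w                    # working response (assumed log-odds form)
    C = X.T @ (w[:, None] * X)                # C = X' W_hat X
    return C, w, Z

def logistic_mle(X, y, n_iter=100, tol=1e-10):
    """Iterate beta <- C^{-1} X' W Z until the update is smaller than tol."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        C, w, Z = irls_quantities(X, y, beta)
        beta_new = np.linalg.solve(C, X.T @ (w * Z))
        converged = np.max(np.abs(beta_new - beta)) < tol
        beta = beta_new
        if converged:
            break
    return beta

# Synthetic data for illustration only (not taken from the paper)
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
beta_true = np.array([0.5, 1.0, -1.0])
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-X @ beta_true)))

beta_mle = logistic_mle(X, y)
C, w, Z = irls_quantities(X, y, beta_mle)
cov_mle = np.linalg.inv(C)                    # estimated covariance, as in (1.4)
```

At convergence the fitted vector reproduces itself under the update $C^{-1} X' \hat{W} Z$, which is the fixed-point reading of (1.3).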

Multicollinearity inflates the variance of the MLE in logistic regression. Therefore, the MLE is no longer the best estimator of the parameters in the logistic regression model.

Many researchers have studied how to overcome multicollinearity in logistic regression. Schaefer et al. (1984) [1] proposed Ridge Logistic Regression (RLR). Aguilera et al. (2006) [2] proposed the Principal Component Logistic Estimator (PCLE). Nja et al. (2013) [3] proposed the Modified Logistic Ridge Regression Estimator (MLRE). Inan and Erdogan (2013) [4] proposed the Liu-type estimator (LLE).

Some scholars also improve estimation by imposing restrictions on the unknown parameters of the model, which may be exact or stochastic. When additional exact linear restrictions on the parameter vector are assumed to hold, Duffy and Santner (1989) [5] proposed the Restricted Maximum Likelihood Estimator (RMLE), Siray et al. (2014) [6] proposed the Restricted Liu Estimator (RLE), and Asar et al. (2016) [7] proposed the Restricted Ridge Estimator. When additional stochastic linear restrictions on the parameter vector are assumed to hold, Nagarajah and Wijekoon (2015) [8] proposed the Stochastic Restricted Maximum Likelihood Estimator (SRMLE), Varathan and Wijekoon (2016) [9] proposed the Stochastic Restricted Liu Maximum Likelihood Estimator (SRLMLE), and Varathan and Wijekoon (2016) [10] proposed the Stochastic Restricted Ridge Maximum Likelihood Estimator (SRRMLE).

In this article, we propose a new estimator, called the Stochastic Restricted Liu Estimator (SRLE), for the case where linear stochastic restrictions are available in addition to the logistic regression model. The article is structured as follows. The model specification and the new estimator are given in Section 2. Section 3 compares the mean square error matrix (MSEM) of the SRLE with those of the MLE and the other estimators. Section 4 presents a numerical example. Section 5 uses a Monte Carlo simulation to verify the theoretical results.

2. The Proposed Estimators

For the unrestricted model given in Equation (1.1), the LLE proposed by Liu (1993), Urgan and Tez (2008) and Mansson et al. (2012) is defined as

$$ \hat{\beta}_{LLE} = Z_d \hat{\beta}_{MLE}, \tag{2.1} $$

where $0 < d < 1$ is a parameter and $Z_d = (C + I)^{-1} (C + dI)$. The bias and covariance matrix of the LLE are

$$ \operatorname{Bias}(\hat{\beta}_{LLE}) = (Z_d - I) \beta = b_1, \tag{2.2} $$

$$ \operatorname{Cov}(\hat{\beta}_{LLE}) = Z_d C^{-1} Z_d'. \tag{2.3} $$
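The Liu shrinkage in (2.1) to (2.3) can be sketched numerically as follows. This is a minimal illustration: the function names and the small ill-conditioned matrix `C` are hypothetical and not taken from the paper.

```python
import numpy as np

def liu_matrix(C, d):
    """Z_d = (C + I)^{-1} (C + d I)."""
    I = np.eye(C.shape[0])
    return np.linalg.solve(C + I, C + d * I)

def lle(beta_mle, C, d):
    """Liu estimator (2.1) together with its covariance matrix (2.3)."""
    Zd = liu_matrix(C, d)
    return Zd @ beta_mle, Zd @ np.linalg.inv(C) @ Zd.T

def liu_bias(beta, C, d):
    """Bias b_1 = (Z_d - I) beta from (2.2); requires the true beta."""
    return (liu_matrix(C, d) - np.eye(C.shape[0])) @ beta

# Hypothetical ill-conditioned information matrix (eigenvalues 7.8 and 0.2)
C = np.array([[4.0, 3.8], [3.8, 4.0]])
beta_mle = np.array([1.0, -0.5])
beta_lle, cov_lle = lle(beta_mle, C, d=0.5)
```

With `d = 1` the matrix $Z_d$ reduces to the identity, so the LLE coincides with the MLE; smaller values of `d` trade some bias for a smaller total variance.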

In addition to the sample model (1.1), suppose that some prior information about $\beta$ is available in the form of a set of $q$ independent linear stochastic restrictions:

$$ h = H \beta + v; \quad E(v) = 0, \quad \operatorname{Cov}(v) = \Psi, \tag{2.4} $$

where $H$ is a $q \times (p+1)$ matrix of known elements with full rank $q \le p + 1$, $h$ is a $q \times 1$ stochastic known vector, and $v$ is a $q \times 1$ random vector of disturbances with mean 0 and dispersion matrix $\Psi$, which is assumed to be a known $q \times q$ positive definite matrix. Further, it is assumed that $v$ is stochastically independent of $\varepsilon^* = (\varepsilon_1, \ldots, \varepsilon_n)'$, i.e. $E(\varepsilon^* v') = 0$.

For the restricted model specified by Equations (1.1) and (2.4), the SRMLE proposed by Nagarajah and Wijekoon (2015) [8] and the SRLMLE proposed by Varathan and Wijekoon (2016) [9] are given by

$$ \hat{\beta}_{SRMLE} = \hat{\beta}_{MLE} + C^{-1} H' (\Psi + H C^{-1} H')^{-1} (h - H \hat{\beta}_{MLE}), \tag{2.5} $$

$$ \hat{\beta}_{SRLMLE} = Z_d \hat{\beta}_{SRMLE}, \tag{2.6} $$

respectively. The biases and covariance matrices of the SRMLE and SRLMLE are

$$ \operatorname{Bias}(\hat{\beta}_{SRMLE}) = 0, \tag{2.7} $$

$$ \operatorname{Bias}(\hat{\beta}_{SRLMLE}) = (Z_d - I) \beta = b_1, \tag{2.8} $$

$$ \operatorname{Cov}(\hat{\beta}_{SRMLE}) = C^{-1} - C^{-1} H' (\Psi + H C^{-1} H')^{-1} H C^{-1} = A, \tag{2.9} $$

and

$$ \operatorname{Cov}(\hat{\beta}_{SRLMLE}) = Z_d A Z_d', \tag{2.10} $$

respectively.
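Equations (2.5), (2.6), (2.9) and (2.10) translate fairly directly into matrix code. The sketch below is one possible NumPy rendering; the function names and all numerical inputs are hypothetical and serve only to make the example runnable.

```python
import numpy as np

def srmle(beta_mle, C, H, h, Psi):
    """SRMLE (2.5) and its covariance matrix A (2.9)."""
    C_inv = np.linalg.inv(C)
    K = C_inv @ H.T @ np.linalg.inv(Psi + H @ C_inv @ H.T)   # gain matrix
    beta = beta_mle + K @ (h - H @ beta_mle)
    A = C_inv - K @ H @ C_inv
    return beta, A

def srlmle(beta_mle, C, H, h, Psi, d):
    """SRLMLE (2.6) and its covariance matrix Z_d A Z_d' (2.10)."""
    I = np.eye(C.shape[0])
    Zd = np.linalg.solve(C + I, C + d * I)
    beta, A = srmle(beta_mle, C, H, h, Psi)
    return Zd @ beta, Zd @ A @ Zd.T

# Hypothetical inputs chosen only for illustration
C = np.array([[4.0, 3.8], [3.8, 4.0]])
beta_mle = np.array([1.0, -0.5])
H = np.array([[1.0, 1.0]])
h = np.array([0.8])
Psi = np.array([[1.0]])

beta_sr, A = srmle(beta_mle, C, H, h, Psi)
beta_srl, cov_srl = srlmle(beta_mle, C, H, h, Psi, d=0.5)
```

If the prior value `h` happens to equal `H @ beta_mle`, the correction term in (2.5) vanishes and the SRMLE returns the MLE unchanged.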

By analogy with the OME [12] in the linear model, the Mixed Maximum Likelihood Estimator (MME) [11] in the logistic regression model is defined as

$$ \hat{\beta}_{MME} = (C + H' \Psi^{-1} H)^{-1} (X' \hat{W} y + H' \Psi^{-1} h). \tag{2.11} $$

The bias and covariance matrix of the MME are $\operatorname{Bias}(\hat{\beta}_{MME}) = 0$ and

$$ \operatorname{Cov}(\hat{\beta}_{MME}) = (C + H' \Psi^{-1} H)^{-1} = C^{-1} - C^{-1} H' (\Psi^{-1} + H C^{-1} H')^{-1} H C^{-1} = B. $$

In this paper, we propose a new estimator, named the Stochastic Restricted Liu Estimator (SRLE), which is defined as

$$ \hat{\beta}_{SRLE} = Z_d \hat{\beta}_{MME}. \tag{2.12} $$

The bias and covariance matrix of the SRLE are

$$ \operatorname{Bias}(\hat{\beta}_{SRLE}) = E(\hat{\beta}_{SRLE}) - \beta = (Z_d - I) \beta = b_1, \tag{2.13} $$

and

$$ \operatorname{Cov}(\hat{\beta}_{SRLE}) = D(\hat{\beta}_{SRLE}) = Z_d B Z_d', \tag{2.14} $$

respectively.
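A minimal NumPy sketch of the MME and the SRLE is given below. It assumes that the data term in (2.11) can be taken as `C @ beta_mle`, i.e. $X' \hat{W} Z$ as in (1.3), so that the estimator can be computed from the MLE and $C$ alone; this substitution, the function names and the numerical inputs are assumptions of the example, not statements from the paper.

```python
import numpy as np

def mme(beta_mle, C, H, h, Psi):
    """Mixed estimator in the form of (2.11) and its covariance matrix B."""
    Psi_inv = np.linalg.inv(Psi)
    S = C + H.T @ Psi_inv @ H
    # Assumption: the data term is C @ beta_mle (= X' W_hat Z by (1.3))
    beta = np.linalg.solve(S, C @ beta_mle + H.T @ Psi_inv @ h)
    return beta, np.linalg.inv(S)

def srle(beta_mle, C, H, h, Psi, d):
    """SRLE (2.12) and its covariance matrix Z_d B Z_d' (2.14)."""
    I = np.eye(C.shape[0])
    Zd = np.linalg.solve(C + I, C + d * I)
    beta, B = mme(beta_mle, C, H, h, Psi)
    return Zd @ beta, Zd @ B @ Zd.T

# Hypothetical inputs chosen only for illustration
C = np.array([[4.0, 3.8], [3.8, 4.0]])
beta_mle = np.array([1.0, -0.5])
H = np.array([[1.0, 1.0]])
h = np.array([0.8])
Psi = np.array([[1.0]])

beta_mme, B = mme(beta_mle, C, H, h, Psi)
beta_srle, cov_srle = srle(beta_mle, C, H, h, Psi, d=0.5)
```

Because $Z_d$ has eigenvalues strictly between 0 and 1 for $0 < d < 1$, the trace of the SRLE covariance is smaller than that of $B$, at the price of the bias $b_1$.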

Now we will give a theorem and a lemma that will be used in the following paragraphs.

Theorem 2.1. [13] (Rao and Toutenburg, 1995) Let $A$ and $B$ be $n \times n$ matrices such that $A > 0$ and $B \ge 0$. Then $A + B > 0$.

Lemma 2.1. [14] (Rao et al., 2008) Let $M$ and $N$ be two $n \times n$ matrices with $M > 0$ and $N \ge 0$. Then $M > N$ if and only if $\lambda_{\max}(N M^{-1}) < 1$.
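The eigenvalue condition of Lemma 2.1 is easy to check numerically. The helper names and the two diagonal test matrices below are hypothetical and only illustrate the equivalence between the condition and the definiteness of $M - N$.

```python
import numpy as np

def lambda_max(N, M):
    """Largest eigenvalue of N M^{-1} (real when M > 0 and N >= 0 are symmetric)."""
    return float(np.max(np.linalg.eigvals(N @ np.linalg.inv(M)).real))

def dominates(M, N):
    """Check M > N through the eigenvalue condition of Lemma 2.1."""
    return lambda_max(N, M) < 1.0

M = np.diag([2.0, 3.0])
N_small = np.diag([1.0, 1.0])   # M - N_small is positive definite
N_large = np.diag([3.0, 1.0])   # M - N_large has a negative eigenvalue
```

Here `dominates(M, N_small)` holds while `dominates(M, N_large)` does not, in agreement with the signs of the eigenvalues of the two differences.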

3. Mean Square Error Matrix (MSEM) Comparisons of the Estimators

In this section, we compare the SRLE with the MLE, LLE, SRMLE and SRLMLE under the MSEM criterion.

First, the MSEM of an estimator $\hat{\beta}$ of $\beta$ is

$$ \operatorname{MSEM}(\hat{\beta}) = \operatorname{Cov}(\hat{\beta}) + [\operatorname{Bias}(\hat{\beta})] [\operatorname{Bias}(\hat{\beta})]', \tag{3.1} $$

where $\operatorname{Bias}(\hat{\beta})$ is the bias vector and $\operatorname{Cov}(\hat{\beta})$ is the dispersion matrix. For two given estimators $\hat{\beta}_1$ and $\hat{\beta}_2$, the estimator $\hat{\beta}_2$ is considered to be better than $\hat{\beta}_1$ under the MSEM criterion if and only if

$$ \Delta(\hat{\beta}_1, \hat{\beta}_2) = \operatorname{MSEM}(\hat{\beta}_1) - \operatorname{MSEM}(\hat{\beta}_2) \ge 0. \tag{3.2} $$

The scalar mean square error (MSE) is defined as

$$ \operatorname{MSE}(\hat{\beta}) = \operatorname{tr}(\operatorname{MSEM}(\hat{\beta})). \tag{3.3} $$

Since superiority under the MSEM criterion implies superiority under the scalar MSE criterion, we only consider the MSEM comparisons among the estimators.
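The relation between the two criteria can be made explicit in one step from definitions (3.1) to (3.3). The $2 \times 2$ counterexample is chosen here purely for illustration and shows that the converse implication does not hold.

```latex
\[
\Delta(\hat{\beta}_1, \hat{\beta}_2) \ge 0
\;\Longrightarrow\;
\operatorname{tr}\,\Delta(\hat{\beta}_1, \hat{\beta}_2)
  = \operatorname{MSE}(\hat{\beta}_1) - \operatorname{MSE}(\hat{\beta}_2) \ge 0,
\]
\[
\text{whereas}\quad
\Delta = \begin{pmatrix} 2 & 0 \\ 0 & -1 \end{pmatrix}
\quad\text{has}\quad
\operatorname{tr}\,\Delta = 1 > 0
\quad\text{although } \Delta \text{ is not non-negative definite.}
\]
```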

3.1. MSEM Comparisons of the MLE and SRLE

In this section, we make the MSEM comparison between the MLE and SRLE.

The MSEMs of the MLE and the SRLE are

$$ \operatorname{MSEM}(\hat{\beta}_{MLE}) = C^{-1}, \tag{3.4} $$

and

$$ \operatorname{MSEM}(\hat{\beta}_{SRLE}) = Z_d B Z_d' + b_1 b_1', \tag{3.5} $$

respectively.

We now compare these two estimators under the MSEM criterion:

$$ \Delta_1 = \operatorname{MSEM}(\hat{\beta}_{MLE}) - \operatorname{MSEM}(\hat{\beta}_{SRLE}) = C^{-1} - Z_d B Z_d' - b_1 b_1' = C^{-1} - (Z_d B Z_d' + b_1 b_1') = M_1 - N_1, \tag{3.6} $$

where $M_1 = C^{-1}$ and $N_1 = Z_d B Z_d' + b_1 b_1'$. Obviously, $b_1 b_1'$ is a non-negative definite matrix, and $C^{-1}$ and $Z_d B Z_d'$ are positive definite. By Theorem 2.1, $N_1$ is a positive definite matrix. By Lemma 2.1, if $\lambda_{\max}(N_1 M_1^{-1}) < 1$, where $\lambda_{\max}(N_1 M_1^{-1})$ is the largest eigenvalue of $N_1 M_1^{-1}$, then $M_1 - N_1$ is a positive definite matrix. Based on the above discussion, the following theorem can be stated.

Theorem 3.1. For the restricted model specified by Equations (1.1) and (2.4), the SRLE is superior to the MLE in the MSEM sense if and only if $\lambda_{\max}(N_1 M_1^{-1}) < 1$.
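The condition of Theorem 3.1 can be evaluated numerically once $C$, $H$, $\Psi$, $\beta$ and $d$ are fixed. The sketch below does this for hypothetical inputs; in practice $\beta$ is unknown and would have to be replaced by an estimate, which is not addressed here.

```python
import numpy as np

def srle_vs_mle(C, H, Psi, beta, d):
    """Return lambda_max(N1 M1^{-1}) and Delta_1 = M1 - N1 from (3.6)."""
    I = np.eye(C.shape[0])
    Zd = np.linalg.solve(C + I, C + d * I)
    B = np.linalg.inv(C + H.T @ np.linalg.inv(Psi) @ H)    # Cov of the MME
    b1 = (Zd - I) @ beta                                   # bias of the SRLE
    M1 = np.linalg.inv(C)
    N1 = Zd @ B @ Zd.T + np.outer(b1, b1)
    lam = float(np.max(np.linalg.eigvals(N1 @ C).real))    # N1 M1^{-1} = N1 C
    return lam, M1 - N1

# Hypothetical inputs chosen only for illustration
C = np.array([[4.0, 3.8], [3.8, 4.0]])
H = np.array([[1.0, 1.0]])
Psi = np.array([[1.0]])
beta = np.array([0.6, 0.8])

lam, delta1 = srle_vs_mle(C, H, Psi, beta, d=0.5)
srle_is_better = lam < 1.0
```

For these inputs the eigenvalue condition holds, and the smallest eigenvalue of `delta1` is positive, as the theorem predicts.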

3.2. MSEM Comparisons of the LLE and SRLE

The MSEM of the LLE is

$$ \operatorname{MSEM}(\hat{\beta}_{LLE}) = Z_d C^{-1} Z_d' + b_1 b_1'. \tag{3.7} $$

We now compare the LLE and the SRLE under the MSEM criterion:

$$ \Delta_2 = \operatorname{MSEM}(\hat{\beta}_{LLE}) - \operatorname{MSEM}(\hat{\beta}_{SRLE}) = Z_d C^{-1} Z_d' - Z_d B Z_d' + b_1 b_1' - b_1 b_1' = Z_d D Z_d', \tag{3.8} $$

where $D = C^{-1} H' (\Psi^{-1} + H C^{-1} H')^{-1} H C^{-1}$. Obviously, $Z_d D Z_d'$ is positive definite. Based on the above discussion, the following theorem can be stated.

Theorem 3.2. For the restricted model specified by Equations (1.1) and (2.4), the SRLE is always superior to the LLE in the MSEM sense.
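A quick numerical check of (3.8) is sketched below. It computes the gain from the restrictions as $C^{-1} - B$, with $B = (C + H' \Psi^{-1} H)^{-1}$, and verifies that the eigenvalues of $\Delta_2$ are non-negative up to rounding error, which is what criterion (3.2) requires. The function name and the inputs are hypothetical.

```python
import numpy as np

def delta2(C, H, Psi, d):
    """Delta_2 = Z_d (C^{-1} - B) Z_d'; the bias terms b_1 b_1' cancel."""
    I = np.eye(C.shape[0])
    Zd = np.linalg.solve(C + I, C + d * I)
    B = np.linalg.inv(C + H.T @ np.linalg.inv(Psi) @ H)   # Cov of the MME
    gain = np.linalg.inv(C) - B                           # reduction due to the restrictions
    return Zd @ gain @ Zd.T

# Hypothetical inputs chosen only for illustration
C = np.array([[4.0, 3.8], [3.8, 4.0]])
H = np.array([[1.0, 1.0]])
Psi = np.array([[1.0]])

min_eigs = [np.min(np.linalg.eigvalsh(delta2(C, H, Psi, d))) for d in (0.1, 0.5, 0.9)]
```

The result does not depend on $\beta$, since both estimators share the same bias vector $b_1$.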

3.3. MSEM Comparisons of the SRMLE and SRLE

The MSEM of the SRMLE is

$$ \operatorname{MSEM}(\hat{\beta}_{SRMLE}) = A. \tag{3.9} $$

We now compare the SRMLE and the SRLE under the MSEM criterion:

$$ \Delta_3 = \operatorname{MSEM}(\hat{\beta}_{SRMLE}) - \operatorname{MSEM}(\hat{\beta}_{SRLE}) = C^{-1} - C^{-1} H' (\Psi + H C^{-1} H')^{-1} H C^{-1} - Z_d B Z_d' - b_1 b_1' = C^{-1} - [F + Z_d B Z_d' + b_1 b_1'] = M_1 - N_3, \tag{3.10} $$

where $F = C^{-1} H' (\Psi + H C^{-1} H')^{-1} H C^{-1}$ and $N_3 = F + Z_d B Z_d' + b_1 b_1'$. Obviously, $b_1 b_1'$ is a non-negative definite matrix, and $F$ and $Z_d B Z_d'$ are positive definite. By Theorem 2.1, $N_3$ is a positive definite matrix. By Lemma 2.1, if $\lambda_{\max}(N_3 M_1^{-1}) < 1$, where $\lambda_{\max}(N_3 M_1^{-1})$ is the largest eigenvalue of $N_3 M_1^{-1}$, then $M_1 - N_3$ is a positive definite matrix. Based on the above discussion, the following theorem can be stated.

Theorem 3.3. For the restricted model specified by Equations (1.1) and (2.4), the SRLE is superior to the SRMLE in the MSEM sense if and only if $\lambda_{\max}(N_3 M_1^{-1}) < 1$.

3.4. MSEM Comparisons of the SRLMLE and SRLE

The MSEM of the SRLMLE is

$$ \operatorname{MSEM}(\hat{\beta}_{SRLMLE}) = Z_d A Z_d' + b_1 b_1'. \tag{3.11} $$

Now, we consider the following difference:

$$ \Delta_4 = \operatorname{MSEM}(\hat{\beta}_{SRLMLE}) - \operatorname{MSEM}(\hat{\beta}_{SRLE}) = Z_d A Z_d' - Z_d B Z_d' + b_1 b_1' - b_1 b_1' = Z_d D Z_d' - Z_d F Z_d' = M_4 - N_4, \tag{3.12} $$

where $M_4 = Z_d D Z_d'$ and $N_4 = Z_d F Z_d'$. Obviously, $D$, $M_4$ and $N_4$ are positive definite matrices. By Lemma 2.1, if $\lambda_{\max}(N_4 M_4^{-1}) < 1$, where $\lambda_{\max}(N_4 M_4^{-1})$ is the largest eigenvalue of $N_4 M_4^{-1}$, then $M_4 - N_4$ is a positive definite matrix. Based on the above discussion, the following theorem can be stated.

Theorem 3.4. For the restricted model specified by Equations (1.1) and (2.4), the SRLE is superior to the SRLMLE in the MSEM sense if and only if $\lambda_{\max}(N_4 M_4^{-1}) < 1$.

4. Numerical Example

In this section, we use the Iris data set from the UCI repository to illustrate our theoretical results.

A binary logistic regression model is fitted in which the dependent variable is coded as 0 if the plant is Iris-setosa and as 1 if the plant is Iris-versicolor. The explanatory variables are $x_1$: Sepal.Length; $x_2$: Petal.Length; and $x_3$: Petal.Width.

The sample consists of the first 80 observations. The correlation matrix is given in Table A1 (Appendix A). It shows that the correlations among the regressors are all greater than 0.80, and some of them are close to 0.98. The condition number is 55.4984, which indicates a severe multicollinearity problem in these data.
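The two diagnostics mentioned above can be computed along the following lines. The paper does not state which definition of the condition number it uses, so the square root of the ratio of the largest to the smallest eigenvalue of the correlation matrix is an assumption here, and the data are synthetic stand-ins rather than the Iris measurements.

```python
import numpy as np

def correlation_and_condition_number(X):
    """Correlation matrix of the columns of X and sqrt(lambda_max / lambda_min)."""
    R = np.corrcoef(X, rowvar=False)
    eig = np.linalg.eigvalsh(R)              # eigenvalues in ascending order
    return R, float(np.sqrt(eig[-1] / eig[0]))

# Synthetic, strongly collinear regressors (80 rows, 3 columns), for illustration only
rng = np.random.default_rng(1)
base = rng.standard_normal(80)
X = np.column_stack([base + 0.1 * rng.standard_normal(80) for _ in range(3)])

R, kappa = correlation_and_condition_number(X)
```

Large pairwise correlations in `R` together with a large `kappa` are the usual signals that the columns of the design matrix are nearly linearly dependent.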

From Table A2 (Appendix A) we can conclude that:

1) As d increases, the MSE values of the LRE, SRRMLE, SRLRE, SRLMLE and SRLE decrease.

2) The MSE values of the MLE, SRMLE and MME do not change with d.

3) The new estimator is always superior to the other estimators.

5. Monte Carlo Simulation

To illustrate the above theoretical results, a Monte Carlo simulation is carried out. Following McDonald and Galarneau (1975) [16] and Kibria (2003) [15], the explanatory variables are generated using the following equation:

$$ x_{ij} = (1 - \rho^2)^{1/2} z_{ij} + \rho z_{i,p+1}, \quad i = 1, 2, \ldots, n, \quad j = 1, 2, \ldots, p, \tag{5.1} $$

where the $z_{ij}$ are independent pseudo-random numbers from the standard normal distribution and $\rho^2$ represents the correlation between any two explanatory variables.
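The generation scheme can be sketched as follows, assuming the common component is an extra $(p+1)$th standard normal column shared by all regressors, as in the usual McDonald and Galarneau construction. The function name and the sample size used for the check are choices made for this example.

```python
import numpy as np

def generate_regressors(n, p, rho, rng):
    """Draw an n x p matrix whose columns have pairwise correlation rho**2."""
    z = rng.standard_normal((n, p + 1))      # last column is the shared component
    return np.sqrt(1.0 - rho ** 2) * z[:, :p] + rho * z[:, [p]]

rng = np.random.default_rng(0)
X = generate_regressors(n=100_000, p=3, rho=0.8, rng=rng)
R = np.corrcoef(X, rowvar=False)             # off-diagonal entries near 0.8**2 = 0.64
```

Each column has unit variance, since $(1 - \rho^2) + \rho^2 = 1$, and any two columns share only the common component, which gives covariance $\rho^2$.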

In this section, we set $\rho$ to 0.70, 0.80 and 0.99 and $n$ to 20, 100 and 200, for models with two and four explanatory variables. The dependent variable $y_i$ in (1.1) is obtained from the Bernoulli($\pi_i$) distribution where

$$ \pi_i = \frac{\exp(x_i \beta)}{1 + \exp(x_i \beta)}. $$

The parameter values of $\beta_1, \ldots, \beta_p$ are chosen so that $\sum_{j=1}^{p} \beta_j^2 = 1$ and $\beta_1 = \cdots = \beta_p$. Further, for the Liu parameter d, some selected values are chosen so that $0 \le d \le 1$. Moreover, for the restrictions, we choose

$$ H = \begin{pmatrix} 1 & 1 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{pmatrix}, \quad h = \begin{pmatrix} 1 \\ 2 \\ 1 \end{pmatrix} \quad \text{and} \quad \Psi = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}. \tag{5.2} $$
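The choice of coefficients and the generation of the binary response described above can be sketched as follows. The helper names, the placeholder regressors, the absence of an intercept and the grid of d values are assumptions made only for this illustration.

```python
import numpy as np

def equal_unit_beta(p):
    """Coefficients with beta_1 = ... = beta_p and sum of squares equal to 1."""
    return np.full(p, 1.0 / np.sqrt(p))

def draw_response(X, beta, rng):
    """Draw y_i from Bernoulli(pi_i) with pi_i as in the logistic model."""
    pi = 1.0 / (1.0 + np.exp(-X @ beta))
    return rng.binomial(1, pi), pi

rng = np.random.default_rng(0)
p = 4
beta = equal_unit_beta(p)
X = rng.standard_normal((20, p))             # placeholder regressors for the example
y, pi = draw_response(X, beta, rng)
d_grid = np.linspace(0.0, 1.0, 11)           # hypothetical grid of Liu parameters
```

With `p = 4` every coefficient equals 0.5, which satisfies both constraints on $\beta$ at once.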

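The simulation is repeated 2000 times by generating new pseudo-random numbers, and the simulated MSE values of the estimators are obtained using the following equation:

$$ \widehat{\operatorname{MSE}}(\hat{\beta}^*) = \operatorname{Mean}\{\operatorname{tr}[\operatorname{MSEM}(\hat{\beta}, \beta)]\} = \frac{1}{2000} \sum_{r=1}^{2000} (\hat{\beta}_r - \beta)' (\hat{\beta}_r - \beta), \tag{5.3} $$

where $\hat{\beta}_r$ denotes the estimate obtained in the $r$th replication.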
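The averaging in (5.3) amounts to the mean squared Euclidean distance between the replicated estimates and the true parameter vector. A minimal sketch, with a hypothetical function name and a toy array of four "replications" in place of the 2000 used in the study:

```python
import numpy as np

def simulated_mse(estimates, beta):
    """Average of (beta_hat_r - beta)'(beta_hat_r - beta) over the replications."""
    err = np.asarray(estimates) - beta       # shape (R, p), one row per replication
    return float(np.mean(np.sum(err ** 2, axis=1)))

# Toy replications for illustration only
estimates = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [1.0, 1.0]])
beta = np.zeros(2)
mse_hat = simulated_mse(estimates, beta)     # squared errors 1, 1, 0, 2 average to 1.0
```

In a full simulation each row of `estimates` would hold the value of one estimator computed from a freshly generated data set.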

The results of the simulation are reported in Tables A3-A9 (Appendix A) and also displayed in Figures A1-A3 (Appendix B).

From Tables A3-A9, Figures A1-A3, we can conclude that:

1) The MSE values of all the estimators increase as $\rho$ increases.

2) The MSE values of all the estimators decrease as n increases.

3) The SRLE is always superior to the MLE, LLE, SRMLE and SRLMLE for all d, n and $\rho$.

6. Concluding Remarks

In this paper, we proposed the Stochastic Restricted Liu Estimator (SRLE) for the logistic regression model when linear stochastic restrictions are available. In the MSEM sense, we obtained necessary and sufficient conditions, or sufficient conditions, for the SRLE to be superior to the MLE, LLE, SRMLE and SRLMLE, and verified its superiority by Monte Carlo simulation. How to reduce the bias of the new estimator while guaranteeing that the mean square error does not increase is the focus of our future work.

Acknowledgements

This work was supported by the Natural Science Foundation of Henan Province of China (No. 152300410112).

Cite this paper

Zuo, W.B. and Li, Y.L. (2018) A New Stochastic Restricted Liu Estimator for the Logistic Regression Model. Open Journal of Statistics, 8, 25-37. https://doi.org/10.4236/ojs.2018.81003

References

  1. Schaefer, R.L., Roi, L.D. and Wolfe, R.A. (1984) A Ridge Logistic Estimator. Communications in Statistics - Theory and Methods, 13, 99-113.

  2. Aguilera, A.M., Escabias, M. and Valderrama, M.J. (2006) Using Principal Components for Estimating Logistic Regression with High-Dimensional Multicollinear Data. Computational Statistics & Data Analysis, 50, 1905-1924. https://doi.org/10.1016/j.csda.2005.03.011

  3. Ogoke, U.P., Nduka, E.C. and Nja, M.E. (2013) The Logistic Regression Model with a Modified Weight Function in Survival Analysis. Mathematical Theory & Modeling, 3, 12-17.

  4. Inan, D. and Erdogan, B.E. (2013) Liu-Type Logistic Estimator. Communications in Statistics - Simulation and Computation, 42, 1578-1586. https://doi.org/10.1080/03610918.2012.667480

  5. Duffy, D.E. and Santner, T.J. (1989) On the Small Sample Properties of Norm-Restricted Maximum Likelihood Estimators for Logistic Regression Models. Communications in Statistics - Theory and Methods, 18, 959-980. https://doi.org/10.1080/03610928908829944

  6. Siray, G.Ü., Toker, S. and Kaciranlar, S. (2014) On the Restricted Liu Estimator in the Logistic Regression Model. Communications in Statistics - Simulation and Computation, 44, 217-232. https://doi.org/10.1080/03610918.2013.771742

  7. Asar, Y., Arashi, M. and Wu, J. (2017) Restricted Ridge Estimator in the Logistic Regression Model. Communications in Statistics - Simulation and Computation, 46. https://doi.org/10.1080/03610918.2016.1206932

  8. Nagarajah, V. and Wijekoon, P. (2015) Stochastic Restricted Maximum Likelihood Estimator in Logistic Regression Model. Open Journal of Statistics, 5, 837-851. https://doi.org/10.4236/ojs.2015.57082

  9. Varathan, N. and Wijekoon, P. (2016) Logistic Liu Estimator under Stochastic Linear Restrictions. Statistical Papers, 1-18.

  10. Varathan, N. and Wijekoon, P. (2016) Ridge Estimator in Logistic Regression under Stochastic Linear Restrictions. British Journal of Mathematics & Computer Science, 15, 1-14. https://doi.org/10.9734/BJMCS/2016/24585

  11. Zuo, W.-B. and Li, Y.-L. (2017) Mixed Maximum Likelihood Estimator in Logistic Regression Model. Journal of Henan Institute of Education, 26, 1-6.

  12. Theil, H. and Goldberger, A.S. (1961) On Pure and Mixed Estimation in Economics. International Economic Review, 2, 65-77. https://doi.org/10.2307/2525589

  13. Rao, C.R. and Toutenburg, H. (1995) Linear Models: Least Squares and Alternatives. Second Edition, Springer-Verlag, New York. https://doi.org/10.1007/978-1-4899-0024-1

  14. Rao, C.R., Toutenburg, H., Shalabh and Heumann, C. (2008) Linear Model and Generalizations. Springer, Berlin.

  15. Kibria, B.M.G. (2003) Performance of Some New Ridge Regression Estimators. Communications in Statistics - Simulation and Computation, 32, 419-435. https://doi.org/10.1081/SAC-120017499

  16. McDonald, G.C. and Galarneau, D.I. (1975) A Monte Carlo Evaluation of Some Ridge-Type Estimators. Journal of the American Statistical Association, 70, 407-416. https://doi.org/10.1080/01621459.1975.10479882

Appendix A

Table A1. The correlation matrix of the dataset.

Table A2. The estimated MSEM values for different d.

Table A3. The estimated MSEM values for different d when $n = 20$ and $\rho = 0.70$.

Table A4. The estimated MSEM values for different d when $n = 20$ and $\rho = 0.80$.

Table A5. The estimated MSEM values for different d when $n = 20$ and $\rho = 0.99$.

Table A6. The estimated MSEM values for different d when $n = 100$ and $\rho = 0.7$.

Table A7. The estimated MSEM values for different d when $n = 100$ and $\rho = 0.8$.

Table A8. The estimated MSEM values for different d when $n = 200$ and $\rho = 0.8$.

Table A9. The estimated MSEM values for different d when $n = 200$ and $\rho = 0.99$.

Appendix B

Figure A1. The estimated MSE values for MLE, LLE, SRMLE, SRLMLE and SRLE for $n = 20$.

Figure A2. The estimated MSE values for MLE, LLE, SRMLE, SRLMLE and SRLE for $n = 100$.

Figure A3. The estimated MSE values for MLE, LLE, SRMLE, SRLMLE and SRLE for $n = 200$.