Open Journal of Statistics
Vol.06 No.04(2016), Article ID:69976,8 pages
10.4236/ojs.2016.64056
Shrinkage Estimation in the Random Parameters Logit Model
Tong Zeng1, R. Carter Hill2
1Department of Applied Business Sciences and Economics, University of La Verne, La Verne, CA, USA
2Department of Economics, Louisiana State University, Baton Rouge, LA, USA

Copyright © 2016 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/



Received 10 May 2016; accepted 20 August 2016; published 23 August 2016
ABSTRACT
In this paper, we explore the properties of a positive-part Stein-like estimator which is a stochastically weighted convex combination of a fully correlated parameter model estimator and uncorrelated parameter model estimator in the Random Parameters Logit (RPL) model. The results of our Monte Carlo experiments show that the positive-part Stein-like estimator provides smaller MSE than the pretest estimator in the fully correlated RPL model. Both of them outperform the fully correlated RPL model estimator and provide more accurate information on the share of population putting a positive or negative value on the alternative attributes than the fully correlated RPL model estimates. The Monte Carlo mean estimates of direct elasticity with pretest and positive-part Stein-like estimators are closer to the true value and have smaller standard errors than those with fully correlated RPL model estimator.
Keywords:
Pretest Estimator, Stein-Rule Estimator, Positive-Part Stein-Like Estimator, Likelihood Ratio Test, Random Parameters Logit Model

1. Introduction
The random parameters logit (RPL) model is a generalization of the conditional logit model for multinomial choices. The conditional logit model is derived from an assumption that the errors in the underlying random utility functions for each choice alternative are statistically independent and identically distributed (iid) extreme value type I. This leads to the property known as the Independence of Irrelevant Alternatives (IIA): The ratio of the probability of two alternatives remains constant no matter how many choices there are. This is widely regarded to be a very restrictive assumption.
The key feature of the RPL model is that response parameters can vary randomly, following a chosen distribution, across the population from which samples are drawn. The random coefficients capture individual heterogeneity and the model does not suffer from the independence of irrelevant alternatives assumption. The random coefficients can be correlated in the RPL model as generally expected in reality, because the unobservable preference of each individual is used to evaluate the attributes of all alternatives in each choice situation. Estimation is by maximum simulated likelihood (MSL), which is described by [1] .
In this paper we explore a problem that can exist in any correlated random parameters model. Let
,
be an observable outcome variable from a density
, where
is a vector of K explanatory variables and
are random parameters with mean
and covariance matrix
. Using MSL we estimate the population parameters
and
. Allowing the random parameters to be correlated introduces potentially many new parameters,
covariance terms, that are difficult to estimate.
Most applied researchers will test the significance of the covariance parameters before deciding to rely on the fully correlated random parameter model instead the model in which the parameters are random but uncorrelated, so that
is diagonal. We explore whether a pretesting strategy improves postestimation inference. We also explore the use of a Stein-like shrinkage estimator as an alternative to pretesting. This estimator shrinks the estimates from the fully correlated parameter model towards the estimates of the uncorrelated random parameter model. In numerical experiments using the RPL model we find that both the pretest estimator and shrinkage estimators have improved mean squared error (MSE) relative to the MSL estimator of the fully correlated parameter model. Last, we analyze the share of the population putting a positive or negative value on the alternative attributes, and the Monte Carlo mean estimates of direct elasticity with fully correlated RPL model estimates and pretest and shrinkage estimates. Based on our Monte Carlo experiment results, pretest and shrinkage estimates provide more accurate estimates on both of them than the fully correlated RPL model estimates.
2. The Random Parameters Logit Model
The RPL model is described in [2] . Consider individual n facing M alternatives. The random utility associated with alternative i is
, where
are K observed explanatory variables for alternative i,
is an iid type I extreme value error which is independent of
and






bability that individual n chooses alternative i is of the usual logistic form,
that 






For estimation purposes we use Cholesky’s decomposition and write


where 




where
3. Stein-Like Shrinkage Estimation
Stein-rule estimators follow the work of [3] and [4] and combine sample information with non-sample infor- mation in a way that improve the precision of the estimation process and the quality of subsequent predictions. The Stein-rule estimator is a weighted average of the restricted and unrestricted estimators, the weight being a function of the magnitude of the test statistic used to test the restrictions.
Following is the Stein-rule estimator which dominates the maximum likelihood estimator (MLE) in linear regression under weighted quadratic loss with weight matrix W, 









where 
strictions,


Sufficient conditions for minimaxity, meaning that the estimator minimizes the maximum risk over the entire parameter space, are 


where 


where u is the test statistic for the hypothesis





The positive-part Stein-like estimator 


where 

pothesis that the coefficient covariance matrix is diagonal, or equivalently that the Cholesky elements in A below the diagonal are zero. The scalar a controls the amount of shrinkage towards the UCRPLM estimates. The shrinkage estimator 






With test statistic u, the pretest estimator 

where 









4. Monte Carlo Experiments
4.1. Design
In our experiments the number of choice alternatives is 















Our simulation and RPL model estimation were carried out in NLOGIT 5.0. Based on our Monte Carlo experiment results, [7] and [8] , we use 100 Halton draws to simulate choice probabilities during MSL estimation. The positive-part Stein-like and pretest estimators were calculated based on the likelihood ratio (LR), Lagrange multiplier (LM) and Wald test statistics with 25%, 5% and 1% significance level. Because the empirical percentile values of LR test are closer to the related critical values than those of LM and Wald tests, we only provide the results based on the LR test statistic. Using Monte Carlo experiments to study the RPL model, especially with correlated parameters, is numerically challenging. Key elements that are worth mentioning are 1) for the uncorrelated parameter model conditional logit estimates were used as starting values; 2) for the correlated parameter model the estimates from 1) were used as starting values; 3) samples for which con- vergence was not achieved were discarded, only 0.3% of the results are unconverged in our Monte Carlo experiments.
4.2. Results
To study how the pretest and shrinkage estimators reduce the estimation risk of the FCRPLM estimators, we calculate the MSEs of the estimated parameters mean, variance, covariance with the pretest, shrinkage and FCRPLM estimators respectively. First, we compare the MSE of the fully correlated estimators and those of
UCRPLM estimators, where MSE is the Monte Carlo average of the squared error loss
Table 1, the MSEs of UCRPLM estimators are all smaller than those of FCRPLM estimators. The risk of the estimated parameters mean with the FCRPLM is more than twice that of the UCRPLM. The MSEs of the estimated variance with the UCRPLM are about 25% of those with the FCRPLM. With nonzero correlation

Table 1. The ratios of uncorrelated RPL model estimator MSE to the FCRPLM estimator MSE.
In Table 2, we compare the MSEs of LR based pretest and shrinkage estimators to those of FCRPLM estimators. All Table 2 ratios are less than one. The pretest and shrinkage estimators all perform better than the FCRPLM estimators. With a smaller level of test significance



The covariance elements reveal important information about the joint effect of alternative attributes on people' decisions. If two random coefficients are highly positively correlated with each other, it means people are attracted and motivated by both of the related attributes. In our Monte Carlo experiments, the shrinkage estimators with higher shrinkage constant a outperform estimators with less shrinkage and most of the pretest estimators.
Since one of the advantages of RPL model is providing the information on the share of population that places a positive or negative value on the alternative attributes, we also calculate the joint probability of the first two estimated parameters are less than zero. Table 3 shows the share of population putting a negative value on the attributes. Compared to the results with UCRPLM and FCRPLM estimates, the joint probability with FCRPLM estimates are closer to the true value with larger MSEs, except for the
To analyze the sensitivity of the RPL model in response to a change in the level of alternative attribute, we calculate the mean estimates of direct elasticity with the true parameters
Table 2. The ratios of LR Based pretest, shrinkage estimator MSE to the FCRPLM estimator MSE.
Table 3. The Share of population putting negative value on the first two attributes of each alternative,
Note: [ ] provides the MSE results, {} provides bias results.
Table 4. The mean estimates of direct elasticity with true parameters
Table 5. Since the pretest estimator with smaller level of test significance has smaller MSE, we use the pretest estimator with 1% significance level. The first explanatory variable in each alternative 
Comparing the results in Table 4 to Table 5, we find that the results with FCRPLM estimates are all higher than the true value. When the
5. Conclusion
According to our Monte Carlo experiment results, the UCRPLM estimators have smaller estimation risk than the
Table 5. The Monte Carlo mean estimates of direct elasticity based on pretest, shrinkage and FCRPLM estimates.
Note: ( ) provides the standard error results.
FCRPLM estimators. The pretest and positive-part Stein-like estimators both perform better than the FCRPLM estimators. The positive-part Stein-like estimators with higher shrinkage constant a outperform those with a smaller one and the pretest estimators. Shrinkage estimation reduces the risk of the FCRPLM estimators by shrinking the FCRPLM estimates towards the UCRPLM estimates. Providing the information on the share of population putting a negative or positive value on the alternative attributes is one of the advantages of the RPL model. When the random coefficients are correlated to each other, the FCRPLM estimator of this quantity has a smaller bias and slightly larger MSE than the UCRPLM estimator. Based on our Monte Carlo experiments, the pretest and shrinkage estimates can reduce the MSEs of the estimated results of share of the population putting a positive or negative value on alternative attributes as well. The Monte Carlo mean estimates of direct elasticity based on the pretest and shrinkage estimators with a larger shrinkage constant are closer to the true value with smaller standard errors than those based on the FCRPLM estimators.
Cite this paper
Tong Zeng,R. Carter Hill, (2016) Shrinkage Estimation in the Random Parameters Logit Model. Open Journal of Statistics,06,667-674. doi: 10.4236/ojs.2016.64056
References
- 1. Greene, W.H. (2012) Econometric Analysis. Pearson Education, Inc., NJ.
- 2. Train, K.E. (2009) Discrete Choice Methods with Simulation. Cambridge University Press, Cambridge.
http://dx.doi.org/10.1017/CBO9780511805271 - 3. James, W. and Stein, C.M. (1961) Estimation with Quadratic Loss. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, 1, 361-379, University California Press, Berkeley.
- 4. Stein, C.M. (1956) Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution. Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, 1954-1955, I, 197-206. University California Press, Berkeley.
- 5. Kim, M. and Hill, R.C. (1995) Shrinkage Estimation in Nonlinear Regression: the Box-Cox Transformation. Journal of Econometrics, 66, 1-33.
http://dx.doi.org/10.1016/0304-4076(94)01606-Z - 6. McFadden, D. (1989) A Method of Simulated Moments for Estimation of Discrete Response Models without Numerical Integration. Econometrica, 57, 995-1026.
http://dx.doi.org/10.2307/1913621 - 7. Bhat, C.R. (2001) Quasi-Random Maximum Simulated Likelihood Estimation of the Mixed Multinomial Logit Model. Transportation Research Part B, 35, 677-693.
http://dx.doi.org/10.1016/S0191-2615(00)00014-X - 8. Zeng, T. (2016) Using Halton Sequences in Random Parameters Logit Models. Journal of Statistical and Econometric Methods, 5, 59-86.
NOTES
1Other choices are possible. See Train (2009, 136).




