Open Journal of Statistics
Vol.09 No.04(2019), Article ID:94146,15 pages
10.4236/ojs.2019.94028
Robust Continuous Quadratic Distance Estimation Using Quantiles for Fitting Continuous Distributions
Andrew Luong
École d’actuariat, Université Laval, Ste Foy, Québec, Canada

Copyright © 2019 by author(s) and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).
http://creativecommons.org/licenses/by/4.0/



Received: June 19, 2019; Accepted: August 3, 2019; Published: August 6, 2019
ABSTRACT
Quadratic distance estimation making use of the sample quantile function over a continuous range is introduced. It extends previous methods which are based only on a few sample quantiles and it parallels the continuous GMM method. Asymptotic properties are established for the continuous quadratic distance estimators (CQDE) and the implementation of the methods are discussed. The methods appear to be useful for balancing robustness and efficiency and useful for fitting distribution with model quantile function being simpler than its density function or distribution function.
Keywords:
Covariance Kernel, Influence Function, Hilbert Space, Linear Operator, GMM Estimation, Spectral Decomposition

1. Introduction
For estimation in a classical setup, we often assume to have n independent, identically distributed observations from a continuous density which belongs to a parametric family , i.e., where , and is the true vector of parameters, is assumed to be compact. One of the main objectives of inferences is to be able to estimate . In an actuarial context, the sample observations might represent losses of a certain type of contracts and an estimate of is necessary if we want to make rates or premiums for the type of contract where we have observations.
Maximum likelihood (ML) estimation are density based and often the domain of the density function must not depend on the parameters is one of the regularity conditions so that ML estimators attain the lower bound as given by the information matrix. In many applications, this condition is not met. We can consider the following example which gives the Generalized Pareto distribution (GPD) and draw the attention on the properties of the model quantile function which appears to have nicer properties than the density function and hence motivate us to develop continuous quadratic distance (CQD) estimation using quantiles on a continuum range which generalizes the quadratic distance (QD) methods based on few quantiles as proposed by LaRiccia and Wehrly [1] which can be viewed as based on a discrete range and hence CQD estimation might overcome the arbitrary choice of quantiles of QD as CQD will essentially make use of all the quantiles over the range with .
Example (GPD).
The GP family is a two parameters family with the vector of parameter .
The density, distribution function and quantile function are given respectively by
and
,
the distribution function is given by
and
,
the quantile function is given by
These functions can be found in Castillo et al. [2] (pages 65-66). Among these functions only the domain of the quantile function does not depend on the parameters and naturally if the model quantile function satisfies some additional conditions such as differentiability, it is natural to develop statistical inference methods using the sample quantile function instead of the sample distribution function which are defined respectively as
and
with being the degenerate distribution at is the commonly used sample distribution. The counterpart of is the model quantile function , see Serfling [3] (pages 74-80).
Due to the complexity of the density function for the GP model, alternative methods to ML have been developed in the literature for example with the probability weighted moments (PWM) method proposed by Hosking and Wallis [4] which leads to solve moment type of equations to obtain estimators by matching selected empirical moments with their model counterpart. The drawback of the PWM method is the range of the parameters must be restricted for the selected moments to exist, see Hosking and Wallis [4] , Kotz and Nadarajah [5] (p 36). The PWM method might not be robust and some robust methods have been proposed by Dupuis [6] , Juarez and Schucany [7] for estimation for the GP model.
For estimating parameters of the GPD, the percentiles matching (PM) method for fitting loss distributions as described by Klugman et al. [8] (pages 256-257) can also be used. It consists of first selecting two points , with as we only have two parameters and solve the following moment type of estimating equations to obtain the estimators, i.e., is the vector of solutions of
or equivalently,
and
or equivalently,
.
The method is robust but not very efficient as only two points are used here to obtain moment type of equations and there is also arbitrariness on the choice of these two points. Castillo and Hadi [9] have improved this method by first selecting a set of two points,
and obtain a set of corresponding PM estimators and finally define the final estimators according to a rule to select from the set of PM estimators generated by the set
. The question on arbitrariness on selecting the set
is still not resolved with this method.
Instead of solving moment type of equations, for parametric estimation in general not necessary for the GPD with the vector of parameters
, LaRiccia and Wehrly [1] proposed to construct quadratic distance based on the discrepancy of
using
selected points
’s with
, so that we can define the following two vectors
and
with

which is based on the sample and its model counterpart defined as
.
This leads to a class of quadratic distance of the form
(1)
and the quadratic distance (QD) estimators are found by minimizing the objective function given by expression (1),
is a class of symmetric positive definite matrix which might depend on
. Goodness-of-fit test statistics can also be constructed using expression (1), see Luong and Thompson [10] .
By quadratic distance estimation without further specializing it is continuous we mean that it is based on quadratic form as given by expression (1), it also fits into classical minimum distance (CMD) estimation and closely related to Generalized Methods of moment (GMM) and by GMM without further specializing that it is continuous GMM, we mean GMM based on a finite number of moment conditions, see Newey and McFadden [11] (p 212-2128).
Using the asymptotic theory of QD estimation or CMD estimation, it is well known that by letting
to be the inverse of the asymptotic covariance matrix of 




In fact, it has been shown that it suffices to use a consistent estimate for 



then we can construct a consistent estimate which is given 


In practice, for QD estimation we let 

For GMM estimation, it is quite straightforward to construct


Continuous GMM theory makes use of Hilbert space linear operator theory and have been developed in details by Carrasco and Florens [12] and as mentioned it is closely related to the theory for continuous QD, we shall make use of their results to establish consistency and asymptotic normality of continuous quadratic distance estimators and since the paper aims at providing results for practitioners for their applied works, the presentation will emphasize methodologies with less technicalities so that it might be more suitable for applied researchers for their works. First, we shall briefly outline how to form the quadratic distance to obtain the CQD estimators and postpone the details for later sections of the paper.
CQD estimators can be viewed as estimators based on minimizing a continuous quadratic form as given by

with:
1) 
2) a and b are chosen values with a being close to 0 and b close to 1 and
In practice, we work with an asymptotic equivalent objective function 




Since the kernel 


As in the spectral decomposition of a symmetric positive defined matrix for the Euclidean space, spectral decomposition in Hilbert space allows the kernel


representation, we can express 

which is similar to the expression used to obtain continuous GMM estimators as given by Carrasco and Florens [12] (page 799).
Spectral decompositions in functional space have been used in the literature, see Feuerverger and McDunnough [13] (page 312), Durbin [14] (page 292-294). Furthermore, if 


to obtain CQD estimators. Unless otherwise stated, by CQD estimators we mean estimators using the objective function of the form as defined by expression (5).
Carrasco and Florens [6] (page 799) developed perturbation technique, a technique to obtain 

The objectives of the paper are to develop CQD estimation based on quantiles with the aims to have estimators which are robust in the sense of bounded influence functions and have good efficiencies. For technicalities, we refer to the paper by Carrasco and Florens [12] who have introduced continuous GMM estimation.
The paper is organized as follows. Section 2 gives some preliminary results such as statistical functional and its influence function from which the sample quantiles can be viewed as robust statistics with bounded influence functions. CQD estimation using quantiles will inherit the same robustness property. Some of the standard notions for the study of kernel functions will also be reviewed. By linking a kernel to a linear operator in the Hilbert space of functions which are square integrable over the range 

Finally, we shall mention that simulation studies are not discussed in this paper as numerical quadrature methods are involved for evaluating the integrals over the range 
2. Some Preliminaries
In this section we shall review the notion of statistical functional and its influence function and view a sample quantile as a statistical functional. Using its influence function, we can see that the sample quantile is a robust statistic and using the influence functions of two sample quantiles, we can also obtain the asymptotic covariance of the two sample quantiles.
2.1. Statistical Functional and Its Influence Function
Often, a statistic can be represented as a functional of the sample distribution 











Furthermore, since a Taylor type of approximation in a functional space can be used, we then have the following approximation expressed with a remainder term
or equivalently using




If 




Therefore, if we want to find the asymptotic variance of 

The influence function of the sth-sample quantile 

from which we can obtain the asymptotic variance of

See Serfing [3] (page236), Hogg et al. [15] (page 593). Also, using the influence function representation for the sth-sample quantile 



see LaRiccia and Wehrly [1] (page 743).
If we define the covariance kernel as

then associated to this kernel there is a linear operator 


We can see that for a suitable functional space, it is natural to consider the Hilbert space of functions which are square integrable so that a norm and linear operators can be defined in this space. This will facilitate the studies of kernels which are function of
2.2. Linear Operators Associated with Kernels in a Hilbert Space
The functional space that we are interested is the space of integrable function with the range 



For a Euclidean space, the composition of two linear operators 










Just as a matrix 








More precisely, given 
Furthermore,
if 

In this paper we focus on positive definite symmetric kernel 





Unless otherwise stated, we work with linear operators associated with positive definite symmetric kernels. For the Euclidean space if the covariance matrix 




see Hogg et al. [15] (pages 179-180) for square root of a symmetric positive definite matrices and they can be computed using the technique of spectral decomposition of matrices.
If 





For our purpose, we shall focus on a linear operator 




2.3. Estimation of K and K−1
The methods used to construct an estimator for 
1) We need a preliminary consistent estimate 



2) Use 





For our set-up, i.e., CQD estimation, we should use the influence function of the sample quantiles as given by expression (6) to specify

The notion of influence function was not mentioned in Carrasco and Florens [12] .
3) Since 







4) Use the spectral decomposition to express 

The above expression is similar to the representation of a positive definite matrix 




We can proceed as follows in order to find 



It turns out that 





The eigenfunction can be expressed as 
can be computed as statistical packages often offer routines to compute eigenvalues and eigenvectors for a given matrix.
For numerical evaluations of 

Now we turn into attention of constructing 



It appears then the kernel of 






Now we can define define 





For example, if we let 

This also means that the kernel for 

and again 

This also means that 



In Section 3 we shall turn our attention to asymptotic properties of CQD estimators using the objective function 




For consistency, we shall make use the basic consistency Theorem, i.e., Theorem 2.1 as given by Newey and McFadden [12] (page 2121). For establishing asymptotic normality for the CQD estimators, the procedures are similar to those used for establishing asymptotic normality of continuous GMM estimators as given by Theorem 8 given by Carrasco and Florens [12] (page 811, page 825).
3. Asymptotic Properties
3.1. Consistency
Assuming 


Now if we assume that the integrand can be dominated by a function 













3.2. Asymptotic Normality
The basic assumption used to establish asymptotic normality for the CQD estimators is the model quantile function is twice differentiable which allows a standard Taylor expansion the estimating equations.
Assuming the first derivative vector 

Before considering the Taylor expansion, we also need the following notation and the notion of a random element with zero mean and covariance given by the kernel of the associated linear operator K, i.e., 








as 

Using expression (12), we then have

Now using 


Note that 

Let
And 

so that

with the symbol 
The matrix 


using the spectral decomposition technique, the 


4. Summary and Conclusion
The proposed method is similar to the continuous GMM method with the estimators obtained using sample distribution function obtained by minimizing

being an optimum kernel but using a sample distribution function 



The authors also showed that by letting
For robustness sake for continuous GMM estimation we might want to let 𝑇 be finite and the lower bound be 






Acknowledgements
The helpful and constructive comments of a referee which lead to an improvement of the presentation of the paper and support from the editorial staffs of Open Journal of Statistics to process the paper are all gratefully acknowledged.
Conflicts of Interest
The author declares no conflicts of interest regarding the publication of this paper.
Cite this paper
Luong, A. (2019) Robust Continuous Quadratic Distance Estimation Using Quantiles for Fitting Continuous Distributions. Open Journal of Statistics, 9, 421-435. https://doi.org/10.4236/ojs.2019.94028
References
- 1. LaRiccia, V.N. and Wehrly, T.E. (1985) Asymptotic Properties of a Family of Minimum Quantile Distance Estimators. Journal of the American Statistical Association, 80, 742-747. https://doi.org/10.1080/01621459.1985.10478178
- 2. Castillo, E., Hadi, A.S., Balakrishnan, N. and Sarabia, J.M. (2005) Extreme Value and Related Models with Applications in Engineering and Science. Wiley, New York.
- 3. Serfling, R.J. (1980) Approximation Theorems of Mathematical Statistics. Wiley, New York. https://doi.org/10.1002/9780470316481
- 4. Hosking, J.R.M. and Wallis, J.R. (1987) Parameter and Quantile Estimation for the Generalized Pareto Distribution. Technometrics, 29, 339-349. https://doi.org/10.1080/00401706.1987.10488243
- 5. Kotz, S. and Nadarajah, S. (2000) Extreme Value Distributions. Imperial College Press, London. https://doi.org/10.1142/p191
- 6. Dupuis, D.J. (1988) Exceedances over High Thresholds: A Guide to Threshold Selection. Extremes, 1, 251-261.
- 7. Juarez, S.F. and Schucany, W.R. (2004) Robust and Efficient Estimation for the Generalized Pareto Distribution. Extremes, 7, 237-251. https://doi.org/10.1007/s10687-005-6475-6
- 8. Klugman, S.A., Panjer, H.H. and Willmot, G.E. (2012) Loss Models: From Data to Decisions. Fourth Edition, Wiley, New York. https://doi.org/10.1002/9781118787106
- 9. Castillo, E. and Hadi, A.S. (1997) Fitting the Generalized Pareto Distribution to Data. Journal of the American Statistical Association, 92, 1619-1620. https://doi.org/10.1080/01621459.1997.10473683
- 10. Luong, A. and Thompson, M.E. (1987) Minimum Distance Methods Based on Quadratic Distance for Transforms. Canadian Journal of Statistics, 15, 239-251. https://doi.org/10.2307/3314914
- 11. Newey, W.K. and McFadden, D. (1994) Large Sample Estimation and Hypothesis Testing. In: Engle, R. and McFadden, D., Eds., Handbook of Econometrics, Volume 4, Elsevier, Amsterdam, 419-554.
- 12. Carrasco, M. and Florens, J.-P. (2000) Generalization of GMM to a Continuum of Moment Condition. Econometric Theory, 16, 797-834. https://doi.org/10.1017/S0266466600166010
- 13. Feuerverger, A. and McDunnough, P. (1984) On Statistical Transform Methods and Their Efficiency. Canadian Journal of Statistics, 12, 303-317. https://doi.org/10.2307/3314814
- 14. Durbin, J. and Knott, M. (1972) Components of the Cramer-von Mises Statistics. Journal of the Royal Statistical Society, Series B, 34, 290-307. https://doi.org/10.1111/j.2517-6161.1972.tb00908.x
- 15. Hogg, R.V., McKean, J.W. and Craig, A.T. (2013) Introduction to Mathematical Statistics. Seventh Edition, Pearson, Hoboken.
- 16. Luenberger, D.G. (1968) Optimization by Vector Space Methods. Wiley, New York.









