Applied Mathematics
Vol.06 No.12(2015), Article ID:61470,19 pages
10.4236/am.2015.612183
Asymptotic Confidence Bands for Copulas Based on the Local Linear Kernel Estimator
Diam Bâ1, Cheikh Tidiane Seck2, Gane Samb Lô1,3
1LERSTAD, Université Gaston Berger de Saint-Louis, Saint-Louis, Sénégal
2LERSTAD, Université Alioune Diop, Bambey, Sénégal
3LSTA, Université Pierre et Marie Curie, Paris, France

Copyright © 2015 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).


Received 29 September 2015; accepted 22 November 2015; published 25 November 2015

ABSTRACT
In this paper, we establish asymptotically optimal simultaneous confidence bands for the copula function based on the local linear kernel estimator proposed by Chen and Huang [1] . For this, we prove under smoothness conditions on the derivatives of the copula a uniform in bandwidth law of the iterated logarithm for the maximal deviation of this estimator from its expectation. We also show that the bias term converges uniformly to zero with a precise rate. The performance of these bands is illustrated by a simulation study. An application based on pseudo-panel data is also provided for modeling the dependence structure of Senegalese households’ expense data in 2001 and 2006.
Keywords:
Copula Function, Kernel Estimation, Local Linear Estimator, Uniform in Bandwidth Consistency, Simultaneous Confidence Bands

1. Introduction
Let us consider a random vector
with joint cumulative distribution function H and marginal distribution functions F and G. The Sklar’s theorem (see [2] ) says that there exists a bivariate distribution function C on
with uniform margins such that

The function C is called a copula associated with the random vector
. If the marginal distribution functions F and G of H are continuous, then the copula C is unique and is defined as

where
and
,
, are the generalized inverses of F and G respectively.
From these facts, estimating bivariate distribution function can be achieved in two steps: 1) estimating the margins F and G; 2) estimating the copula C.
In this paper, we are dealing with nonparametric copula estimation. We consider a copula function C with uniform margins U and V defined on
. Then, we can write

The aim of this paper is to construct asymptotic optimal confidence bands, for the copula C, from the local linear kernel estimator proposed by Chen and Huang [1] . Our approach, based on modern functional theory of empirical processes, allows the use of data-driven bandwidths for this estimator, and is largely inspired by the works of Mason [3] and Deheuvels and Mason [13] .
There are two main methods for estimating copula functions: parametric and nonparametric methods. The Maximum likelihood estimation method (MLE) and the moment method are popular parametric approaches. It happens that one may use a nonparametric approach like the MLE-method and, at the same time, estimates margins by using parametric methods. Such an approach is called a semi-parametric estimation method (see [4] ). A popular nonparametric method is the kernel smoothing. Scaillet and Fermanian [5] presented the kernel smoothing method to estimate bivariate copulas for time series. Genest and Rivest [6] gave a nonparametric empirical distribution method to estimate bivariate Archimedean copulas.
A pure nonparametric estimation of copulas treats both the copula and the margins in a parameter-free way and thus offers the greatest generality. Nonparametric estimation of copulas goes back to Deheuvels [7] who proposed an estimator based on a multivariate empirical distribution function and on its empirical marginals. Weak convergence studies of this estimator can be found in Fermanian et al. [8] . Gijbels and Mielniczuk [9] proposed a kernel estimator for a bivariate copula density. Another approach of kernel estimation is to directly estimate a copula function as explored in [5] . Chen and Huang [1] proposed a new bivariate kernel copula estimator by using local linear kernels and a simple mathematical correction that removes the boundary bias. They also derived the bias and the variance of their estimator, which reveal that the kernel smoothing produces a second order reduction in both the variance and mean square error as compared with the unsmoothed empirical estimator of Deheuvels [7] .
Omelka, Gijbels and Veraverbeke [10] proposed improved shrinked versions of the estimators of Gijbels and Mielniczuk [9] and Chen and Huang [1] . They have done this shrinkage by including a weight function that removes the corner bias problem. They also established weak convergence for all newly-proposed estimators.
In parallel, powerful technologies have been developed for density and distribution function kernel estimation. We refer to Mason [3] , Dony [11] , Einmahl and Mason [12] , Deheuvels and Mason [13] . In this paper, we’ll apply these recent methods to kernel-type estimators of copulas. The existence of kernel-type function estimators should lead to nonparametric estimation by confidence bands, as shown in [13] , where general asymptotic simultaneous confidence bands are established for the density and the regression function curves. Furthermore, to our knowledge, there is not yet such type of results in nonparametric estimation of copulas. This motivated us to extend such technologies to kernel estimation of copulas by providing asymptotic simultaneous optimal confidence bands.
Let
be an independent and identically distributed sample of the bivariate random vector
, with continuous marginal cumulative distribution function F and G. To construct their estimator, Chen and Huang proceed in two steps. In the first step, they estimate margins by

where
and
are some bandwidths and K is the integral of a symmetric bounded kernel function k supported on




unknown copula C. To prevent boundary bias, Chen and Huang suggested using a local linear version of the kernel k given by
where





where



Our best achievement is the construction of asymptotic confidence bands from a uniform in bandwidth law of the iterated logarithm (LIL) for the maximal deviation of the local linear estimator (1), and the uniform convergence of the bias to zero with the same speed of convergence.
The paper is organized as follows. In Section 2, we expose our main results in Theorems 1, 2 and 3. Simulation studies and applications to real data sets are also made in this section to illustrate these results. In Section 3, we report the proofs of our assertions. The paper is ended by Appendix in which we postpone some technical results and numerical computations.
2. Main Results and Applications
2.1. Main Results
Here, we state our theoretical results. Theorem 1 gives a uniform in bandwidth LIL for the maximal deviation of the estimator (1). Theorem 2 handles the bias, while Theorem 3 provides asymptotic optimal simultaneous confidence bands for the copula function
Theorem 1. Suppose that the copula function







where



Remark 1. Theorem 1 represents a uniform in bandwidth law of the iterated logarithm for the maximal deviation of the estimator (1). As in [15] we may use it, in its probability version, to construct simultaneous asymptotic confidence bands from the estimator (1). In this purpose, we must ensure before hand that the bias term





Theorem 2. Suppose that the copula function







Because a number of copula families do not possess bounded second-order partial derivatives, the application of these results is limited by a corner bias problem. To overcome this difficulty and apply these results to a wide family of copulas, we adopt the shrinkage method of Omelka et al. [10] , by taking a local data-driven bandwidth

(H1)
where



For such a bandwidth

By condition (H1), (5) is equivalent for


This latter estimator (6) is exactly the improved shrinked version proposed by Omelka et al. [10] . It enables us to keep the bias bounded on the borders of the unit square and then to remove the problem of possible unboundedness of the second order partial derivatives of the copula function C. To set up asymptotic optimal simultaneous confidence bands for the copula C, we need the following additional condition:
(H2)
If conditions (H1) and (H2) hold, then we can infer from Theorem 1 that
This is still equivalent to

To make use of (7) for forming confidence bands, we must ensure that the bandwidth


This would be the case if condition H2) holds and
Theorem 3. Suppose that the assumptions of Theorem 1 and Theorem 2 hold. Then for any local data-driven bandwidth




and,

where


Remark 2. Whenever (9) and (10) hold jointly for each

provide asymptotic simultaneous optimal confidence bands (at an asymptotic confidence level of 100%) for the copula function


2.2. Simulations and Data-Driven Applications
2.2.1. Simulations
We make some simulation studies to evaluate the performance of our asymptotic confidence bands. To this end, we compute the confidence bands given in (11) for some classical parametric copulas, and check for whether the true copula is lying in these bands. For simplicity, we consider for example two families of copulas: Frank and Clayton, defined respectively as follows:

and

We fix values for the parameter

・ step 1: Generate two values u and v from
・ step 2: Set
・ step 3: Compute

Then






and (H2) are fulfilled. That is the case for






In Figure 1, we represent the confidence bands and the Frank copula, while Figure 2 represents the confidence bands and the Clayton copula. One can see that the true curves of the two parametric copulas are well contained in the bands.
We can also remark some simulitudes between Figure 1 and Figure 2. This seems normal because of the closeness of the values of




As we cannot visualize all the information in the above figures, we provide in Appendix some numerical computations to best appreciate the performance of our bands. To this end, we generate 10 couples










Figure 1. Confidence bands for the Frank copula in 3D, with
Figure 2. Confidence bands for the Clayton copula in 3D, with





2.2.2. Data-Driven Applications
In this subsection, we apply our theoretical results to select graphically, among various copula families, the one that best fits sample data. Towards this end, we shall represent in a same 2-dimensional graphic the confidence bands established in Theorem 3 and the curves corresponding to the different copulas considered. To illustrate this, we use data expenses of Senegalese households, available in databases managed by the National Agency of Statistics and Demography (ANSD) of the Republic of Senegal (www.ansd.sn). The data were obtained from two sample surveys: ESAM2 (Senegalese Survey of Households, 2nd edition, 2001-2002) and ESPS (Monitoring Survey of Poverty in Senegal, 2005-2006). Because of the not availability of recent data, we deal with the pseudo-panel data utilized in [17] , which consist of two series of observations of size

Instead of smoothing these observations denoted by
to define the kernel estimator of the true copula. Here,




This application is limited to Archimedean copulas. We will consider for example three parametric families of copulas: Frank, Gumbel and Clayton. Our aim is to find graphically, using our confidence bands, the family that best fits these pseudo-panel data. The unknown parameter





Figure 3 shows that the Clayton family seems more adequate to fit our pseudo-panel data. That is, the dependence modeling for these Senegalese households expense data is more satisfactory with the Clayton family than for the other two copulas.
We now apply the maximum likelihood method for fitting copulas and compare it with our graphical method described in Figure 3. For this, it suffices to compute (see Table 2 below), for each of the three copulas, the log-likelihood function defined as
Table 1. Expression of Kendall’s tau and estimated values for q.
Table 2. Log-likelihood values.
Figure 3. Simultaneous representation of the three copulas into the confidence bands.
where


From Table 2, we can see that the maximum likelihood method also leads to the Clayton family as the best copula for fitting our data. So we recommend it to model the dependence structure of the Senegalese households expense data in the years 2001 and 2006.
2.3. Concluding Remarks
This paper presented a nonparametric method to estimate the copula function by providing asymptotic confidence bands based on the local linear kernel estimator. The results are applied to select graphically the best copula function that fits the dependence structure of the Senegalese households pseudo-panel data.
In perspective, similar results can be obtained with other kernel-type estimators of copula function like the mirror-reflection and transformation estimators.
3. Proofs
In this section, we first expose technical details allowing us to use the methodology of Mason [3] described in Proposition 1 and Corollary 1 that are necessary to prove our results. In the second step we give successively the proofs of the theorems stated in Section 2.
We begin by decomposing the difference

The probabilistic term
is called the deviation of the estimator from its expectation. We’ll study its behavior by making use of the methodology described in [3] . The other term that we denote
is the so-called bias of the estimator. It is deterministic and its behavior will depend upon the smoothness conditions on the copula C and the bandwidth h.
Recall the estimator proposed by Deheuvels in [7] , which is defined as
where



with


To study the behavior of the deviation
be the uniform bivariate empirical distribution function based on a sample


Then one can observe that

For all
where g belongs to a class of measurable functions

Since


To make use of Mason’s Theorem in [3] , the class of functions

(G.i)
(G.ii)There exists some constant


(F.i)


(F.ii)






The checking of these conditions constitutes the proof of the following proposition which will be done in Appendix.
Proposition 1. Suppose that the copula function C has bounded first order partial derivatives on

where

Corollary 1. Under the assumptions of Proposition 1, for any sequence of constants


Proof. (Corollary 1)
First, observe that the condition


Next, by the monotonicity of the function




Combining this and Proposition 1, we obtain
Thus the Corollary 1 follows from (16).
Proof. (Theorem 1)
The proof is based upon an approximation of the empirical copula process





By Theorem 3.2 in [14] , for

where
This yields

By the works of Wichura on the iterated law of logarithm (see [15] ), one has

which readily implies
Since


The proof is then finished by applying Corollary 1 which yields

Thus, there exists a constant


Proof. (Theorem 2)
For all
and
with


Hence
By continuity of F and G, we have for n large enough,
and
Thus,
By applying a 2-order Taylor expansion and taking account of the symmetry of the kernels


we obtain, by Fubini, that for all
Since the second order partial derivatives are assumed to be bounded, then we can infer that
and hence,

Proof. (Theorem 3)
From (8), we can infer that for any given




That is

On the other hand we deduce from (7) that for all
Case 1. If
then (23) becomes
Thus, for any given


Case 2. If
then, analogously to Case 1, we can infer from (23) that, for any given


Letting


or

Now, by observing that
we can write, for any
and
That is, (9) and (10) hold.
Acknowledgements
The authors are very grateful to anonymous referees for their valuable comments and suggestions.
Cite this paper
DiamBâ,Cheikh TidianeSeck,Gane SambLô, (2015) Asymptotic Confidence Bands for Copulas Based on the Local Linear Kernel Estimator. Applied Mathematics,06,2077-2095. doi: 10.4236/am.2015.612183
References
- 1. Chen, S.X. and Huang, T.-M. (2007) Nonparametric Estimation of Copula Functions for Dependence Modeling. Canadian Journal of Statistics, 35, 265-282.
http://dx.doi.org/10.1002/cjs.5550350205 - 2. Nelsen, R.B. (2006) An Introduction to Copulas. 2nd Edition, Springer, New York.
- 3. Mason, D.M. and Swanepoel, J.W.H. (2010) A General Result on the Uniform in Bandwidth Consistency of Kernel-Type Function Estimators. TEST, 20, 72-94.
http://dx.doi.org/10.1007/s11749-010-0188-0 - 4. Tsukahara, H. (2005) Semiparametric Estimation in Copula Models. The Canadian Journal of Statistics, 33, 357-375.
http://dx.doi.org/10.1002/cjs.5540330304 - 5. Scaillet, O. and Fermanian, J.-D. (2002) Nonparametric Estimation of Copulas for Time Series. FAME Research Paper No. 57.
http://ssrn.com/abstract=372142
http://dx.doi.org/10.2139/ssrn.372142 - 6. Genest, C. and Rivest, L. (1993) Statistical Inference for Archimedean Copulas. Journal of the American Statistical Association, 88, 1034-1043.
http://dx.doi.org/10.1080/01621459.1993.10476372 - 7. Deheuvels, P. (1979) La fonction de dépendence empirique et ses propriétés. Un test non paramétrique. d’indépendance. Bulletin Royal Belge de l’Académie des Sciences, 65, 274-292.
- 8. Fermanian, J., Radulovic, D. and Wegkamp, M. (2004) Weak Convergence of Empirical Copula Processes. International Statistical Institute (ISI) and Bernoulli Society for Mathematical Statistics and Probability, 10, 847-860.
http://dx.doi.org/10.3150/bj/1099579158 - 9. Gijbels, I. and Mielniczuk, J. (1990) Estimation of the Density of a Copula Function. Communications in Statistics, Series A, 19, 445-464.
http://dx.doi.org/10.1080/03610929008830212 - 10. Omelka, M., Gijbels, I. and Veraverbeke, N. (2009) Improved Kernel Estimators of Copulas: Weak Convergence and Goodness-of-Fit Testing. The Annals of Statistics, 37, 3023-3058.
http://dx.doi.org/10.1214/08-AOS666 - 11. Dony, J. (2007) On the Uniform in Bandwidth Consistency of Kernel-Type Estimators and Conditional. Proceedings of the European Young Statisticians Meeting, Castro Urdiales, 10-14 September 2007.
- 12. Einmahl, U. and Mason, D.M. (2005) Uniform in Bandwidth Consistency of Kernel-Type Function Estimators. The Annals of Statistics, 33, 1380-1403.
http://dx.doi.org/10.1214/009053605000000129 - 13. Deheuvels, P. and Mason, D.M. (2004) General Asymptotic Confidence Bands Based on Kernel-Type Function Estimators. Statistical Inference Stochastic Process, 7, 225-277.
http://dx.doi.org/10.1023/B:SISP.0000049092.55534.af - 14. Zari, T. (2010) Contribution à l’étude du processus empirique de copule. Thèse de doctorat, Université Paris, Paris, 6.
- 15. Wichura, M.J. (1973) Some Strassen-Type Laws of the Iterated Logarithm for Multiparameter Stochastic Processes with Independent Increments. The Annals of Probability, 1, 272-296.
http://dx.doi.org/10.1214/aop/1176996980 - 16. van der Vaart, A.W. and Wellner, J.A. (1996) Weak Convergence and Empirical Processes. Springer, New York.
http://dx.doi.org/10.1007/978-1-4757-2545-2 - 17. Lo, G.S., Sall, S.T. and Mergane, P.D. (2015) Functional Weak Laws for the Weighted Mean Losses or Gains and Applications. Applied Mathematics, 6, 847-863.
http://dx.doi.org/10.4236/am.2015.65079
Appendix
1. Proof of Proposition 1
Proof. It suffices to check the conditions (G.i), (G.ii), (F.i) and (F.ii) given in Section 3.
Checking for (G.i). For


Then,
This implies
Checking for (G.ii).
We have to show that
where



Now we express A and B as integrals of the copula function
Since because


We can also notice that
Thus
For n enough large, we have by continuity of F and G,
and
By splitting the integrals, we obtain after simple calculus that
All these six terms can be bounded up by applying Taylor expansion. Precisely, we have
From this, we can conclude that
and
with
Checking for (F.i). We have to check that

Consider the following classes of functions:

It is clear that by applying the lemmas 2.6.15 and 2.6.18 in van der Vaart and Wellner (see [16] , p. 146-147), the sets








Checking for (F.ii).
Define the class of functions
It’s clear that


and, for
where


Let

and
Then, one can easily see that


This implies, for all large m, that



By right-continuity of
and conclude that

2. Numerical Computations
Table A1. Confidence bands for clayton copula calculated for some random couples of values
Table A2. Confidence bands for Frank copula calculated for some random couples of values


















































































