Applied Mathematics
Vol.4 No.3(2013), Article ID:28865,9 pages DOI:10.4236/am.2013.43077

The Riemannian Structure of the Three-Parameter Gamma Distribution

William W. S. Chen1, Samuel Kotz2

1Department of Statistics, The George Washington University, Washington DC, USA

2Department of Engineering Management and Systems Engineering, The George Washington University, Washington DC, USA

Email: williamwschen@gmail.com

Received October 24, 2012; revised January 11, 2013; accepted January 18, 2013

Keywords: Mean Curvature; Gamma Distribution; Ricci Curvature; Riemannian Geometry; Scalar Curvature; Sectional Curvature

ABSTRACT

In this paper, we will utilize the results already known in differential geometry and provide an intuitive understanding of the Gamma Distribution. This approach leads to the definition of new concepts to provide new results of statistical importance. These new results could explain Chen [1-3] experienced difficulty when he attempts to simulate the sampling distribution and power function of Cox’s [4,5] test statistics of separate families of hypotheses. It may also help simplify and clarify some known statistical proofs or results. These results may be of particular interest to mathematical physicists. In general, it has been shown that the parameter space is not of constant curvature. In addition, we calculated some invariant quantities, such as Sectional curvature, Ricci curvature, mean curvature and scalar curvature.

1. Introduction

This paper will cover the study of the geometrical structure of the parameter space of the three parameter Gamma distribution. Rao [6] was one of the first to propose the differential-geometrical approach to parameter spaces, by introducing a Riemannian metric in terms of the Fisher Information matrix. The major objective of this paper is to utilize the results already known in differential geometry and provide an intuitive understanding of the three parameter Gamma Distribution. This approach leads to the definition of new concepts, which provide new results of statistical importance. These points will be clear in the concluding remark. Pioneer work in this direction was done by Efron [7], who introduced the notion of the so called “statistical curvature” of a one-parameter statistical model. Later, he showed that the statistical curvature is closely related to Fisher and Rao’s theory of second-order efficiency. Recently, there have been some new developments in this area. Mitchel A.F.S. [8] studies alpha-curvature tensor and alpha-geodesics of the elliptic distribution. Burbea and Rao [9] also study distances between distributions more particularly the multinomial and the normal distributions. Skovgaard [10] defines and provides some details on the geometry of the multivariate normal manifold. Sato et al. [11] has listed all the possible sectional curvatures of the bivariate normal distribution. Kass R.E. and Vos P.W. [12,13] gave an organized and through summary in this area. In Section 2, we defined the fundamental Riemannian metric tensor and its inverse tensor. In Section 3.1, we define and derive the sectional curvature. In Section 3.2, the definition and origin of the Ricci curvature tensors will be given and then used to find the mean curvatures of each direction and the scalar curvature of the parameter space. In Section 4, we list all nonzero Riemannian tensors for reference purposes. In the concluding remarks we tabulate the three sectional curvature and mean curvature versus a function of shaped parameter. Finally, we plot on the vertical axis of these tabulated seven functional values against its shape parameter as horizontal axes. In this way we could clearly see the validity of the function and the points where functions are undefined. It is helpful to use the “REDUCE Programming Language” [14] to perform the needed computation.

2. The Parameter Space and Riemannian Metric

Let us consider a family of gamma distribution with random variables x, such that a distribution is specified by a set of 3 real parameters. Then we can construct an 3-dimensional space of distributions with a coordinate system. Let denote the probability density function of x specified by. We assume the regularity conditions that the density functions exist and smooth in the space. A maximal smooth structure may be introduced making the family a 3-dimensional smooth manifold. A family of densities having a locally 3-dimensional Euclidean topology together with a maximal smooth structure will be called a 3-dimensional manifold of densities.

When the parameter space is in a higher dimension, a certain coordinate system, that specifies a point p on the n-dimensional manifold, by is considered. The squared distance between two neighboring points p and p+dp, ,

, can be represented by the quadratic form, where is the component of Fisher’s Information matrix defined by

(2.1)

where E denotes the expectation with respect to the distribution

If these quadratic form is a symmetric positive definite metric tensor then the surface is said to form a Riemannian manifold.

A random variable x has a Gamma distribution if its probability density function is:

(2.3)

The distribution is known as the Type III of Pearson’s system. This distribution depends on three parameters. They are known as the location, scale and shape parameters. This distribution has widely been employed as a model for distributions of life span, reaction time, and related phenomena. The three-parameter Gamma distribution has shapes similar to the Weibull, Lognormal, and Inverse Gaussian Distribution. Balakrishnan N. and Chen W. [15] will publish a handbook that tabulates the coefficients of “BLUP” of the estimable function for the location and scalar parameters for some commonly used for given shape parameters. A list of 130 references in Johnson and Kotz [16] is located at the end of Chapter 17, pp. 200-206. An updated revised version by Johnson, Kotz and Balakrishnan [17] listed more than 310 references at the end of Chapter 17, pp. 397-414. These are the most complete resources for this distribution. In both editions the authors have suggested that maximum likelihood of estimating equations (MLE) should not be used unless the shape parameter is greater than 2.5. A detailed discussion of this observation will be given at the concluding remarks.

Gamma distribution contains three parameters: In some coordinate systems we can consider a three-dimensional parameter space where one point can be represented by the number To simplify our notation it would be convenient to reparameterize our parameters as .

Using the definition of (2.1), we can derive our Riemannian Metric Tensor of the three-parameter Gamma distribution in the following form:

(2.4)

In Equation (2.4), the distribution property (2.3) restricts. It is also clear that when the shape parameter equals 1 or 2, the Riemannian Metric Tensors do not exist. In fact, when the distribution in (2.3) reduces to an exponential distribution. Additionally, when or when closing on either from left side or right side of 2, the -Tensor will become infinitely large. The concluding remarks will discuss this case in detail. Due to the commonly known distributional property of gamma, that digamma functions are only undefined at negative integers and at zero, the shape parameter must be a positive real value in order to guarentee the existence of.

In the three parameters gamma distribution we can show the determinant G is rank 3 and hence can form an 3-dimensional smooth manifold. However, if the location parameter is fixed then scalar and shape parameters can form a 2-dimensional smooth maniforld only. In (2.4), each element of the matrix does not involve the location parameter. Hence the rank of matrix is full rank after the location parameter is fixed. This means the 2-manifold formed by scale and shape parameters is not a geodesic submanifold of the 3-manifold.

Again, to simplify our notation, we will set the numerator of determinant G equal to Δ. Hence,

The inverse

of the Riemannian Metric Tensor is given by

(2.5)

where the inverse of the Riemannian Metric Tensor will satisfy the relation The Kronecker Symbol

3. Structure of the Parameter Space

3.1. Sectional Curvature

This section will give some geometrical quantitative interpretations to the curvature tensor in n-dimensional space. Let M be a Riemannian n-manifold and let p be a point in M. The Gaussian Curvature at a point p on the surface, geodesic at p, and tangent to a plane in the tangent space at p is called the sectional curvature at. If is any basis for the plane, we would use the notation for the sectional curvature and define it as follows:

(3.1)

In the case of the three-parameter Gamma Distribution, we can apply Formula (3.1) to find the sectional curvature in the following form:

(3.2)

If M is a two-dimensional space, it is clear that the sectional curvature coincides with the Gaussian Curvature. If the sectional curvature at every point of a Riemannian n-Manifold M does not depend on the two-dimensional section passing through the point, then it is constant over the manifold. Such a Riemannian n-Manifold is said to be of constant curvature. The Equation (3.2) show that all three sectional curvatures are functions of the shape parameter. Clearly, they are not of constant curvature. The detailed discussion about the vertical or horizontal asymptotic lines of these functions are given at the concluding remarks (See Figure 1).

3.2. Ricci Tensor and Related Curvatures

The four dimensional Riemannian Tensors are complicated, it would be better to construct simpler tensors that summarize some of the information contained in the curvature tensor. The most important of such tensors is the Ricci curvature tensor, or simply, the Ricci tensor. Denoted Ricci, which is the covariant 2-tensor field defined as the trace of the curvature endomorphism, on its first and last indices. The components of Ricci are usually denoted so that For the space of a three-parameter Gamma Distribution, the components of the Ricci Tensors are given by the following formulae:

(3.3)

Figure 1. Location-scale, location-shape, scale-shape curvature function and mean curvature function of location.

During the derivation of the Ricci curvature tensor, we applied the following four symmetric properties of the Riemannian Curvature Tensor:

(3.4)

Detailed proof of these four useful properties will not be given but left in references [18,19]. We will state these relations in terms of components with respect to any basis. Relation c) states the first pair of indices and the second pair can be interchanged without altering the value of the components. Relations a) and b) state that the covariant curvature tensor is skew-symmetric with respect to the first two indices and the last two indices. The symmetry expressed in d) is called the “algebraic Bianchi identity”. Using a)-d), it is easy to show that a three-term sum obtained by cyclically permuting any three indices of R is also zero.

Let be a normal orthogonal base of tangent space TpM at a point p of the manifold M. is the sectional curvature determined by the basis vectors. Then, the mean curvatures

in the direction are defined by:

(3.5)

In the three-parameter Gamma Distribution case, we obtain the mean curvatures for each direction of the bases.

(3.6)

The scalar curvature, R, of the space is defined as the trace of the Ricci Tensor:. As a result, the calculation discovered is the scalar curvature of our parameter space:

(3.7)

From Equations (3.6) and (3.7), notice that the mean and scalar curvatures are also a function of shape parameter only. In the concluding remarks, we would like to tabulate and plot all the three sectional curvatures and mean curvatures together. In this way it would be very easy to see the common behavior of these six functions (See Figure 2).

Figure 2. Mean curvature function of scale, shape and total curvature function.

4. List the Fundamental Tensor

4.1. This Section Shows the List for Christoffel Symbols of the First Kind of the Three-Parameter Gamma Distribution

(4.1)

4.2. This Is the List for Christoffel Symbols of the Second Kind

(4.2)

4.3. This Is the List for the Four Components Covariant Riemannian Curvature Tensor

While all the other components are zero.

5. Concluding Remarks

As the saying goes, a graph is worth a thousand words. After tabulating in 5.1 these seven important curvature functions as functions of the shape parameter, using both common values and extremes, the next step in the analysis is to plot these seven functions versus the shape parameter.

Let us summarize the sectional and mean curvature graphs and enumerate their common phenomenona. First, we observe that there is a common result among the six functions when the shape parameter increases to positive infinity: the six functions all converge to zero, either from above or below the horizontal axes. It is a wellknown fact that for shape parameters greater than or equal to 44.44, the gamma distribution nearly coincides with the normal distribution. Even if the shape parameter increase to 120, the gamma distribution would not deviate much from the normal distribution. Hence, we would expect to see one of two things: either the three sectional curvatures have identical values, or the graphs of the three sectional curvature have a very flat and long tail or the horizontal axis as their common asympototic axis. This point of view could also apply to the three mean curvatures. Second, we observed that all six curvature graphs have a vertical asympototic line at either

or both. For the location-scale, k(1,2), or the scale-shape parameters, k(2,3), or sectional curvature and mean curvature of scale parameter, k(2), there is an almost equilateral hyperbola. All of these three functions have a vertical asympototic line at 2 or 1. This could explain Chen’s [1-3] attempts to simulate the sampling distribution and power function of Cox’s [4,5] test statistics of separate families of hypotheses. He found that the Newton-Raphson iteration method could fail to converge to the required roots when real roots are close to 2 or 1. It clearly finds that the slope of the curve or surface will always switch between a very large positive and negative value. As we mention before, Johnson, Kotz and Balakrishnan: Continuous Univariate Distributions Volume I both editions (p. 356 for second edition) has claimed that the shape parameter would be at least 2.5 or larger to avoid this nonconvergent problem. The location-shape, k(1,3), behaves more differently for the sectional curva-

Table 1. Shape parameter of three-parameter gamma distribution.

ture and mean curvature of location. However, while they both have asympototic lines at 2, and both converge to zero above or below the horizontal axis as the shape parameter increased to infinity. Finally, the mean curvature of shape parameter, k(3), has two asympototic lines at 1 and 2, with a point of inflection near 1.44 where it switches from the positive curvature to the negative one. As the shape parameter graduately increase from 2 to infinity, the curvature tends to zero below the axis.

The question why the “magic number 2.5” is picked. Based on Table 1 and first six graphs we draw the answer has appeared to be trivial. We would like to 1) For the shape parameter greater than 2.5 it is safe zone in term of slope;

2) For the shape parameter between 0 and 2.5 all the six curvature functions behaves very unstably. We found all six curvature functions except scale-shape curvature function have a dramatically changed value when the shape parameter is around to 2.01, 1.99, 1.01 or 0.99 [18- 20].

REFERENCES

  1. W.W. S. Chen, “Testing Gamma and Weibull Distribution: A Comparative Study,” Estadistica, Vol. 39, 1987, pp. 1-26.
  2. W. W. S. Chen, “Evaluation of the First 12 Derivatives of the Digamma PSI Functions with Applications,” Proceeding of Statistical Computing Section, 1982, pp. 293- 298.
  3. W. W. S. Chen, “Curvature Gaussian or Riemann,” International Conference (IISA), McMaster University, Hamilton, 10-11 October 1998.
  4. D. R. Cox, “Tests of Separate Families of Hypotheses,” Proceedings of 4th Berkeley Symposium, Vol. 1, 1961, pp. 105-123.
  5. D. R. Cox, “Further Results on Tests of Separate Families of Hypotheses,” Society B, Vol. 24, 1962, pp. 406-424.
  6. C. R. Rao, “Information and Accuracy Attainable in the Estimation of Statistical Parameters,” Bulletin of Calcutta Mathematical Society, Vol. 37, 1945, pp. 81-89.
  7. B. Efron, “Defining the Curvature of a Statistical Problem (with Applications to Second Order Efficiency),” Annals of Statistics, Vol. 3, No. 6, 1975, pp. 1189-1217. doi:10.1214/aos/1176343282
  8. A. F. S. Mitchell, “Statistical Manifolds of Univariate Elliptic Distributions,” International Statistical Review, Vol. 56, No. 1, 1988, pp. 1-16. doi:10.2307/1403358
  9. J. Burbea and C. R. Rao, “Entropy Differential Metric, Distance and Divergence Measures in Probability Spaces: A Unified Approach,” Journal Multivariate Analysis, Vol. 12, No. 4, 1982, pp. 575-596. doi:10.1016/0047-259X(82)90065-3
  10. L. T. Skovgaard, “A Riemannian Geometry of the Multivariate Normal Model,” Scandinavian Journal of Statistics, Vol. 11, No. 4, 1984, pp. 211-223.
  11. Y. Sato, K. Sugawa and M. Kawaguchi, “The Geometrical Structure of the Parameter Space of the Two-Dimensional Normal Distribution,” Reports on Mathematical Physics, Vol. 16, No. 1, 1979, pp. 111-119. doi:10.1016/0034-4877(79)90043-0
  12. R. E. Kass, “The Geometry of Asymptotic Inference,” Statistical Science, Vol. 4, No. 3, 1989, pp. 188-234.
  13. R. E. Kass and P. W. Vos, “Geometrical Foundations of Asymptotic Inference,” John Wiley & Sons, Inc., New York, 1997.
  14. A. C. Hearn, “REDUCE User’s and Contributed Packages Manual,” Version 3.7.
  15. N. Balakrishnan and W. Chen, “Handbook of Tables for Order Statistics from Gamma Distributions with Applications,” Kluwer Academic Publishers, Not Publish yet.
  16. N. L. Johnson and S. Kotz, “Continuous Univariate Distributions-1,” Houghton Mifflin Company, Boston, 1970.
  17. N. L. Johnson, S. Kotz and N. Balakrishnan, “Continuous Univariate Distributions,” 2nd Edition, John Wiley & Sons, Inc., New York, 1994.
  18. D. J. Struik, “Lectures on Classical Differential Geometry,” 2nd Edition, Dover Publications, Inc., New York, 1998.
  19. S. I. Goldberg, “Curvature and Homology,” Revised Edition, Dover Publications, Inc., New York, 1998.
  20. E. Kreyszig, “Differential Geometry,” Dover Publications, Inc., New York, 1991.

APPENDIX

The definition of the Christoffel Symbols of the first kind, in terms of the first partial derivatives of the components of the Riemannian metric tensor:

The Christoffel Symbols, of the second kind, are defined as the inner product of the inverse of the Riemannian metric tensor with the Christoffel Symbols of the first kind:

The Mixed Riemann curvature tensor is defined by:

The inner product of the Mixed Riemann curvature tensor and Riemannian metric tensor,

is called the Covariant Riemann curvature tensor. It is a covariant tensor of the fourth order. The components are also respectively known as Riemannian Symbols of the first and second kind. All listed formulae can be found in the following reference books Struik, D.J. [18] or Goldberg, S.I. [19].