Asymptotics and Well-Posedness of the Derived Distribution Density in a Study of Biovariability

doi:10.4236/am.2018.96046

Applied Mathematics
Vol.09 No.06(2018), Article ID:85617,19 pages
10.4236/am.2018.96046

Hongyun Wang¹, Wesley A. Burgei², Hong Zhou³

●How to Cite this Article

¹Department of Applied Mathematics and Statistics, University of California, Santa Cruz, CA, USA

²US Department of Defense, Joint Non-Lethal Weapons Directorate, Quantico, VA, USA

³Department of Applied Mathematics, Naval Postgraduate School, Monterey, CA, USA

This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).

http://creativecommons.org/licenses/by/4.0/

Received: May 3, 2018; Accepted: June 25, 2018; Published: June 28, 2018

ABSTRACT

In our recent work (Wang, Burgei, and Zhou, 2018) we studied the hearing loss injury among subjects in a crowd with a wide spectrum of heterogeneous individual injury susceptibility due to biovariability. The injury risk of a crowd is defined as the average fraction of injured. We examined mathematically the injury risk of a crowd vs the number of acoustic impulses the crowd is exposed to, under the assumption that all impulses act independently in causing injury regardless of whether one is preceded by another. We concluded that the observed dose-response relation can be explained solely on the basis of biovariability in the form of heterogeneous susceptibility. We derived an analytical solution for the distribution density of injury susceptibility, as a power series expansion in terms of scaled log individual non-injury probability. While theoretically the power series converges for all argument values, in practical computations with IEEE double precision, at large argument values, the numerical accuracy of the power series summation is completely wiped out by the accumulation of round-off errors. In this study, we derive a general asymptotic approximation at large argument values, for the distribution density. The combination of the power series and the asymptotics provides a practical numerical tool for computing the distribution density. We then use this tool to verify numerically that the distribution obtained in our previous theoretical study is indeed a proper density. In addition, we will also develop a very efficient and accurate Pade approximation for the distribution density.

Keywords:

Distribution of Individual Injury Susceptibility in a Crowd, Biovariability, Asymptotic Approximation, Pade Approximation

1. Introduction

Sound is an indispensable part of our life and we experience sound every day. A common way to measure the amount of sound is the decibel (dB) [1] . Sounds of less than 75 dB are at safe levels that do not damage our hearing. However, any sound above 85 dB is potentially harmful and can cause hearing loss. Examples of harmless sounds are normal conversation (60 dB), the humming of a refrigerator (45 dB) whereas harmful sounds include noise from lawn mowers (90 dB) and gun shots or firecrackers (both 150 dB) [2] . The risk of hearing damage depends on the power of the sound as well as the length of exposure.

Hearing loss is a common health problem among veterans. In order to protect warfighters, starting 1960s the US Army conducted and funded research to assess the risk of hearing loss caused by intense impulse noise from explosive blasts and weapon firings [3] . Recently Dr. Chan and his collaborators [4] developed a dose-response model for the assessment of injury caused by impulse noise and a model for the possible recovery afterwards, based on chinchilla data. Chinchillas share similar hearing capabilities as humans and thereby are commonly used for hearing-related experiments.

In [5] , we interpreted the empirical dose-response relation from [4] for exposure to multiple sound impulses in the framework of immunity. In [6] , we viewed the empirical dose-response relation from a completely different angle, in the framework of biovariability. Together in these two studies [5] [6] , we showed that it is possible to interpret the empirical dose-response relation from either of the two extreme cases: immunity or biovariability. Here we would like to further our study in [6] to demonstrate that the derived distribution density of injury susceptibility in [6] is well-posed.

2. Mathematical Formulation of the Problem

In experiments [4] , the injury risk of a crowd caused by a sound exposure event is described by the logistic dose-response relation:

$p = \frac{1}{1 + \exp (- α (S E L A - D_{50}))}$ . (1)

Here the dose of the sound exposure event is defined by the SELA (A-Weighted Sound Exposure Level) in units of dBA. The injury risk p of the crowd represents the average injury fraction of the crowd. In the logistic dose response model (1), the parameter α determines the steepness of function p while D₅₀ denotes the median injury dose. For a crowd, the median injury dose is the dose level at which half of the population is expected to be injured. For a crowd of subjects with a wide spectrum of heterogeneous individual injury probabilities, at the apparent median injury dose, a particular subject's individual injury probability may be below or above 50% due to biovariability. For injury of permanent threshold shift (PTS) > 25 dB, the values of parameters α and D₅₀ are found to be $α = 0.1$ and $D_{50} = 161$ dB, respectively [4] . It was also noticed that the parameter value α remains unchanged ( $α = 0.1$ ) for PTS injuries of all cut-off levels whereas the median injury dose D₅₀ rises with the PTS cut-off level [4] .

In the framework of the logistic dose-response relation, the injury risk of the crowd caused by a sequence of N acoustic impulses is given by the expression

${p |}_{(N impulses)} = \frac{1}{1 + e x p (- α (S_{comb} - D_{50}))}$ . (2)

where $S_{comb}$ is the effective combined dose for the whole sequence of impulses as a single sound exposure event. For a sequence of N identical impulses each with SELA value S, the effective combined dose, $S_{comb}$ , was observed to follow the dose combination rule [4] :

$S_{comb} = S + λ l o g_{10} N, λ = 3.44$ . (3)

Thus, for a sequence of N impulses each with SELA value S, the injury risk takes the form

${p |}_{(N impulses)} = \frac{1}{1 + e x p (- α (S - D_{50} + λ l o g_{10} N))} \equiv \frac{1}{1 + {(a N^{η})}^{- 1}}$ . (4)

where parameters a and η are defined as

$a \equiv e x p (α (S - D_{50})), η \equiv \frac{α λ}{l n 10} = 0.1494$ . (5)

Parameter a is related to probability ${p |}_{(1impulse)}$ by

$a = \frac{p |_{(1impulse)}}{1 - p |_{(1impulse)}}$ . (6)

That is, parameter a is the injury odds of a hypothetical subject in the crowd with the average injury probability, ${p |}_{(1impulse)}$ , in responding to a single acoustic impulse.

In [6] under the assumption that N acoustic impulses act independently from each other in causing injury, regardless of whether one is preceded by another (i.e., no immunity effect), we explored the possibility of interpreting the observed logistic dose-response relation for a crowd in the framework of biovariability. For mathematical convenience, we consider non-injury probability instead of injury probability. Let $q (ω)$ denote the individual non-injury probability of a random subject in the crowd, in responding to one acoustic impulse. Here $q (ω)$ is a random variable, due to the presence of biovariability. Let $ρ (q)$ be the distribution density of random variable $q (ω)$ . Mathematically in the framework of biovariability, the average non-injury fraction for N acoustic impulses is expressed in terms of $ρ (q)$ as

${(Non-injury fraction) |}_{(N impulses)} = E [q {(ω)}^{N}] = \int_{0}^{1} q^{N} ρ (q) d q$ . (7)

On the other hand, experimentally, the injury risk was observed to follow the logistic dose-response relation (4), which relates to the average non-injury fraction as

${(Non-injury fraction) |}_{(N impulses)} = {1 - p |}_{(N impulses)} = 1 - \frac{1}{1 + {(a N^{η})}^{- 1}} = \frac{1}{1 + a N^{η}}$ . (8)

For the theoretical model of biovariability to reproduce the experimentally observed results, the distribution density $ρ (q)$ has to satisfy an equation obtained by combining (7) and (8), which we write out below:

$\int_{0}^{1} q^{N} ρ (q) d q = \frac{1}{1 + a N^{η}}, N = 1,2, \dots$ . (9)

In [6] we solved Equation (9) analytically by constructing a power series

expansion in new variable $s = - a^{\frac{- 1}{η}} \ln (q)$ . Since the non-injury probability $q (ω)$ is constrained to interval $(0,1)$ , variable $s = - a^{\frac{- 1}{η}} \ln (q)$ has the domain $(0, + \infty)$ . The distribution density of random variable $s (ω) = - a^{\frac{- 1}{η}} l n (q (ω))$ is

$g (s) \equiv a^{\frac{1}{η}} ρ (\exp (- a^{\frac{1}{η}} s)) \exp (- a^{\frac{1}{η}} s)$ . (10)

For density $g (s)$ , Equation (9) becomes

$\int_{0}^{+ \infty} \exp (- (a^{\frac{1}{η}} N) s) g (s) d s = \frac{1}{1 + {(a^{\frac{1}{η}} N)}^{η}}$ . (11)

In [6] we derived a power series solution for $g (s)$ :

$g (s) = \sum_{k = 0}^{+ \infty} \frac{{(- 1)}^{k}}{Γ (η (k + 1))} s^{η (k + 1) - 1}$ . (12)

In power series (12), the gamma function $Γ (η (k + 1))$ in the denominator of the coefficient grows faster than any exponential function of k. As a result, power series (12) converges for all values of s. It follows that function $g (s)$ is well defined by power series (12) for all values of s. To be a proper density function, however, $g (s)$ must satisfy the two properties below:

$g (s) \geq 0$ (13)

$\int_{0}^{+ \infty} g (s) d s = 1$ . (14)

In [6] we rigorously proved these two properties for the special case of $η = \frac{1}{2}$ and the special case of $η = \frac{1}{3}$ . The analysis procedure differs quite significantly between these two special cases. It is highly unlikely that the particular analysis approach used in either of these two special cases can be directly extended to the general case of arbitrary η. In the current study, we aim at verifying semi-analytically the two properties for any given value of η. For that purpose, we need the numerical capability of calculating function $g (s)$ for all values of s, from small to large. Power series (12) has the nice property that theoretically it converges for all values of s. In practical computations, however, at large values of s, the numerical accuracy of the power series summation is completely wiped out by the accumulation of round-off errors. In the power series summation, as s increases the net sum decreases while the magnitude of the largest term grows exponentially with s [6] . The combined effect of these two factors magnifies catastrophically, at large s, the influence of round-off errors on the numerical accuracy of the net sum. For practical computation of $g (s)$ in finite precision arithmetic, we need a robust numerical formula of $g (s)$ at large s. In the next section, we derive a general asymptotic approximation of $g (s)$ at large s. The synthesis of the power series and the asymptotics will provide a practical numerical tool for computing the distribution density $g (s)$ for all values of s at any given value of parameter η.

3. Asymptotics of g(s) at Large s

Now we derive a general asymptotic approximation of $g (s)$ at large s when η is a rational number. We then reasonably conjecture that the same asymptotic approximation is also valid even when η is irrational. In practical computation of $g (s)$ , the case of irrational η actually does not apply since all numerical calculations are carried out in finite precision arithmetic, using only rational numbers.

A rational number η takes the form $η = \frac{m}{n}$ where both m and n are positive

integers. We rewrite the power series of $g (s)$ in terms of the reciprocal gamma function as follows:

$g (s) = \sum_{k = 0}^{+ \infty} {(- 1)}^{k} f (\frac{m}{n} (k + 1)) s^{\frac{m}{n} (k + 1) - 1}$ (15)

where $f (z)$ is the reciprocal gamma function defined as

$f (z) \equiv \frac{1}{Γ (z)}$ . (16)

The advantage of working with the reciprocal gamma function $f (z)$ is that it is well defined and is analytic everywhere. In comparison, the gamma function $Γ (z)$ diverges at all non-positive integer values of z. The reciprocal gamma function $f (z)$ has the property

$(z - 1) f (z) = \frac{z - 1}{Γ (z)} = \frac{1}{Γ (z - 1)} = f (z - 1)$ . (17)

Using this convenient property when differentiating $g (s)$ , we have

$\frac{d}{d s} g (s) = \sum_{k = 0}^{+ \infty} {(- 1)}^{k} f (\frac{m}{n} (k + 1) - 1) s^{\frac{m}{n} (k + 1) - 2}$ . (18)

Differentiating $g (s)$ repeatedly m times, we obtain a differential equation for $g (s)$ .

$\begin{matrix} \frac{d^{m}}{d s^{m}} g (s) = \sum_{k = 0}^{+ \infty} {(- 1)}^{k} f (\frac{m}{n} (k + 1) - m) s^{\frac{m}{n} (k + 1) - m - 1} \\ = {(- 1)}^{n} \sum_{k^{'} = - n}^{+ \infty} {(- 1)}^{k^{'}} f (\frac{m}{n} (k^{'} + 1)) s^{\frac{m}{n} (k^{'} + 1) - 1} \\ = {(- 1)}^{n} [\sum_{k^{'} = - n}^{- 1} {(- 1)}^{k^{'}} f (\frac{m}{n} (k^{'} + 1)) s^{\frac{m}{n} (k^{'} + 1) - 1} + g (s)] \\ = {(- 1)}^{n} [- \sum_{\hat{k} = 1}^{n - 1} {(- 1)}^{\hat{k}} f (\frac{- \hat{k} m}{n}) s^{- (\frac{\hat{k} m}{n} + 1)} + g (s)] \end{matrix}$ (19)

We derive an asymptotic approximation of $g (s)$ at large s based on this differential equation. We proceed with the assumption that $g (s)$ converges to 0 as s goes to $+ \infty$ . This assumption is reasonable although not directly derivable from the power series form of $g (s)$ . Under this assumption, the asymptotics of $g (s)$ at large s takes the form

$g (s) ~ \sum_{j = 1} b_{j} s^{- β_{j}}, 0 < β_{1} < β_{2} < β_{3} < \dots$ . (20)

Differentiating the asymptotic form m times, we get

$\frac{d^{m}}{d s^{m}} g (s) ~ {(- 1)}^{m} \sum_{j = 1} b_{j} \frac{f (β_{j})}{f (β_{j} + m)} s^{- (β_{j} + m)} = O (s^{- (β_{1} + m)})$ . (21)

In (21) the first term is of the order $O (s^{- (β_{1} + m)})$ while all other terms are asymptotically smaller than $O (s^{- (β_{1} + m)})$ . Using the result of (21), we rewrite (19) as

$O (s^{- (β_{1} + m)}) = [- \sum_{k = 1}^{n - 1} {(- 1)}^{k} f (\frac{- k m}{n}) s^{- (\frac{k m}{n} + 1)} + g (s)]$ (22)

which leads to

$g (s) = \sum_{k = 1}^{n - 1} {(- 1)}^{k} f (\frac{- k m}{n}) s^{- (\frac{k m}{n} + 1)} + O (s^{- (β_{1} + m)})$ . (23)

Equating the leading terms on both sides of (23) yields $β_{1} = \frac{m}{n} + 1$ . Substituting this value of $β_{1}$ back into (23) gives us

$g (s) = \sum_{k = 1}^{n - 1} {(- 1)}^{k} f (\frac{- k m}{n}) s^{- (\frac{k m}{n} + 1)} + O (s^{- (\frac{m}{n} + m + 1)})$ . (24)

Notice that in (24), the smallest term in the summation is $O (s^{- (\frac{- m}{n} + m + 1)})$ which occurs at $k = n - 1$ . Thus, all terms in the summation are indeed asymptotically larger than $O (s^{- (\frac{m}{n} + m + 1)})$ .

In summary, expression (24) gives an asymptotic approximation of $g (s)$ at large s, accurate up to the order $O (s^{- (m + 1)})$ . Recall that (24) is derived by differentiating each of power series (15) and asymptotic form (20) m times, and then combining the results. To derive a general asymptotic approximation, we differentiate each of the power series and the asymptotic form $(r \cdot m)$ times where r is a positive integer. In the above, asymptotics (24) is the outcome in the special case of $r = 1$ . In the general case, differentiating power series (15) $(r m)$ times yields.

$\begin{matrix} \frac{d^{(r m)}}{d s^{(r m)}} g (s) = \sum_{k = 0}^{+ \infty} {(- 1)}^{k} f (\frac{m}{n} (k + 1) - (r m)) s^{\frac{m}{n} (k + 1) - (r m) - 1} \\ = {(- 1)}^{(r n)} \sum_{k^{'} = - (r n)}^{+ \infty} {(- 1)}^{k^{'}} f (\frac{m}{n} (k^{'} + 1)) s^{\frac{m}{n} (k^{'} + 1) - 1} \\ = {(- 1)}^{(r n)} [\sum_{k^{'} = - (r n)}^{- 1} {(- 1)}^{k^{'}} f (\frac{m}{n} (k^{'} + 1)) s^{\frac{m}{n} (k^{'} + 1) - 1} + g (s)] \\ = {(- 1)}^{(r n)} [- \sum_{\hat{k} = 1}^{(r n) - 1} {(- 1)}^{\hat{k}} f (\frac{- \hat{k} m}{n}) s^{- (\frac{\hat{k} m}{n} + 1)} + g (s)] \end{matrix}$ (25)

Differentiating asymptotic form (20) $(r m)$ times, we get

$\frac{d^{(r m)}}{d s^{(r m)}} g (s) ~ {(- 1)}^{(r m)} \sum_{j = 1} b_{j} \frac{f (β_{j})}{f (β_{j} + (r m))} s^{- (β_{j} + (r m))} = O (s^{- (\frac{m}{n} + (r m) + 1)})$ (26)

where we have used $β_{1} = \frac{m}{n} + 1$ . Combining (25) and (26), we obtain

$\begin{matrix} g (s) = \sum_{k = 1}^{(r n) - 1} {(- 1)}^{k} f (\frac{- k m}{n}) s^{- (\frac{k m}{n} + 1)} + O (s^{- (\frac{m}{n} + (r m) + 1)}) \\ ~ \sum_{k = 1} {(- 1)}^{k} f (\frac{- k m}{n}) s^{- (\frac{k m}{n} + 1)} \end{matrix}$ (27)

Since integer r can be made as large as we like, in (27) we simply use the infinite series as a symbolic general asymptotics, with the understanding that a particular asymptotic approximation will use only a partial sum of the infinite series. We need to point out that the infinite series in (27) actually diverges for all values of s. So the infinite summation does not have a well defined sum. Instead, the infinite series serves only as a symbolic general asymptotics. Summation of moderate number of terms, however, will provide an accurate asymptotic approximation of $g (s)$ at moderately large s and beyond. We will examine the approximation errors in details later. In the analysis above, we did not assume that integers m and n are prime to each other. It turns out that asymptotics (27) can be written in terms of η only, without any reference to m or n. The expression in terms of η gives us the general asymptotics, which does not depend on a particular rational form of η:

$g (s) ~ \sum_{k = 1} {(- 1)}^{k} f (- k η) s^{- (k η + 1)}$ . (28)

The asymptotic expansion (28) depends only on η and is invariant with respect to different rational representations of η. It is plausible to conjecture that asymptotics (28) is also valid for irrational η although it is derived in the case of rational η. This conjecture cannot be tested numerically since all computations use a finite precision number representation system, which is a subset of all rational numbers. In the next section, in preparation for the numerical verification of properties (13) and (14), we build the necessary numerical tools for computing function $g (s)$ .

4. Accurate Evaluation of g(s) in Finite Precision

In this section, we develop a practical numerical method for computing $g (s)$ in IEEE double precision, over the whole range of s and at any given value of parameter η.

Function $g (s)$ is defined straightforwardly by power series (12). Theoretically $g (s)$ can be evaluated as accurately as we like by including sufficiently large number of terms in summation and carrying out the computation in arithmetic of sufficiently high numerical precision. Practically, with the IEEE double precision arithmetic, the numerical accuracy of the power series summation, at large s, is completely ruined by the round-off errors from terms of the largest magnitude. In [6] we showed that the largest term grows roughly exponentially with s and it has the behavior.

$\underset{k}{m a x} | \frac{{(- 1)}^{k}}{Γ (η (k + 1))} s^{η (k + 1) - 1} | \approx \frac{e^{s}}{\sqrt{2 π s}}$ (29)

Even at a moderate large value of $s = 40$ , the largest term in the summation is more than 10¹⁶, which in general will pollute the numerical value of $g (s)$ with an error of magnitude 1 or bigger. Thus, at large s, the power series summation is not a workable numerical tool for accurately calculating $g (s)$ in finite precision arithmetic.

The infinite series in the asymptotic expansion (28) diverges for all values of s. As a result, it does not make sense to include in the asymptotic approximation a very large number of terms from (28). When a moderate number of terms are used, however, the partial sum of (28) provides an accurate approximation of $g (s)$ at moderately large s and beyond. For a fixed number of terms, the larger the value of s is, the better the approximation. Therefore, at large s, function $g (s)$ can be evaluated fairly accurately by employing an asymptotic approximation with a suitable number of terms.

The contrast and complementary behaviors of the power series around $s = 0$ and the asymptotics at large s suggest that a viable numerical strategy is to use the power series summation for small s and switch to the asymptotic approximation when s is above a threshold s_sw, which is yet to be specified. The success of this numerical strategy depends on that there is an overlapping region of intermediate s in which both the power series summation and the asymptotic approximation will yield reasonably good accuracy. Without this intermediate region, if the valid region of the power series summation is separated by a gap from the valid region of the asymptotic approximation, $g (s)$ cannot be calculated accurately in the gap region. The existence of an overlapping region of intermediate s also provides us a numerical mechanism for identifying the overlapping region and selecting an optimal threshold s_sw for switching from one numerical formula to the other.

In the overlapping region, both the power series summation and the asymptotic approximation are reasonably accurate. Accordingly, the difference between the two numerical formulas should be fairly small inside the overlapping region. Below or above the overlapping region, only one of the numerical formula is very accurate while the other is not. Consequently, outside the overlapping region, the difference between the two will be significantly more pronounced than inside the overlapping region. To identify the overlapping region, we examine the difference between the power series summation and the asymptotic approximation as s increases from small values to large values. The magnitude of the minimum difference indicates the existence (or non-existence) of the overlapping region; the location of the minimum difference suggests an optimal threshold s_sw for switching. To proceed along this line, we introduce two notations.

• $g^{(P S)} (s)$ = power series summation (12)

• $g^{(A A)} (s)$ = asymptotic approximation (28) with terms up to $O (s^{- (N_{g} + 1)})$ ...

Throughout this paper, all numerical results are computed in IEEE double precision arithmetic. In this section, simulations are focused on the case of $η = 0.1494$ , the observed value of parameter η in experiments. We will explore other values of η in subsequent sections.

In Figure 1, we plot the difference between $g^{(P S)} (s)$ and $g^{(A A)} (s)$ as a function of s for several different values of N_g. Three asymptotic approximations respectively with $N_{g} = 9$ , $N_{g} = 12$ and $N_{g} = 15$ are tested. For all 3 values of N_g, especially for $N_{g} = 12$ and $N_{g} = 15$ , Figure 1 demonstrates clearly the existence of an overlapping valid region for the two numerical formulas. For s values smaller than 17, there is a visible discrepancy among the three curves because in this region $| g^{(P S)} (s) - g^{(A A)} (s) |$ is primarily attributed to the approximation error in $g^{(A A)} (s)$ which depends heavily on N_g when s is not very large. For s values bigger than 17, the three curves almost coincide with each other due to the dominant effect of round-off errors in $g^{(P S)} (s)$ which is independent of N_g. The overlapping region is in a neighborhood around [5] [6] . The asymptotic approximation with $N_{g} = 12$ has the best performance since it reaches the lowest minimum difference and attains the minimum difference at a smaller value of s, indicating that it is already valid even when s is not very large. In our subsequent simulations, we shall select N_g using this strategy. For $N_{g} = 12$ , Figure 1 suggests that an optimal threshold s_sw for switching from power series summation to asymptotic approximation is about $s_{s w} = 15$ . Based computing function $g (s)$ .

Figure 1. Difference between the power series summation $g^{(P S)} (s)$ and the asymptotic expansion $g^{(A A)} (s)$ for various values of N_g.

Numerical procedure:

• For $s \leq s_{s w}$ , $g (s)$ is computed using the partial sum of the first K_f terms in power series summation (12).

$g (s) \approx \sum_{k = 0}^{K_{f}} \frac{{(- 1)}^{k}}{Γ (η (k + 1))} s^{η (k + 1) - 1}, for s \leq s_{s w}$ (30)

on these numerical findings we adopt the numerical procedure below for where K_f is the number of terms needed to make the truncation error well below the machine precision of IEEE double precision. In computations, we make the truncation error smaller than 10⁻²⁰. Theoretically, K_f has an a priori estimate expressed in terms of s when s is moderately large. In practice, K_f is determined automatically in the numerical summation process by monitoring the magnitude of terms.

• For $s \geq s_{s w}$ , $g (s)$ is calculated using the partial sum of terms up to $O (s^{- (N_{g} + 1)})$ in the asymptotic approximation (28).

$g (s) \approx \sum_{\begin{matrix} k = 1 \\ k η \leq N_{g} \end{matrix}} {(- 1)}^{k} f (- k η) s^{- (k η + 1)}, for s \geq s_{s w}$ (31)

The choice of $s_{s w} = 15$ and $N_{g} = 12$ above is based on numerical minimization of $| g^{(P S)} (s) - g^{(A A)} (s) |$ with respect to $(s, N_{g})$ in the case of $η = 0.1494$ . This particular set of $(s_{s w}, N_{g})$ is for computing function $g (s)$ at $η = 0.1494$ . In a similar fashion, at each different value of η, an individual set of $(s_{s w}, N_{g})$ is determined for that η and then used in evaluating $g (s)$ .

In the next section, we apply the numerical procedure described above to verify properties (13) and (14) numerically, and thus demonstrate the well-posedness of the distribution density.

5. Numerical Verification of Well-Posedness

We first verify that $g (s)$ defined by power series (12) is positive for all values of $s > 0$ at any parameter value of $η > 0$ . To examine both the sign and the magnitude of $g (s)$ , we use mapping $D (z)$ , defined below, to display $g (s)$ as $D (g (s))$ . Let

$D (z) \equiv z_{0} \sinh^{- 1} (\frac{z}{z_{0}})$ . (32)

where $z_{0} > 0$ is a design parameter depending on the range we like to focus on.

The mapping $D (z)$ has several design features for showing the sign and for accommodating a huge range over many orders of magnitude.

• $D (z)$ is an odd function of z, clearly showing the sign of z.

• $D (z)$ is a monotonically increasing function of z, preserving any trend of z.

• When $| z |$ is significantly below $z_{0}$ , the mapping $D (z)$ displays z in a linear scale:

$D (z) \approx z for | z | ≪ z_{0}$ .

• When $| z |$ is significantly above $z_{0}$ , the mapping $D (z)$ displays z in a logarithmic scale:

$D (z) \approx z_{0} \ln (\frac{2 z}{z_{0}}) for z ≫ z_{0}$ .

We calculate $g (s)$ vs s numerically for several representative values of $η > 0$ , and plots $D (g (s))$ in Figure 2 with $z_{0} = 10^{- 4}$ . Function $g (s)$ is positive for all values of η we examined.

Next we verify that $\int_{0}^{+ \infty} g (s) d s = 1$ for parameter $η > 0$ . We integrate the power series (12) to write out the cumulative distribution function (CDF), which becomes

$G (s) \equiv \int_{0}^{s} g (u) d u = \sum_{k = 0} \frac{{(- 1)}^{k}}{Γ (η (k + 1) + 1)} s^{η (k + 1)}$ . (33)

Again, theoretically power series (33) converges for all s, making $G (s)$ a well defined function for all s. But in numerical computations with IEEE double precision, at large s, power series summation (33) suffers catastrophically from complete loss of accuracy. As a result, using the power series summation to compute $G (s)$ at large s is not viable for demonstrating $\lim_{s \to + \infty} G (s) = 1$ . Instead, we consider the complementary cumulative distribution function at large s, defined as

$G^{(C)} (s) \equiv \int_{s}^{+ \infty} g (u) d u$ . (34)

Figure 2. Plots of $D (g (s))$ for several values of parameter η. The mapping $D (z)$ defined in (32) is designed for showing the sign and for accommodating a wide range of quantity z. The plots demonstrate that function $g (z)$ is positive for all values of parameter η tested.

To verify $\int_{0}^{+ \infty} g (s) d s = 1$ , we only need to demonstrate numerically that $G (s) + G^{(C)} (s) = 1$ at some value of s. This allows us to select a value of s such that both $G (s)$ and $G^{(C)} (s)$ can be computed accurately.

For $s \leq s_{s w}$ , $G (s)$ can be accurately calculated using a partial sum of power series (33)

$G (s) \approx \sum_{k = 0}^{K_{f}} \frac{{(- 1)}^{k}}{Γ (η (k + 1) + 1)} s^{η (k + 1)}, for s \leq s_{s w}$ . (35)

For $s \geq s_{s w}$ , $g (s)$ is well approximated by asymptotics (28). Using a partial sum of (28) with terms up to $O (s^{- (N_{g} + 1)})$ to replace $g (s)$ in the integral of $G^{(C)} (s)$ , we write out an asymptotic approximation for $G^{(C)} ( s )$

$G^{(C)} (s) \approx - \sum_{\begin{matrix} k = 1 \\ k η \leq N_{g} \end{matrix}} {(- 1)}^{k} f (- k η + 1) s^{- k η}, for s \geq s_{s w}$ . (36)

Results (35) and (36) suggest that quantity $\int_{0}^{+ \infty} g (u) d u = G (s) + G^{(C)} (s)$ has the optimal numerical accuracy if we evaluate it at $s = s_{s w}$ . Thus, we compute quantity $| G (s_{s w}) + G^{(C)} (s_{s w}) - 1 |$ and use it to judge if $\int_{0}^{+ \infty} g (u) d u = 1$ is satisfied.

Figure 3 plots $| G (s_{s w}) + G^{(C)} (s_{s w}) - 1 |$ vs parameter η. It is clear that for any parameter value η in $(0,1)$ , the assertion $\int_{0}^{+ \infty} g (u) d u = 1$ is indeed valid within the numerical approximation error.

Figure 3. The difference between $\int_{0}^{\infty} g (u) d u$ and 1 for different values of η.

In summary, we have numerically verified 1) $g (s) > 0$ for $s > 0$ and 2) $\int_{0}^{+ \infty} g (s) d s = 1$ . It follows that function $g (s)$ defined by power series (12) is mathematically a proper distribution density.

With the property $G (+ \infty) = 1$ established, we write out a unified numerical procedure for computing function $G (s)$ over the full range of $s \in (0, + \infty)$ ,

$G (s) \approx {\begin{array}{l} \sum_{k =0}^{K_{f}} {(- 1)}^{k} f (η (k + 1) + 1) s^{η (k + 1)} & for s \leq s_{s w} \\ \sum_{\begin{matrix} k =0 \\ k η \leq N_{g} \end{matrix}} {(- 1)}^{k} f (- k η + 1) s^{- k η} & for s \geq s_{s w} \end{array}$ (37)

For readers’ convenience, we also summarize below the unified numerical procedure for computing function $g (s)$ over the full range of $s \in (0, + \infty)$ ,

$g (s) \approx {\begin{array}{l} \sum_{k = 0}^{K_{f}} {(- 1)}^{k} f (η (k + 1)) s^{η (k + 1) - 1} & for s \leq s_{s w} \\ \sum_{\begin{matrix} k = 1 \\ k η \leq N_{g} \end{matrix}} {(- 1)}^{k} f (- k η) s^{- (k η + 1)} & for s \geq s_{s w} \end{array}$ (38)

6. Pade Approximations

In the previous section, we verified $G (+ \infty) = 1$ . We now impose this property as a constraint at $s = + \infty$ and construct a Pade approximation [7] [8] for $G (s)$ based on its power series around $s = 0$ . As we will see, the Pade approximation provides an accurate and efficient approximation over the full range of $s \in (0, + \infty)$ .

The power series of $G (s)$ , given in (33), consists of integer powers of $s^{η}$ . Accordingly, we use integer powers of $s^{η}$ in constructing the Pade approximation. For mathematical convenience, we write power series (33) in terms of $x = s^{η}$ , in an abstract form

$G (s) = \sum_{k = 1}^{+ \infty} c_{k} x^{k}, x = s^{η}, c_{k} = \frac{{(- 1)}^{k + 1}}{Γ (k η + 1)}$ . (39)

Note that one can set $c_{0} = 0$ and start the summation at $k = 0$ in (39). Taking these features of function $G (s)$ into consideration, we adopt a Pade approximation of order $[n / n]$ , of the form

$R (s; n) = \frac{a_{1} x + a_{2} x^{2} + \dots + a_{n - 1} x^{n - 1} + x^{n}}{b_{0} + b_{1} x + b_{2} x^{2} + \dots + b_{n - 1} x^{n - 1} + x^{n}}, x = s^{η}$ . (40)

where $a_{0} = 0$ follows from $c_{0} = 0$ , and $a_{n} = b_{n} = 1$ follows from $G (+ \infty) = 1$ and normalization. There are $(2 n - 1)$ unknown coefficients in Pade approximation (40). To determine these coefficients, we multiply both (40) and (39) by $\sum_{k = 0}^{n} b_{k} x^{k}$ , and then match the $x^{k}$ terms for $1 \leq k \leq (2 n - 1)$ . The product of two power series has the expression:

$(\sum_{k = 0} b_{k} x^{k}) (\sum_{k = 0} c_{k} x^{k}) = \sum_{k = 0} (\sum_{j = 0}^{k} b_{j} c_{k - j}) x^{k}$ .

For k in the range of $n \leq k \leq (2 n - 1)$ , equating the corresponding coefficients of $x^{k}$ terms on the left-hand side and on the right-hand sides yields equations for the unknowns ${b_{k}}$ .

$\sum_{j = 0}^{n - 1} b_{j} c_{k - j} = w_{k}, k = n, n + 1, \dots, (2 n - 1)$ (41)

where $w_{k}$ is known and has the expression

$w_{k} = {\begin{array}{l} 1, & k = n \\ - c_{k - n}, & (n + 1) \leq k \leq (2 n - 1) \end{array}$ (42)

Equation (41) is an $n \times n$ linear system for ${b_{0}, b_{1}, \dots, b_{n - 1}}$ . Coefficients ${b_{k}}$ are determined by solving linear system (41). Once coefficients ${b_{k}}$ are known, we write out coefficients ${a_{k}}$ by matching the coefficients of $x^{k}$ terms for $1 \leq k \leq (n - 1)$ .

$a_{k} = \sum_{j = 0}^{k - 1} b_{j} c_{k - j}, k = 1, 2, \dots, (n - 1)$ (43)

To estimate the error of Pade approximation $R (s; n)$ defined in (40), we use $G (s)$ computed with the unified numerical procedure (37) as the “exact” solution to compare with. We calculate the difference between the numerical value of $G (s)$ and the Pade approximation $R (s; n)$ . Figure 4 shows $| G (s) - R (s; n) |$ vs s for parameter value $η = 0.1494$ . Four Pade approximations, respectively with n = 3, 4, 5, and 6, are shown where n is the highest power used in Pade approximation (40). For n = 4, the approximation error of $R (s; n)$ is already below 10⁻⁸, which is similar to the errors of both the power series summation and

Figure 4. Discrepancy between $G (s)$ and Pade approximation $R (s; n)$ .

the asymptotic approximation in the overlapping region around $s = 15$ . The numerical error of procedure (37) varies with the magnitude of s. The numerical error is the largest in the overlapping region: below the overlapping region, the power series summation is less polluted by the round-off errors and thus is more accurate; above the overlapping region, the asymptotic approximation becomes more accurate. The numerical accuracy of $G (s)$ calculated using (37) is significantly higher than 10⁻⁸ when s is outside the overlapping region. This property of $G (s)$ will help us decipher the error behavior in higher order Pade approximations.

When n is increased to $n = 5$ , the difference $| G (s) - R (s; n) |$ is below 10⁻¹⁰ outside the overlapping region, implying that the error of Pade approximation is also below 10⁻¹⁰ outside the overlapping region. The difference $| G (s) - R (s; n) |$ increases significantly in the overlapping region. However, it is highly unlikely that the approximation error of Pade approximation jumps significantly only in the overlapping region while remaining below 10⁻¹⁰ outside the overlapping region. The Pade approximation consists of one rational function for all values of s; it does not involve any switching. It is much more likely that the approximation error of Pade approximation actually remains below 10⁻¹⁰ over the full range of s; the significant increase in $| G (s) - R (s; n) |$ is solely caused by the increased numerical error of $G (s)$ in the overlapping region. If this is true, then for $n = 5$ , the Pade approximation is already more accurate than the unified numerical procedure (37) in IEEE double precision. The smaller numerical error of the Pade approximation is mainly attributed to that it has only a few terms, and subsequently, is much less affected by round-off errors in IEEE double precision. For $n = 6$ , Figure 4 shows that the increase of $| G (s) - R (s; n) |$ near the overlapping region is much more pronounced than in the case of $n = 5$ . The pattern of increase strongly suggests that it is caused by the increased numerical error of $G (s)$ near the overlapping region. Figure 4 indicates that the true numerical error in Pade approximation is very likely below 10⁻¹² throughout the full range of s, much more accurate than the unified numerical procedure (37) in IEEE double precision.

We carry out single precision computations to support the assertion we made above that in finite precision arithmetic, Pade approximations can be more accurate than the power series even though the power series is theoretically exact in infinite precision arithmetic. We use the double precision result of $G (s)$ as the exact solution to compare with. We compute power series summation, asymptotics, and Pade approximations in single precision, and then examine the numerical errors in single precision results. Figure 5 shows the error behaviors of single precision results. As s increases, the power series summation starts losing accuracy due to the exponential growth of the largest term and the associated round-off error in summation. Meanwhile, as s increases, the approximation error in asymptotics decreases and its numerical accuracy improves. In contrast, the numerical errors in Pade approximations remain fairly steady with respect to s and decays very rapidly as n is increased. Pade approximation $R (s;3)$ is already significantly more accurate than both the power series summation and the asymptotics in a large neighborhood of the overlapping region (for single precision arithmetic, the overlapping region is around $s = 6$ ). It is evident in Figure 5 that the numerical error of $R (s;4)$ is primarily caused by round-off errors and its true approximation error is below the machine epsilon of single precision

Figure 5. Errors of power series, asymptotics and Pade approximations in single precision computations.

system. In single precision, the actually realized numerical accuracy of $R (s;4)$ is uniformly much higher than that of both the power series summation and the asymptotics over the full range of s.

Next, we construct a Pade approximation for density $g (s)$ . Power series of $g (s)$ around $s = 0$ has the form

$g (s) = η s^{η - 1} \frac{d G (s (x))}{d x} = s^{η - 1} \sum_{k = 0}^{+ \infty} {\tilde{c}}_{k} x^{k}$

$x = s^{η}, {\tilde{c}}_{k} = \frac{{(- 1)}^{k}}{Γ (η (k + 1))}$ . (44)

Note that since $l i m_{x \to + \infty} G (s (x)) = 1$ , we have $l i m_{x \to + \infty} \frac{d}{d x} G (s (x)) = 0$ in

(44), which suggests us to adopt a Pade approximation of order $[(n - 1) / n]$ , of the form

$r (s; n) = s^{η - 1} (\frac{{\tilde{a}}_{0} + {\tilde{a}}_{1} x + \dots + {\tilde{a}}_{n - 1} x^{n - 1}}{{\tilde{b}}_{0} + {\tilde{b}}_{1} x + \dots + {\tilde{b}}_{n - 1} x^{n - 1} + x^{n}}), x = s^{η}$ . (45)

There are 2n unknown coefficients in the Pade approximation of $g (s)$ . To determine the coefficients, we multiply both (45) and (44) by $\sum_{k = 0} {\tilde{b}}_{k} x^{k}$ , match the coefficients of $x^{k}$ terms for $0 \leq k \leq (2 n - 1)$ to form a linear system, and then solve for the unknowns, following a procedure similar to the one used in the Pade approximation of $G (s)$ .

To assess the error of Pade approximation $r (s; n)$ given in (45), we use $g (s)$ computed with the unified numerical procedure (38) as the “exact” solution to compare with. Figure 6 shows $| g (s) - r (s; n) |$ vs s for parameter value $η = 0.1494$ . Four Pade approximations, respectively with, n = 3, 4, 5, and 6, are shown where n is the highest power used in Pade approximation (45). The behaviors of the Pade approximations for density $g (s)$ are similar to those for the CDF $g (s)$ . In IEEE double precision, the actually realized numerical accuracy of the Pade approximations with $n = 5$ and $n = 6$ is significantly better than that of numerical procedure (38). Again, in IEEE double precision, the smaller numerical error of the Pade approximations is mainly attributed to the fact that it contains only a few terms, and its numerical results are much less contaminated with round-off errors.

7. Conclusion

We studied the biovariability of a crowd for hearing loss injury, in the form of heterogeneous injury susceptibility. We constructed a unified numerical procedure for computing the distribution density of injury susceptibility that reproduces the observed logistic dose-response relation in a crowd. The unified procedure combines the advantage of power series expansion for small values of argument and the advantage of asymptotic approximation for large values of argument. It switches between these two approaches to achieve a numerical

Figure 6. Differences between density $g (s)$ and its Pade approximations $r (s; n)$ .

accuracy of 10⁻⁸ or better with IEEE double precision, over the full range of argument. Using this unified procedure, we verified numerically that for all parameter values, the derived distribution density, i) is non-negative everywhere and ii) integrates to one. These results establish numerically that the derived distribution is indeed a proper density for all values of parameter, and thus, is well-posed. Furthermore, we developed efficient and accurate Pade approximations for the distribution density and for the cumulative distribution function. In the computational environment of IEEE double precision, Pade approximations actually yield a much higher realized numerical accuracy than that of both the asymptotic approximation for large argument value and the power series for small argument value. The superior performance of Pade approximations is attributed to the fact that it attains high theoretical accuracy with only a few terms, which leads to less contamination with round-off errors and better realized numerical accuracy. In conclusion, we verified numerically that the observed logistic dose-response relation can be explained solely based on a valid distribution of injury susceptibility. Rigorous proof of the well-posedness of the derived distribution density, however, still remains open.

8. Disclaimer

The authors thank the Joint Non-Lethal Weapons Directorate of US Department of Defense for supporting this work. The views expressed in this document are those of the authors and do not reflect the official policy or position of the Department of Defense or the US Government.

Cite this paper

Wang, H.Y., Burgei, W.A. and Zhou, H. (2018) Asymptotics and Well-Posedness of the Derived Distribution Density in a Study of Biovariability. Applied Mathematics, 9, 672-690. https://doi.org/10.4236/am.2018.96046

References

1. Glodsmith, M. (2015) Sound: A Very Short Introduction. Oxford University Press, Oxford. https://doi.org/10.1093/actrade/9780198708445.001.0001

2. https://www.nidcd.nih.gov/health/noise-induced-hearing-loss

3. Murphy, W.J., Khan, A. and Shaw, P.B. (2011) Analysis of Chinchilla Temporary and Permanent Threshold Shifts Following Impulsive Noise Exposure. https://www.cdc.gov/niosh/surveyreports/pdfs/338-05c.pdf

4. Chan, P., Ho, K. and Ryan, A.F. (2016) Impulse Noise Injury Model. Military Medicine, 181, 59-69. https://doi.org/10.7205/MILMED-D-15-00139

5. Wang, H., Burgei, W.A. and Zhou, H. (2017) Interpreting Dose-Response Relation for Exposure to Multiple Sound Impulses in the Framework of Immunity. Health, 9, 1817-1842. https://doi.org/10.4236/health.2017.913132

6. Wang, H., Burgei, W.A. and Zhou, H. (2018) Risk of Hearing Loss Caused by Multiple Acoustic Impulses in the Framework of Biovariability. Health, 10, Article ID: 84786. https://doi.org/10.4236/health.2018.105048

7. Bush, A.W. (1992) Perturbation Methods for Engineers and Scientists. CRC Press, Boca Raton.

8. Hinch, E.J. (1991) Perturbation Methods. Cambridge University Press, New York. https://doi.org/10.1017/CBO9781139172189

Journal Menu>>