Variance-Gamma approximation via Stein's method
Robert E. Gaunt
Introduction
In 1972, Stein introduced a powerful method for deriving bounds for normal approximation. Since then, this method has been extended to many other distributions, such as the Poisson , Gamma , , Exponential , and Laplace , . Through the use of differential or difference equations, and various coupling techniques, Stein’s method enables many types of dependence structures to be treated, and also gives explicit bounds for distances between distributions.
At the heart of Stein’s method lies a characterisation of the target distribution and a corresponding characterising differential or difference equation. For example, Stein’s method for normal approximation rests on the following characterization of the normal distribution, which can be found in Stein , namely if and only if
for all sufficiently smooth . This gives rise to the following inhomogeneous differential equation, known as the Stein equation:
where , and the test function is a real-valued function. For any bounded test function, a solution to (1.2) exists (see Lemma 2.4 of Chen et al. ). There are a number of techniques for obtaining Stein equations, such as the density approach of Stein et al. , the scope of which has recently been extended by Ley and Swan . Another commonly used technique is a generator approach, introduced by Barbour . This approach involves recognising the target the distribution as the stationary distribution of Markov process and then using the theory of generators of stochastic process to arrive at a Stein equation; for a detailed overview of this method see Reinert . Luk used this approach to obtain the following Stein equation for the distribution:
The next essential ingredient of Stein’s method is smoothness estimates for the solution of the Stein equation. This can often be done by solving the Stein equation using standard solution methods for differential equations and then using direct calculations to bound the required derivatives of the solution (Stein used the approach to bound the first two derivatives of the solution to the normal Stein equation (1.2)). The generator approach is also often used to obtain smoothness estimates. The use of probabilistic arguments to bound the derivatives of the solution often make it easier to arrive at smoothness estimates than through the use of analytical techniques. Luk and Pickett used the generator approach to bound -th order derivatives of the solution of the Stein equation (1.3). Pickett’s bounds are as follows
In this paper we obtain the key ingredients required to extend Stein’s method to the class of Variance-Gamma distributions. The Variance-Gamma distributions are defined as follows (this parametrisation is similar to that given in Finlay and Seneta ).
The Variance-Gamma distributions were introduced to the financial literature by Madan and Seneta . For certain parameter values the Variance-Gamma distributions have semi heavy tails that decay slower than the tails of the normal distribution, and therefore are often appropriate for financial modelling.
The class of Variance-Gamma distributions includes the Laplace distribution as a special case and in the appropriate limits reduces to the normal and Gamma distributions. This family of distributions also contains many other distributions that are of interest, which we list in the following proposition (the proof is given in Appendix A). As far as the author is aware, this is the first list of characterisations of the Variance-Gamma distributions to appear in the literature.
The representations of the Variance-Gamma distributions given in Proposition 1.2 enable us to determine a number of statistics that may have asymptotic Variance-Gamma distributions.
One of the main results of this paper (see Lemma 3.1) is the following Stein equation for the Variance-Gamma distributions:
In Section 3, we analyse the Stein equation (1.7). In particular, we show that the normal Stein equation (1.2) and Gamma Stein equation (1.3) are special cases. As a Stein equation for a given distribution is not unique (see Barbour ), the fact that in the appropriate limit the Variance-Gamma Stein equation (1.7) reduces to the known normal and Gamma Stein equation is an attractive feature.
Stein’s method has also recently been extended to the Laplace distribution (see Pike and Ren and Döbler ), although the Laplace Stein equation obtained by differs from the Laplace Stein equation that arises as a special case of (1.7); see Section 3.1.1 for a more detailed discussion. Another special case of the Stein equation (1.7) is a Stein equation for the product of two independent central normal random variables, which is in agreement with the Stein equation for products of independent central normal that was recently obtained by Gaunt . Therefore, the results from this paper allow the existing literature for Stein’s method for normal, Gamma, Laplace and product normal approximation to be considered in a more general framework.
More importantly, our development of Stein’s method for the Variance-Gamma distributions allows a number of new situations to be treated by Stein’s method. In Section 4, we illustrate our method by obtaining a bound for the distance between the statistic
local approach couplings, and symmetry arguments, that were introduced by Pickett , we obtain a bound for smooth test functions. A similar phenomena was observed in chi-square approximation by Pickett, and also by Goldstein and Reinert in which they obtained convergence rates in normal approximation, for smooth test functions, under the assumption of vanishing third moments. For non-smooth test functions we would, however, expect a convergence rate (cf. Berry-Esséen Theorem (Berry and Esséen ) to hold; see Remark 4.11.
The rest of this paper is organised as follows. In Section 2, we introduce the Variance-Gamma distributions and state some of their standard properties. In Section 3, we obtain a characterising lemma for the Variance-Gamma distributions and a corresponding Stein equation. We also obtain the unique bounded solution of the Stein equation, and present uniform bounds for the first four derivatives of the solution for the case . In Section 4, we use Stein’s method for Variance-Gamma approximation to bound the distance between the statistic (1.8) and its limiting Variance-Gamma distribution. We then apply this bound to an application of binary sequence comparison, which is a simple special case of the more general problem of word sequence comparison. In Appendix A, we include the proofs of some technical lemmas that are required in this paper. Appendix B provides a list of some elementary properties of modified Bessel functions that we make use of in this paper.
The class of Variance-Gamma distributions
In this section we present the Variance-Gamma distributions and some of their standard properties. Throughout this paper we will make use of two different parametrisations of the Variance-Gamma distributions; the first parametrisation was given in Section 1, and making the change of variables
leads to another useful parametrisation. This parametrisation can be found in Eberlein and Hammerstein .
The first parametrisation leads to simple characterisations of the Variance-Gamma distributions in terms of normal and Gamma distributions, and therefore in many cases it allows us to recognise statistics that will have an asymptotic Variance-Gamma distribution. For this reason, we state our main results in terms of this parametrisation. However, the second parametrisation proves to be very useful in simplifying the calculations of Section 3, as the solution of the Variance-Gamma Stein equation has a simpler representation for this parametrisation. We can then state the results in terms of the first parametrisation by using (2.1).
The Variance-Gamma distributions have moments of arbitrary order (see Eberlein and Hammerstein ), in particular the mean and variance (for both parametrisations) of a random variable with a Variance-Gamma distribution are given by
The following proposition, which can be found in Bibby and Sørensen , shows that the class of Variance-Gamma distributions is closed under convolution, provided that the random variables have common values of and (or, equivalently, common values of and in the second parametrisation).
Variance-Gamma random variables can be characterised in terms of independent normal and Gamma random variables. This characterisation is given in the following proposition, which can be found in Barndorff-Nielsen et al. .
Using Proposition 2.4 we can establish the following useful representation of the Variance-Gamma distributions, which appears to be a new result. Indeed, the representation allows us to see that the statistic (1.8) has an asymptotic Variance-Gamma distribution.
Let and be sequences of independent standard normal random variables. Then , , has a distribution, that is a distribution. Define
Stein’s method for Variance-Gamma distributions
Note that the tails are in general not symmetric.
Firstly, we consider . Let , and . Then applying integration by parts twice gives
as is a solution of the modified Bessel differential equation (see (B.10)).
Using formula (B.7) to differentiate gives
We now calculate the limit in the above expression. We first consider the case . Applying the asymptotic formula (B.2) gives
since . Now consider the case . We use the fact that to obtain
since . Therefore we have
Finally, we consider the case . We use the fact that to obtain
As is continuous, it follows that (and so ), which completes the proof of necessity.
This solution and its first derivative are bounded (see Lemma 3.3) and is piecewise twice differentiable. As and are bounded, they satisfy the condition (3.3) (with ) and must also satisfy the condition because, from (3.5),
for some constants and . Hence, if (3.2) holds for all piecewise twice continuously differentiable functions satisfying (3.3) (with ), then by (3.5),
which, recalling (1.3), we recognise as the Stein equation (1.3) of Luk (up to a multiplicative factor).
which in the limit is the classical Stein equation.
Taking , and in (1.7) gives the following Stein equation for distribution of the product of independent and random variables (see part (iii) of Proposition 1.2):
This Stein equation is in agreement with the Stein equation for the product of two independent, zero mean normal random variables that was obtained by Gaunt .
They have also solved (3.9) and have obtained uniform bounds for the solution and its first three derivatives. Their characterisation was obtained by a repeated application of the density method, and is similar to the characterisation for the Exponential distribution that results from the density method (see Stein et al. , Example 1.6), which leads to the Stein equation
1.2 Applications of Lemma 3.1
The main application of Lemma 3.1 that is considered in this paper involves the use of the resulting Stein equation in the proofs of the limit theorems of Section 4. There are, however, other interesting results that follow from Lemma 3.1. We consider a couple here.
Solving this equation subject to the condition then gives that the moment generating function of the Variance-Gamma distribution with is
which in terms of the first parametrisation is
We have that and (see (2.3)), and thus we can solve these recurrence equations by forward substitution to obtain the moments of the Variance-Gamma distributions. As far as the author is aware, these recurrence equations are new, although Scott et al. have already established a formula for the moments of general order of the Variance-Gamma distributions.
2 Smoothness estimates for the solution of the Stein equation
In the following lemma we give the solution to the Stein equation. The proof is given in Appendix A.
is very useful when it comes to obtaining smoothness estimates for the solution to the Stein equation. The equality ensures that we can restrict our attention to bounding the derivatives in the region , provided we obtain these bounds for both positive and negative .
The bounds given in Lemma 3.5 are of order as , except when is not equal to an integer, but is sufficiently close to an integer that
Gaunt remarked that the rogue term appeared to be an artefact of the analysis that was used to obtain the bounds.
Limit theorems for Symmetric-Variance Gamma distributions
We now consider the Symmetric Variance-Gamma () limit theorem that we discussed in the introduction. Let be a matrix of independent and identically random variables with zero mean and unit variance. Similarly, we let be a matrix of independent and identically random variables with zero mean and unit variance, where the are independent of the . Then the statistic
We first consider the case ; the general case follows easily as is a linear sum of independent . For ease of reading, in the statement of the following theorem and in its proof we shall set , and . Then we have the following:
Notice that the statistic is symmetric in and and the random variables and , and yet the bound (4.1) of Theorem 4.1 is not symmetric in and and the moments of and . This asymmetry is a consequence of the local couplings that we used to obtain the bound.
Before proving Theorem 4.1, we introduce some notation and preliminary lemmas. We define the standardised sum and by
and we have that . In our proof we shall make use of the sums
which are independent of and , respectively. We therefore have the following formulas
In the proof of Theorem 4.1 we use the following lemma, which can be found in Pickett , Lemma 4.3.
Due to the independence of the and variables, we are in the realms of the local approach coupling. We Taylor expand about to obtain
As , we obtain
We begin by bounding and . Taylor expanding about and using (4.2) gives
The bound for is immediate. We have
Taylor expanding about gives
where we used independence and that the have zero mean to obtain the final inequality. Putting this together we have that
Noting that , we may write as
We first consider . Taylor expanding about and using that gives
Putting this together we have the following bound for :
Using independence and that the have zero mean and then Taylor expanding about gives
To bound we Taylor expand about and use independence and that the have zero mean to obtain
1.2 Proof Part II: Symmetry Argument for Optimal Rate
We begin by considering the bivariate standard normal Stein equation (see, for example, Goldstein and Rinott ) with test functions , and . The bivariate standard normal Stein equation with test function , and solution is given by
where and are independent standard normal random variables.
with , , , .
Before we bound the remainder terms, we need bounds for the third order partial derivatives of the solution in terms of the derivatives of . We achieve this task by using the following lemma, the proof of which is given in Appendix A. Before stating the lemma, we define the double factorial function. The double factorial of a positive integer is given by
and we define (Arfken , p.547).
With these bounds it is straightforward to bound the remainder terms. The following lemma allows us to easily deduce bounds for the remainder terms , , and , .
We prove that the bound for holds; the bound for then follows by symmetry. We begin by defining . We note the following simple bound for , for :
Using our bound (4.5) for the third order partial derivative of with respect to , we have
We can bound , , and by using the bounds in Lemma 4.8. We illustrate the argument by bounding . In this case we have , that is and . We have
2 Extension to the case r>1𝑟1r>1
For the case of , we have the following generalisation of Theorem 4.1:
Since for any constant , we may use bound (4.1) from Theorem 4.1 and the bounds of Theorem 3.6 for the derivatives of the solution of the Stein equation to bound the above expression, which yields (4.8). ∎
The terms , for , are of order as (recall Theorem 3.6), and therefore the bound of Theorem 4.9 is of order . This in agreement with bound of Theorem 4.7 of Pickett for chi-square approximation, which is of order .
3 Application: Binary Sequence Comparison
We now consider a straightforward application of Theorem 4.1 to binary sequence comparison. This example is a simple special case of a more general problem of word sequence comparison, which is of particular importance to biological sequence comparisons. One way of comparing the sequences uses -tuples (a sequence of letters of length ). If two sequences are closely related, we would expect the -tuple content of both sequences to be very similar. A statistic for sequence comparison based on -tuple content, known as the statistic was suggested by Blaisdell (for other statistics based on -tuple content see Reinert et al. ). Letting denote an alphabet of size , and and the number of occurrences of the word in the first and second sequences, respectively, then the statistic is defined by
Due to the complicated dependence structure at both the local and global level (for a detailed account of the dependence structure see Reinert et al. ) approximating the asymptotic distribution of is a difficult problem. However, for certain parameter regimes has been shown to be asymptotically normal and Poisson; see Lippert et al. for a detailed account of the asymptotic distributions of for different parameter values.
We now consider the standardised statistic,
References
Appendix A Proofs from the text
Here we prove the lemmas that we stated in the main text without proof.
(ii) This follows by applying the formula to the density (1.5).
(iv) Taking in Corollary (2.5) leads to the general representation. The representation for the Laplace distribution now follows from part (ii).
(v) This follows on letting in Proposition 2.4 and then using the fact that if then .
(vi) Theorem 6 of Holm and Alouini gives the following formula for the probability density function of :
We can write the density of as follows
A.2 Proof of Lemma 3.3
We begin by proving that there is at most one bounded solution to the Variance-Gamma Stein equation (3.7) when . Suppose and are solutions to the Stein equation that satisfy . Define . Then satisfies , and is a solution to the following differential equation
This homogeneous differential equation has general solution
From the asymptotic formula (B.3) for , it follows that to have a bounded solution we must take . From the asymptotic formula (B.2) for , we see that has a singularity at the origin if . Therefore if , then for to be bounded we must take , and therefore and so .
We now use variation of parameters (see Collins ) to solve the Stein equation equation (3.7). The method allows us to solve differential equations of the form
Suppose and are linearly independent solutions of the homogeneous equation
Then the general solution to the inhomogeneous equation is given by
where and are arbitrary constants and is the Wronskian.
It is easy to verify that a pair of linearly independent solutions to the homogeneous equation
Formula (B.6) states that and therefore
where we used (B.5) to obtain the equality in the above display. Therefore the general solution to the inhomogeneous equation is given by
This solution is clearly bounded everywhere except possibly for or in the limits . We therefore choose and so that our solution is bounded at these points and thus for all real . To ensure the solution is bounded at the origin we must take . We choose so that the solution is bounded in the limits . If we take then we obtain solution (3.3). Taking would lead to the same solution (see Remark 3.4).
A.3 Proof of Lemma 4.5
Taylor expanding about gives
Taylor expanding the about allows us to write as
Putting this together, we have shown that
Rearranging and apply the triangle inequality now gives
and summing up the remainder terms completes the proof.
A.4 Proof of Lemma 4.7
We prove that inequality (4.5) holds; inequality (4.6) then follows by symmetry. We begin by obtaining a formula for the third order partial derivative of with respect to . Using a straightforward generalisation of the proof of Lemma 3.2 of Raič it can be shown that
We now use the simple inequality that to obtain the following bound on
and a similar inequality holds for . With these inequalities we have the following bound
Applying this bound to equation (A.3) gives the following bound on the third order partial derivative of with respect to :
Appendix B Elementary properties of modified Bessel functions
Here we list standard properties of modified Bessel functions that are used throughout this paper. All these formulas can be found in Olver et al. , except for the inequalities, which are given in Gaunt .
B.2 Basic properties
B.3 Asymptotic expansions
B.4 Identities
B.5 Differentiation
B.6 Modified Bessel differential equation
The modified Bessel differential equation is
The general solution is
B.7 Inequalities
Let and , then for we have
Acknowledgements
During the course of this research the author was supported by an EPSRC DPhil Studentship and an EPSRC Doctoral Prize. The author would like to thank Gesine Reinert for the valuable guidance she provided on this project. The author would also like to thank two anonymous referees for their helpful comments which have lead to a substantial improvement in the presentation of this paper.