Stein's method for dependent random variables occurring in Statistical Mechanics
Peter Eichelsbacher, Matthias Löwe
Introduction and main result
There is a long tradition in considering mean–field models in statistical mechanics. The Curie–Weiss model is famous, since it exhibits a number of properties of real substances, such as multiple phases, metastable states and others, explicitly. The aim of this paper is to prove Berry-Esseen bounds for the sums of dependent random variables occurring in statistical mechanics under the name Curie-Weiss models. To this end, we will develop Stein’s method for exchangeable pairs (see ) for a rich class of distributional approximations. For an overview of results on the Curie–Weiss models and related models, see , , .
Here is the inverse temperature and is a normalizing constant known as the partition function and denotes the cardinality of . Moreover is the distribution of a single spin in the limit . We define , the total magnetization inside . We take without loss of generality and , where is a positive integer. We write , , and , respectively, instead of , , , and , respectively. In the case where is fixed we may even sometimes simply write .
In the classical Curie–Weiss model, spins are distributed in according to . More generally, the Curie–Weiss model carries an additional parameter called external magnetic field which leads to the modified measure, given by
The measures is completely determined by the value of the total magnetization. It is therefore called an order parameter and its behaviour will be studied in this paper. The non-negative external magnetic field strength may even depend on the site:
In the general case (1.1), we will see (analogously to the treatment in ) that the asymptotic behaviour of depends crucially on the extremal points of a function (which is a transform of the rate function in a corresponding large deviation principle): define
We shall drop in the notation for whenever there is no danger of confusion, similarly we will suppress in the notation for and . For any measure , was proved to have global minima, which can be only finite in number, see [12, Lemma 3.1]. Define to be the discrete, non–empty set of minima (local or global) of . If , then there exists a positive integer and a positive real number such that
The numbers and are called the type and strength, respectively, of the extremal point . Moreover, we define the maximal type of by the formula
Note that the can be calculated explicitly: one gets
An interesting point is, that the global minima of of maximal type correspond to stable states, meaning that multiple minima represent a mixed phase and a unique global minimum a pure phase. For details see the discussions in .
The following is known about the fluctuation behaviour of under . In the classical model ( is the symmetric Bernoulli measure), for , in the Central Limit Theorem is proved:
in distribution with respect to the Curie–Weiss finite volume Gibbs states with . Since for the variance diverges, the Central Limit Theorem fails at the critical point. In it is proved that for there exists a random variable with probability density proportional to such that as
in distribution with respect to the finite-volume Gibbs states. Asymptotic independence properties and propagation of chaos for blocks of size have been investigated in .
In general, given , let be one of the global minima of maximal type and strength of . Then
in distribution, where is a random variable with probability density , defined by
Here, so that for as in (1.6), (see , ). Moderate deviation principles have been investigated in .
In and , a class of measures is described exhibiting a behaviour similar to that of the classical Curie–Weiss model. Assume that is any symmetric measure that satisfies the Griffiths-Hurst-Sherman (GHS) inequality,
(see also ). One can show that in this case has the following properties: There exists a value , the inverse critical temperature, and has a unique global minimum at the origin for and exactly two global minima, of equal type, for . For the unique global minimum is of type whereas for the unique global minimum is of type 1. At the law of large numbers still holds, but the fluctuations of live on a smaller scale than . This critical temperature can be explicitly computed as . By rescaling the we may thus assume that .
Alternatively, the GHS-inequality can be formulated in the terms of , defined in (1.3):
With GHS, we will denote the set of measures such that the GHS-inequality (1.10) is valid (for in the sense of (1)). We will give examples in Section 7.
In [12, Lemma 4.1], for it is proved that has a unique global minimum if and only if
where the right hand side of this strict inequality is the moment generating function of a standard normal random variable. Moreover, in the same Lemma it is proved that has a local minimum at the origin of type and strength if and only if
The aim of this paper is to prove the following theorems:
Let and . We have
where denotes the distribution function of the normal distribution with expectation zero and variance , and is an absolute constant, depending on , only.
Let and . We have
Let and depend on in such a way that monotonically as . Then the following assertions hold:
If for some , we have
If , converges in distribution to , given in (1.14). Moreover, if , (1.13) holds true.
If , the Kolmogorov distance of the distribution of and the normal distribution converges to zero. Moreover, if , we obtain
In , Barbour obtained distributional limit theorems, together with rates of convergence, for the equilibrium distributions of a variety of one-dimensional Markov population processes. In section 3 he mentioned, that his results can be interpreted in the framework of . As far as we understand, his result (3.9) can be interpreted as the statement (1.13), but with the rate .
In the first assertion of Theorem 1.4, our method of proof allows to compare the distribution of alternatively with the distribution with Lebesgue-density proportional to
To be able to compare the distribution of interest with a distribution depending on (on ), is one of the advantages of Stein’s method. The proof of this statement follows immediately from the proof of Theorem 1.4.
If in Theorem 1.4 (2) the speed of convergence reduces to . Likewise, if in Theorem 1.4 (3) , the speed of convergence is . This reduced speed of convergence reflects the influence of two potential limiting measures. Next to the ”true” limit there is also the limit measure from part (1) of Theorem 1.4, which in these cases is relatively close to our measures of interest.
2. Results for a general class of Curie-Weiss models
More generally, we obtain Berry-Esseen bounds for sums of dependent random variables occurring in the general Curie-Weiss models. We will be able to obtain Berry-Esseen-type results for -a.s. bounded single-spin variables :
Given in GHS, let be the global minimum of type and strength of . Assume that the single-spin random variables are bounded -a.s. In the case we obtain
where with defined by
with and is an absolute constant.
Let satisfy the GHS-inequality and assume that . Let be the global minimum of type with and strength of and let the single-spin variable be bounded. Let depend on in such a way that monotonically as . Then the following assertions hold true:
If for some , we have
If , converges in distribution to , defined as in Theorem 1.7. Moreover, if , (1.17) holds true.
Since the symmetric Bernoulli law is , Theorems 1.7 and 1.8 include Berry-Esseen type results for this case. But these results differ from the results in Theorem 1.2, 1.3 and 1.4 with respect to the limiting laws: the laws in 1.7 and 1.8 depend on moments of . The bounds in Theorems 1.2-1.4 are easier to obtain; moreover their proofs apply Corollary 2.8 and part (2) of Theorem 4.6 which are less involved versions of Stein’s method for exchangeable pairs.
The class of test functions for the Wasserstein distance is just the Lipschitz functions with constant no greater than 1. The total variation distance is given by the set of indicators of Borel sets, the Kolmogorov distance by the set of indicators of half lines.
Only for technical reasons, we consider now a modified model. Let
Given the Curie-Weiss model and in GHS, let be the global minimum of type and strength of . In the case , for any uniformly Lipschitz function we obtain for that
Lebowitz proved that if , then (1.12) is non-positive (see [10, V.13.7.(b)] and ). Stein’s method reduces to the computation of, or bounds on, low order moments, perhaps even only on variances of certain quantities. Such variance computations can be very difficult. We will see in the proof of Theorem 1.7 and Theorem 1.8 the use of Lebowitz’ inequality for bounding the variances successfully.
In the situation of Theorem 1.7 and Theorem 1.8 we can bound higher order moments as follows:
Given , let be one of the global minima of maximal type for and strength of . For
We prepare for the proof of Lemma 1.13. It considers a well known transformation – sometimes called the Hubbard–Stratonovich transformation – of our measure of interest.
As shown in , Lemma 3.1, our condition (1.2) ensures that
is finite, such that the above density is well defined.
The proof of this lemma can be found at many places, e.g. in , Lemma 3.3. ∎
In Section 2, we develop in Theorem 2.5, Corollary 2.8 and Corollary 2.9 refinements of Stein’s method for exchangeable pairs in the case of normal approximation. As a first application we prove Theorem 1.2 in Section 3. In Section 4 we develop Stein’s method for exchangeable pairs for a rich class of other distributional approximations. Obtaining good bounds for the solutions of the corresponding Stein equations in the appendix, we prove Theorem 1.3 and Theorem 1.4 in Section 5, applying Theorem 4.6. In Section 6, we proof Theorems 1.7, 1.8 and 1.10, applying Corollary 2.9 and Theorem 4.7. Section 7 contains a collection of examples including the Curie-Weiss model with three states, studying liquid helium, and a continuous Curie-Weiss model, where the single spin distribution is a uniform distribution.
Stein’s method with exchangeable pairs for normal approximation
for some . This approach has been successfully applied in many models, see and for example and references therein. In , the range of application was extended by replacing the linear regression property by a weaker condition, allowing to hold the regression property only approximately. The exchangeable pair approach is also successful for other distributional approximations, as will be shown in Section 4. We develop Stein’s method by replacing the linear regression property by
where will be depend on a continuous distribution under consideration. Before we consider in this section the case of normal approximation, we mention that this is not the first paper to study other distributional approximations via Stein’s method. For a rather large class of continuous distributions, the Stein characterization was introduced in , following [22, Chapter 6]. In , the method of exchangeable pairs was introduced for this class of distribution and used in a simulation context. Recently, the exchangeable pair approach was introduced for exponential approximation in [4, Lemma 2.1].
For measuring the distance of the distribution of and the standard normal distribution (or any other distribution), we would like to bound
for a class of test functions , where and is the standard normal distribution function. One advantage of Stein’s method is that we are able to obtain bounds for different distances like the Wasserstein distance , the total variation distance or the Kolmogorov distance . In , the exchangeable pair approach of Stein was developed for a broad class of non smooth functions , applying standard smoothing inequalities.
where is a number satisfying . If moreover
Rinott and Rotar also proved a bound in the case, where is not assumed to be bounded. In this case, the last two summands on the right hand side of (2.21) have to be replaced by
This estimation is crude, since even for a normalized sum of independent variables , it leads to a bound of the order . The advantage of the results in is, that these bounds do not only apply to indicators on half lines, but also to a broad class of non smooth test functions, see [19, Section 1.2].
Chen and Shao introduced a concentration inequality approach. Here a concentration inequality is proved using the Stein identity (see and ). In the context of the construction of an exchangeable pair, in Shao and Su proved the following theorem:
If , then the bound reduces to
When is bounded, (2.23) improves (2.21) with respect to the constants.
Following the lines of the proofs in and , we obtain the following refinement: Given two random variables and defined on a common probability space, we denote by
the Kolmogorov distance of the distributions of and .
Let be an exchangeable pair of real-valued random variables such that
If for a constant , we obtain the bound
When is bounded, (2.24) improves (2.21) with respect to the Berry-Esseen constants.
We sketch the proof: For a function with we obtain
Let denote the solution of the Stein equation
Using for all real (see [6, Lemma 2.2]), we obtain the bound
Using (see [6, Lemma 2.2]), we have
Bounding we apply the concentration technique, see :
Next observe that , see : by the mean value theorem one gets
Under the assumptions of our Theorem we proceed as in and obtain the following concentration inequality:
see ; here is defined by for , for and in between. Now we apply (2.25) and get
In the following corollary, we discuss the Kolmogorov-distance of the distribution of a random variable to a random variable distributed according to , the normal distribution with mean zero and variance .
Let and be an exchangeable pair of real-valued random variables such that
Let us denote by the solution of the Stein equation
with F_{\sigma}(z):=\frac{1}{\sqrt{2\pi}\sigma}\int_{-\infty}^{z}\exp\bigl{(}-\frac{y^{2}}{2\sigma^{2}}\bigr{)}\,dy. It is easy to see that the identity f_{\sigma,z}(x)=\sigma f_{z}\bigl{(}\frac{x}{\sigma}\bigr{)}, where is the solution of the corresponding Stein equation of the standard normal distribution, holds true. Using [6, Lemma 2.2] we obtain , , and . With (2.30) we arrive at
with ’s defined in (2.27). Using the bounds of and , the bound of is the same as in the proof of Theorem 2.5, whereas the bound of changes to
Since we consider the case , we have to bound
Using the Stein identity (2.32), the mean value theorem as well as the concentration inequality-argument along the lines of the proof of Theorem 2.5, we obtain
Berry-Esseen bounds for the classical Curie-Weiss model
Let be the symmetric Bernoulli measure and . Then
converges in distribution to a with :
We consider the usual construction of an exchangeable pair. We produce a spin collection via a Gibbs sampling procedure: select a coordinate, say , at random and replace by drawn from the conditional distribution of the ’th coordinate given . Let be a random variable taking values with equal probability, and independent of all other random variables. Consider
Hence is an exchangeable pair and
Let . Now we obtain
The conditional distribution at site is given by
Now \frac{1}{\sqrt{n}}\frac{1}{n}\sum_{i=1}^{n}\tanh(\beta m_{i}(X))=\frac{1}{\sqrt{n}}\frac{1}{n}\sum_{i=1}^{n}\bigl{(}\tanh(\beta m_{i}(X))-\tanh(\beta m(X))\bigr{)}+\frac{1}{\sqrt{n}}\tanh(\beta m(X))=:R_{1}+R_{2} with . Taylor-expansion leads to
To bound the first summand in (2.31), we obtain . Hence
Now we discuss the critical case , when is the symmetric Bernoulli distribution. For , using the Taylor expansion , (3.36) would lead to
Constructing the exchangeable pair in the same manner as before we will obtain
with and a reminder presented later. Considering the density , we have
This is the starting point for developing Stein’s method for limiting distributions with a regular Lebesgue-density and an exchangeable pair which satisfies the condition
with . To prove (3.38), observe that
By Taylor expansion and the identity we obtain
The exchangeable pair approach for distributional approximations
Motivated by the classical Curie-Weiss model at the critical temperature, we will develop Stein’s method with the help of exchangeable pairs as follows. For a rather large class of continuous distributions, the Stein characterization was introduced in , following the lines of [22, Chapter 6]. The densities occurring as limit laws in models of statistical mechanics belong to this class. Let be a real interval, where . A function is called regular if is finite on and, at any interior point of , possesses a right-hand limit and a left-hand limit. Further, possesses a right-hand limit at the point and a left-hand limit at the point .
Let us assume, that the regular density satisfies the following condition:
Assumption (D) Let be a regular, strictly positive density on an interval . Suppose has a derivative that is regular on and has only countably many sign changes and being continuous at the sign changes. Suppose moreover that and assume that
In [23, Proposition] it is proved, that a random variable is distributed according to the density if and only if
for a suitably chosen class of functions . The proof is integration by parts. The corresponding Stein identity is
where is a measurable function for which , and . The solution of this differential equation is given by
For the function let be the corresponding solution of (4.40). We will make the following assumptions:
Assumption (B1) Let be a density fulfilling Assumption (D). We assume that for any absolute continuous function , the solution of (4.40) satisfies
where and are constants.
Assumption (B2) Let be a density fulfilling Assumption (D) We assume that the solution of
for all real and , where and are constants.
At first glance, Condition (4.43) seem to be a rather strong or at least a rather technical condition.
In the case of the normal approximation, , we have to bound for the solution of the classical Stein equation. But it is easy to observe that by direct calculation (see [6, Proof of Lemma 6.5]). However, in the normal approximation case, this bound would lead to a worse Berry-Esseen constant (compare Theorem 2.5 with Theorem 4.6). Hence in this case we only use and .
We will see, that for all distributions appearing as limit laws in our class of Curie-Weiss models, Condition (4.43) can be proved:
The densities in (1.8) and (1.9) and the densities in Theorem 1.4, Theorem 1.7 and Theorem 1.8 satisfy Assumptions (D), (B1) and (B2).
We defer the proofs to the appendix, since they only involve careful analysis. ∎
With respect to all densities which appear as limiting distributions in our theorems, we restrict ourselves to bound solutions (and its derivatives) of the corresponding Stein equation characterizing distributions with probability densities of the form . Along the lines of the proof of Lemma 4.2, one would be able to present good bounds (in the sense that Assumption (B1) and (B2) are fulfilled) even for measures with a probability density of the form
In the case of comparing with an exponential distribution with parameter , it is easy to see, that Assumption (D) and (B2) is fulfilled, see [23, Example 1.6] for (D) and [4, Lemma 2.1] for (B2). We have and , and . Thus .
Therefore one has to bound the derivative of
The following result is a refinement of Stein’s result for exchangeable pairs.
Let be a density fulfilling Assumption (D). Let be an exchangeable pair of real-valued random variables such that
for some random variable , and defined in (4.39). Then
Let be a random variable distributed according to . Under Assumption (B1), for any uniformly Lipschitz function , we obtain
Let be a random variable distributed according to . Under Assumption (B2), we obtain for any
for a suitably chosen class of functions.
Under Assumption (B2) we obtain for any
Interestingly enough, the proof is a quite simple adaption of the results in and follows the lines of the proof of Theorem 2.5. For a function with we obtain
Proof of (1): Now let be the solution of the Stein equation (4.40), and define
By (4.50), following the calculations on page 21 in , we simply obtain
Proof of (2): Now let be the solution of the Stein equation (4.42). As in (2.27), using (4.50), we obtain
With we obtain
Since we obtain .
Analogously to the steps in the proof of Theorem 2.5, can be bounded by
The main observation is the following identity:
with defined as in the proof of Theorem 4.6. Now we can apply the Cauchy-Schwarz inequality to get
Now the proof follows the lines of the proof of Theorem 4.6. ∎
We discuss an alternative bound in Theorem 4.6 in the case that cannot be bounded uniformly. By the mean value theorem we obtain in general
Let us consider the example . Now
with . Hence we get
We will see in Section 5, that this bound is good enough for an alternative proof of Theorem 1.3.
Berry-Esseen bound at the critical temperature
We start with (3.38), where is given by (3.37). We will calculate the remainder term more carefully: By Taylor expansion and the identities and we obtain
Hence applying Theorem 4.6 we have to bound the expectation of
In Remark 4.8, we presented an alternative bound via Stein’s method without proving a uniform bound for . As we can see, the additional terms in this bound are of smaller order than , using .
(1) Let and . For the distribution function in Theorem 1.4 we obtain . Moreover we have
with . With we obtain
(2): we consider the case and . Now in (5.52), the term will be a part of the remainder:
applying Theorem 4.6, we obtain the convergence in distribution for any with , and we obtain the Berry-Esseen bound of order for any .
(3) Finally we consider and . Now we obtain
with and . We apply Corollary 2.8: with , one obtains and
Hence with we obtain convergence in distribution. Under the additional assumption we obtain the Berry-Esseen result. ∎
Proof of the general case
Given which satisfies the GHS-inequality and let be the global minimum of type and strength of . In case it is known that the random variable converges in distribution to a normal distribution with , see for example [10, V.13.15]. Hence in this case we will apply Corollary 2.9 (to obtain better constants for our Berry-Esseen bound in comparison to Theorem 4.7).
Consider . We just treat the case and denote . The more general case can be done analogously. For , we consider with . For any we consider
and , constructed as in Section 3, such that
Now we have to calculate the conditional distribution at site in the general case:
In the situation of Theorem 1.7, if is -a.s. bounded, we obtain
with .
We compute the conditional density of given under the Curie-Weiss measure:
By computation of the derivative of we see that
If we consider the Curie-Weiss model with respect to , the conditional density under this measure becomes
Applying Lemma 6.1 and the presentation (1.5) of , it follows that
With and we obtain
For any the first summand () is
To see this, let . Since we set , we obtain and therefore . In the case we know that . Hence in both cases, (6.53) is checked. Summarizing we obtain for any
Hence the last four summands in (4.48) of Theorem 4.7 are .
Since we assume that , we can apply the correlation-inequality due to Lebowitz (see Remark 1.12)
The choice and leads to the bound
Using a conditional version of Jensen’s inequality we have
Hence the variance of the second term in (6.54) is of the same order as the variance of the first term. Applying (1.5) for , the variance of the third term in (6.54) is of the order of the variance of . Summarizing the variance of (6.54) can be bounded by 9 times the maximum of the variances of the three terms in (6.54), which is a constant times , and therefore for we obtain
Since and for while and for , can now be expanded as
Hence . With Lemma 6.1 and we obtain
The remainder is the remainder in the proof of Theorem 1.7 with exchanged by and exchanged by .
Let and . We obtain
where . As in the proof of Theorem 1.7 we obtain that . Now we only have to adapt the proof of Theorem 1.7 step by step, applying Lemma 1.13, Lemma 4.2 and Theorem 4.7.
Let and . Now in (6.55), the term will be a part of the remainder:
Thus with Theorem 4.7 we obtain convergence in distribution for any with . Moreover we obtain the Berry-Esseen bound of order for any .
Finally we consider and . A little calculation gives
with and . Now we apply Corollary 2.9. With we obtain
which is of order \mathcal{O}\bigl{(}\frac{\beta_{n}}{n(1-\beta_{n})}\bigr{)}. Hence with we get convergence in distribution. Under the additional assumption that we obtain the Berry-Esseen bound. ∎
Examples
It is known that the following distributions are (see [11, Theorem 1.2]). The symmetric Bernoulli measure is , first noted in . The family of measures
for is , whereas the GHS-inequality fails for , see [21, p.153]. contains all measures of the form
where is even, continuously differentiable, and unbounded above at infinity, and is convex on . contains all absolutely continuous measures with support on for some provided is continuously differentiable and strictly positive on and is concave on . Measures like \varrho(dx)={\rm const.}\exp\bigl{(}-ax^{4}-bx^{2}\bigr{)}\,dx or \varrho(dx)={\rm const.}\exp\bigl{(}-a\cosh x-bx^{2}\bigr{)}\,dx with and real are GHS. Both are of physical interest, see and references therein).
We will now consider the next simplest example of the classical Curie–Weiss model: a model with three states. Observe, that this is not the Curie–Weiss–Potts model , since the latter has a different Hamiltonian. Indeed the Hamiltonian considered in is of the form . It favours states with many equal spins, whereas in our case the spins also need to have large values. We choose to be
This model seems to be of physical relevance. It is studied in . In it was used to analyze the tri-critical point of liquid helium. A little computation shows that
for all . Hence the GHS-inequality (1.10) is fulfilled (see also [11, Theorem 1.2]), which implies that there is one critical temperature such that there is one minimum of for and two minima above . Since we see that . For the minimum of is located in zero while for the two minima are symmetric and satisfy
For the rescaled magnetization satisfies a Central Limit Theorem and the limiting variance is . Indeed, . Hence and . Moreover we obtain
For the rescaled magnetization converges in distribution to which has the density . Indeed is computed to be 6. Moreover we obtain
If converges monotonically to faster than then converges in distribution to , whereas if converges monotonically to slower than then satisfies a Central Limit Theorem. Eventually, if , converges in distribution to a random variable which probability distribution has the mixed Lebesgue-density
For the rescaled magnetization converges in distribution to which has the density . Indeed is computed to be
If converges monotonically to faster than then converges in distribution to , whereas if converges monotonically to slower than then satisfies a Central Limit Theorem. Eventually, if , converges in distribution to the mixed density
Note that there is some interesting change in limiting behaviour of all of these models at criticality. While for all of the models have the same rate of convergence for the Central Limit Theorem behaviour, in the limit at criticality the limiting distribution function as well as the distributions which depend on some moments of becomes characteristic of the underlying distribution . Moreover the rate of convergence differs at criticality (for ).
Appendix
Consider a probability density of the form
Here . We have
with . Note that , so we need only to consider the case . For we obtain
So \exp\bigl{(}a_{k}x^{2k}\bigr{)}\int_{x}^{\infty}\exp\bigl{(}-a_{k}t^{2k}\bigr{)}\,dt attains its maximum at and therefore
So \exp\bigl{(}a_{k}x^{2k}\bigr{)}\int_{-\infty}^{x}\exp\bigl{(}-a_{k}t^{2k}\bigr{)}\,dt attains its maximum at and therefore
Applying (8.61) and (8.62) gives for all . Note that for we only have to consider the first case of (8.57), since . The constant is not optimal. Following the proof of Lemma 2.2 in or alternatively of Lemma 2 in [22, Lecture II] would lead to optimal constants. We omit this. It follows from (8.57) that
With (8.58) we obtain for that
The same argument for leads to . For we use the first half of (8.57) and apply (8.59) to obtain . Actually this bound will be improved later. Next we calculate the derivative of :
With (8.60) we obtain , so is an increasing function of (remark that for we only have to consider the first half of (8.57)). Moreover with (8.58), (8.59) and (8.60) we obtain that
Hence we have and |2k\,a_{k}\bigl{(}x^{2k-1}f_{z}(x)-u^{2k-1}f_{z}(u)\bigr{)}|\leq 1 for any and . From (8.58) it follows that for all and for . With Stein’s identity and (8.65) we have
Next we bound . We already know that . Again we apply (8.58) and (8.59) to see that
for and all . For this latter bound holds, as can be seen by applying this bound (more precisely the bound for for ) with for to the formula for in . For some constant we can bound by for all . Moreover, on the continuous function is bounded by some constant , hence we have proved
The problem of finding the optimal constant, depending on , is omitted. Summarizing, Assumption (B2) is fulfilled for with and some constants and .
An alternative bound is with some constant depending on the ’th moment of . This is using Stein’s identity (4.40) to obtain
The details are omit. To bound the second derivative , we differentiate (4.40) and have
Now we apply the fact that the quantity in (8.64) is non-negative to obtain
Moreover we know, that the quantity in (8.64) can be bounded by , hence
where is distributed according to . Summarizing we have for some constant , using the fact that and therefore and are continuous. Hence satisfies Assumption (B1). ∎
Now let p(x)=b_{k}\exp\bigl{(}-a_{k}V(x)\bigr{)} and satisfies the assumptions listed in Remark 4.3. To proof that (with respect to ) satisfies Assumption (B2), we adapt (8.60) as well as (8.61) and (8.62), using the assumptions on . We obtain for
Estimating gives
Acknowledgement. During the preparation of our manusscript we became aware of a preprint of S. Chatterjee ans Q.-M. Shao about Stein’s method with applications to the Curie-Weiss model. As far as we understand, there the authors give an alternative proof of Theorem 1.2 and 1.3.