Exponential Convergence in $L^p$-Wasserstein Distance for Diffusion Processes without Uniformly Dissipative Drift
Dejun Luo, Jian Wang
Introduction
In this paper we consider the following Itô stochastic differential equation
Yet there is another approach for studying exponential convergence of the semigroup corresponding to the SDE (1.1) considered in this paper, that is, the coupling method. If the drift vector field fulfills certain dissipative properties, this latter method provides explicit rate of convergence to equilibrium in a straightforward way, see e.g. and the preprint . The present work is motivated by where the author obtained exponential decay of when the drift is assumed to be only dissipative at infinity, see the introduction below for more details.
In this paper, we shall establish the exponential convergence of the map with respect to the -Wasserstein distance for all . We first recall some known results.
Suppose that and there exists a constant such that
The proof of this result is quite straightforward, by simply using the synchronous coupling (also called the coupling of marching soldiers in [7, Example 2.16]), see e.g. [2, p.2432] for a short proof.
As in [13, (2.3)], we shall assume throughout the paper that
This technical condition will be used in Section 2 to construct the auxiliary function.
Suppose that the vector field is locally Lipschitz continuous, and there is a constant such that
In particular, when , the condition (1.5) holds true if (1.2) is satisfied only for large , that is,
for some constant large enough. Therefore, [13, Corollary 2.3] implies that the map converges exponentially with respect to the standard -Wasserstein distance under locally non-dissipative drift, see [13, Example 1.1] for more details. The proof of [13, Corollary 2.3] is based on the coupling by reflection of diffusion processes and a carefully constructed concave function, cf. [13, Section 2]. A number of direct consequences are presented in [13, Section 2.2] which indicate that the convergence result as [13, Corollary 2.3] is extremely useful.
However, [13, Corollary 2.3] is not satisfactory in the sense that no information on the -Wasserstein distance is provided. This fact has also been noted in [5, Section 7.1, Remark 19], saying that “the reflection coupling cannot furnish some information on ”. Our main result of this paper shows that this is not the case.
Assume that there are constants and such that for all , one has
where is a positive constant.
Theorem 1.3 above does provide new conditions on the drift term such that the associated semigroup is exponentially convergent with respect to the -Wasserstein distance for all . The reason that we can obtain the exponential convergence in for all , not only , is due to our particular choice of the auxiliary function which is convex near infinity. It is designed by using Chen–Wang’s famous variational formula for the principal eigenvalue of one-dimensional diffusion operator, see for instance or [7, Section 3.4]. The reader can refer to and the references therein for recent studies on this topic.
The assertion of Theorem 1.3 can be slightly strengthened if (1.6) is replaced by a stronger condition.
Assume that there are constants , and such that for all , one has
The idea of the proof is to use synchronous coupling for large and the coupling by reflection for small . For the latter part, we can directly use the result of Theorem 1.3, since (1.8) implies that (1.6) holds with if .
Before presenting the consequences of Theorem 1.3, let us first make some comments and give some examples. In the beginning, we intended to generalize Eberle’s results by assuming that there is a constant such that
It turns out that under mild conditions on , the two conditions (1.5) and (1.10) are equivalent, up to changing the constants. More explicitly, we have
Assume that there are constants such that and . Then, the condition (1.5) holds with some other positive constant.
This result indicates that if the function is locally bounded from above, then the following statements are equivalent:
there exist constants such that ;
there exist constants and such that for large enough;
there exists a constant such that for large enough.
The equivalence stated above also indicates that Theorem 1.3 is sharp in some situation, as shown by the next example.
Assume that and with for some . Then, we have the following statements.
If , then for all large enough, and the inequality (1.7) does not hold for any positive constants and with .
where is the total variation distance between probability measures.
To show the power of Theorem 1.4, we consider the following example which yields the exponential convergence of the semigroup with respect to the -Wasserstein distance for super-convex potentials. The assertion below improves the results mentioned in [5, Section 6.1].
Let and with and . It follows from [5, Section 6, Example 1] that there is a constant such that for all ,
holds with some constant .
As applications of Theorem 1.3, we consider the existence and uniqueness of the invariant probability measure, and also the exponential convergence of the semigroup with respect to the -Wasserstein distance . For , we define
holds with some positive constant .
Under the assumptions of Corollary 1.8, it is easy to establish the following Foster-Lyapunov type conditions:
where is the associated transition probability.
As was pointed by the referee, the conclusion of Corollary 1.8 for also could be deduced from Theorem 1.4 and the Foster-Lyapunov type condition above, by Harris’s theorem for the exponential convergence to the invariant measure in the Wasserstein metric. See [17, Theorem 4.8] and [4, Theorem 2.4] for more details.
The following statement is concerned with symmetric diffusion processes. Though we believe the assertion below is known (see e.g. [25, Corollary 1.4]), we stress the relation between the exponential convergence with respect to -Wasserstein distance and that with respect to the -norm, which is equivalent to the Poincaré inequality.
then satisfies the Poincaré inequality, i.e.
Preliminaries
Similar to the main result in , the proof of Theorem 1.3 is based on the reflection coupling of Brownian motion, which was introduced by Lindvall and Rogers and developed by Chen and Li . First, we give a brief introduction of the coupling by reflection. Together with (1.1), we also consider
where is a one dimensional Brownian motion expressed by .
Next, by Itô’s formula and (2.2), for ,
Denote by . Then
2 Auxiliary Function
For any , let be the following strictly increasing function
The definition of seems a little bit strange at first sight, indeed, it is motivated by Chen–Wang’s variational formula for principal eigenvalue of one-dimensional diffusion operator
on . One key to this famous formula is the following elegant mimic eigenfunction associated with the first eigenvalue see [7, pp. 52–53] for a heuristic argument:
Now, let in the definition of the function above. Then we have
for some proper choice of the constant in the definition of .
On the one hand, it is easy to see that for any ,
We first show that and are comparable.
There exist two constants and such that for all ,
Note that as and as , where means the two quantities are of the same order. By L’Hôpital’s law,
Therefore, the required assertion follows from the two limits above. ∎
Roughly speaking, the above two inequalities imply that the auxiliary function behaves like for small , and grows exponentially fast as for large . Hence, the function can be used to control the function with . More explicitly, we have
There is a constant such that for all ,
Consequently, for any , there is a constant such that for all ,
Furthermore, we can give an explicit estimate to the constant in Lemma 2.2, which will be used in the exponential convergence rate.
The constant in Lemma 2.2 has the following expression:
In order to estimate , we need the following inequality on the tail of standard Gaussian distribution (e.g. see [11, (3)]):
where and are respectively the distribution and density function of the standard Gaussian distribution . Consequently, for ,
Substituting this estimate into the expression of leads to
Next, we consider two cases. (i) If , then
Substituting this estimate into (2.10) yields
Summarizing the above two cases and using (2.9), we obtain
it is easy to show that for all , it holds
On the other hand, for , we have
Combining the above two inequalities, we deduce that for all ,
Having the two inequalities (2.12) and (2.13) in hand, and using (2.11), we complete the proof. ∎
We also need the following simple result.
If , then, using the fact that
The required assertion follows immediately from these two conclusions. ∎
Finally, we present a consequence of all the previous results in this part.
Let be the function defined by (2.4), and be the constant in Theorem 1.3. Then, for all ,
By (2.5), we deduce from Lemmas 2.4 and 2.5 that
where the last inequality follows from (2.6). This, along with the definition of the constant in Theorem 1.3, yields the required assertion.∎
Proofs of Theorems and Proposition
Let be the function defined by (2.4). According to (2.3) and Itô’s formula, it holds that
Combining this inequality with (2.15), we obtain
where is the constant in Theorem 1.3.
Then, for , the inequality (3.1) yields
Taking expectation in the both hand sides of the inequality above leads to
Since the coupling process is non-explosive, we have a.s. as , where is the coupling time. Thus by Fatou’s lemma, letting in the above inequality gives us
Thanks to our convention that for , we have for all . Therefore,
If , then for any and any , we deduce from (2.8), (3.3) and (2.7) that
for some constant . Therefore, if , then for any there is a constant such that
Set for . Then and ; moreover, (3.6) implies for all . Therefore, for all , by (3.5),
where in the last inequality we have used . The proof of Theorem 1.3 is completed. ∎
and is the coupling time. For , we still set . Therefore, the difference process satisfies
Still denoting by , we get
where in the first inequality we have used (3.4), and the last inequality follows from (3.8). In particular, we have for all and ,
Combining with all conclusions above, we complete the proof of Theorem 1.4. ∎
Next, since , we have
which implies . Therefore
for all , and so (1.5) holds with the new constant . ∎
Proofs of Examples and Corollaries
Now the result follows immediately from the fact that is strictly increasing when is large enough. Indeed, we have
which is positive if .
On the other hand, it is easy to see that with the choices of and above, the semigroup is symmetric with respect to the probability measure . Then, according to (the proof of) Corollary 1.10 below, we know that fulfills the Poincaré inequality (1.15) if (1.7) is satisfied with ; however, this is impossible, see e.g. [27, Example 4.3.1 (3)].
which implies is strictly concave. Hence for all . On the other hand, to show that for all , as in the proof of (1), we simply look at the one dimensional case:
Then, the assertion (1.11) immediately follows from (the proof of) Theorem 1.1, by simply using the synchronous coupling, see e.g. [2, p.2432].
Finally we prove the algebraic convergence rate (1.12). For this, we mainly follow from [9, Section 5] or [5, Section 7.2] (see also [21, Theorem 1.1]). By (2.3),
where is the coupling time of the coupling process , and is the same one-dimensional Brownian motion as in (2.2). Hence,
Denote by which has the same law as that of . Thus for any ,
In particular, by the definition of total variation distance,
Finally we present the proofs of the two corollaries of Theorem 1.3.
The inequality above also yields the uniqueness of the invariant measure.
This implies that (e.g. see [6, Theorem 5.10] or [24, Theorem 5.10])
holds for any and any Lipschitz continuous function , where denotes the Lipschitz semi-norm with respect to the Euclidean norm .
Replacing with in the equality above, we arrive at
Next, we follow the proof of [23, Lemma 2.2] to show that the inequality above yields the desired Poincaré inequality. Indeed, for every with and . By the spectral representation theorem, we have
where in the inequality above we have used Jensen’s inequality. Thus,
which is equivalent to the desired Poincaré inequality, see e.g. [27, Theorem 1.1.1]. ∎
Acknowledgements. The authors are grateful to the referee for his valuable comments which helped to improve the quality of the paper.