Exponential Contraction in Wasserstein Distances for Diffusion Semigroups with Negative Curvature

Feng-Yu Wang

Introduction

Let MM be a dd-dimensional connected complete Riemannian manifold possibly with a convex boundary M\partial M. Let ρ\rho be the Riemannian distance. Consider L=Δ+ZL=\Delta+Z for the Laplace-Beltrami operator Δ\Delta and some C1C^{1}-vector field ZZ such that the (reflecting) diffusion process generated by LL is non-explosive. Then the associated Markov semigroup PtP_{t} is the (Neumann if M\partial M\neq\emptyset) semigroup generated by LL on MM. In particular, it is the case when the curvature of LL is bounded below; that is,

is equivalent to the curvature condition (1.1). Here, P(M)\mathscr{P}(M) is the class of all probability measures on MM; WpW_{p} is the LpL^{p}-Warsserstein distance induced by ρ\rho, i.e.,

where C(μ1,μ2)\mathscr{C}(\mu_{1},\mu_{2}) is the class of all couplings of μ1\mu_{1} and μ2\mu_{2}; and for a Markov operator PP on Bb(M)\mathscr{B}_{b}(M) (i.e. PP is a positivity-preserving linear operator with P1=1P1=1),

where ν(f):=Mfdν\nu(f):=\int_{M}f\text{\rm{d}}\nu for fL1(ν)f\in L^{1}(\nu). In some references, νP\nu P is also denoted by PνP^{*}\nu. In the sequel we will use PtP_{t}^{*} to stand for the adjoint operator of PtP_{t} in L2(μ)L^{2}(\mu) for the invariant probability measure μ\mu, hence adopt the notation νP\nu P rather than PνP^{*}\nu to avoid confusion. When the curvature is positive (i.e. K>0K>0), (1.2) implies the WpW_{p}-exponential contraction of PtP_{t} for p1.p\geq 1.

In this paper, we aim to consider the case when (1.1) only holds for some negative constant K,K, and to prove the exponential contraction

for some constants c,λ>0c,\lambda>0. It is crucial that the exponential rate λ\lambda is independent of pp. Due to the equivalence of (1.1) and (1.2), in the negative curvature case it is essential that c>1c>1.

According to , even when RicZ\text{\rm{Ric}}_{Z} is unbounded below, i.e. RicZ\text{\rm{Ric}}_{Z} goes to -\infty when ρo:=ρ(o,)\rho_{o}:=\rho(o,\cdot)\rightarrow\infty for a fixed oMo\in M, there may exist the log-Sobolev inequality which implies the exponentially convergence of PtP_{t} in entropy. This suggests that (1.3) may also hold for a class of diffusion semigroups with negative curvature.

for some constants c,λ>0c,\lambda>0, where δx\delta_{x} is the Dirac measure at point xx. Indeed, proved the W1W_{1}-exponential contraction with respect to a modified distance f(xy)f(|x-y|) in place of xy|x-y| as constructed in for estimates of the spectral gap using the coupling by reflection. Under condition (1.4) the modified distance is comparable with the usual one so that (1.5) follows. As mentioned in that there is essential difficulty to prove (1.3) for p>1p>1 even for this flat case.

In Luo and Wang the estimate (1.5) was extended as

for some constants c,λ>0c,\lambda>0. Comparing with (1.3) which is equivalent to

according to (see Proposition 3.1 below), (1.6) is less sharp for small xy|x-y| and/or large pp. It is open whether (1.4), or in the Riemannian setting that RicZ\text{\rm{Ric}}_{Z} is uniformly positive outside a compact domain, implies (1.3) for some constants c,λ>0c,\lambda>0.

As in , we will consider the Warsserstein distances induced by Young functions in the class

For any ΦN\Phi\in\mathscr{N} and a measure ν\nu on MM, consider the gauge norm in LΦ(ν):L^{\Phi}(\nu):

In particular, we have fLΦp(ν)=fLp(ν)\|f\|_{L^{\Phi_{p}}(\nu)}=\|f\|_{L^{p}(\nu)} for Φp(r):=rp, p(1,)\Phi_{p}(r):=r^{p},\ p\in(1,\infty). This is the reason why we do not take Φp(r)=1prp\Phi_{p}(r)=\frac{1}{p}r^{p} in the characterization of Legendre conjugates. We extend the notion Φp\Phi_{p} to p=1,p=1,\infty by letting Φ1(r)=r,Φ=limpΦp\Phi_{1}(r)=r,\Phi_{\infty}=\lim_{p\rightarrow\infty}\Phi_{p} and fLΦp(ν)=fLp(ν)\|f\|_{L^{\Phi_{p}}(\nu)}=\|f\|_{L^{p}(\nu)} for all p[1,].p\in[1,\infty]. Now, let

In particular, WΦp=WpW_{\Phi_{p}}=W_{p} for p[1,].p\in[1,\infty]. We aim to prove the exponential decay

when (1.1) only holds for a negative constant KK, where Φ1\Phi^{-1} is the inverse of Φ(Φ)\Phi(\neq\Phi_{\infty}) and we set Φ1(1)=1\Phi^{-1}_{\infty}(1)=1 by convention.

To extend condition (1.4) to the Riemannian setting, consider the index

where ρ\rho is the Riemannian distance, R\mathscr{R} is the curvature tensor; γ:[0,ρ(x,y)]M\gamma:[0,\rho(x,y)]\rightarrow M is the minimal geodesic from xx to yy with unit speed; {Ji}i=1d1\{J_{i}\}_{i=1}^{d-1} are Jacobi fields along γ\gamma such that

holds for the parallel transform Px,y:TxMTyMP_{x,y}:T_{x}M\rightarrow T_{y}M along the geodesic γ\gamma, and {γ˙(s),Ji(s):1id1}\{\dot{\gamma}(s),J_{i}(s):1\leq i\leq d-1\} (s=0,ρ(x,y)s=0,\rho(x,y)) is an orthonormal basis of the tangent space (at points xx and yy, respectively).

Note that when (x,y)Cut(M)(x,y)\in{\rm Cut}(M), i.e. xx is in the cut-locus of yy, the minimal geodesic may be not unique. As a convention in the literature, all conditions on the index II are given outside Cut(M){\rm Cut}(M). We now extend condition (1.4) to the non-flat case as follows: for some constants K1,K2>0K_{1},K_{2}>0,

In the flat case we have I(x,y)=0I(x,y)=0 and ρ(x,y)=xy\rho(x,y)=|x-y|, so that this condition reduces back to (1.4). Moreover, the curvature condition (1.1) is equivalent to

so that (1.8) implies RicZ(K1+K2).\text{\rm{Ric}}_{Z}\geq-(K_{1}+K_{2}).

In the next section, we state our main results and present examples. With condition (1.8) we first extend the main results of to the present Riemannian setting and give the exponential convergence of PtP_{t} in W2W_{2}. Under the ultracontractivity and condition (1.1) for some K<0K<0, our the second result ensures the desired inequality (1.7). Finally, we extend these results to SDEs with multiplicative noise by using explicit conditions on the coefficients. To prove these results, we make some preparations in Section 3. Complete proofs of the main results are addressed in Sections 4-6 respectively.

Main Results and examples

We first consider the Riemannian setting, then extend to SDEs with multiplicative noise by using explicit conditions on the coefficients instead of the less explicit curvature condition.

We start with condition (1.8). Besides the extension of (1.6), this condition also implies the hypercontractivity and the exponential convergence in W2W_{2} for the semigroup PtP_{t}. For a measure μ\mu and constants p,q1p,q\geq 1, let Lp(μ)Lq(μ)\|\cdot\|_{L^{p}(\mu)\rightarrow L^{q}(\mu)} stand for the operator norm form Lp(μ)L^{p}(\mu) to Lq(μ)L^{q}(\mu). Recall that PtP_{t} is called hypercontractive if it has a unique invariant probability measure μ\mu and PtL2(μ)L4(μ)=1\|P_{t}\|_{L^{2}(\mu)\rightarrow L^{4}(\mu)}=1 holds for large t>0t>0. By interpolation theorem, PtL2(μ)L4(μ)=1\|P_{t}\|_{L^{2}(\mu)\rightarrow L^{4}(\mu)}=1 can be replaced by PtLp(μ)Lq(μ)=1\|P_{t}\|_{L^{p}(\mu)\rightarrow L^{q}(\mu)}=1 for some >q>p>1.\infty>q>p>1.

Let \eqrefEB\eqref{EB'} hold for some constants K1,K2K_{1},K_{2} and r0>0r_{0}>0. Then:

There exist two constants c,λ>0c,\lambda>0 such that for any ΦNˉ\Phi\in\bar{\mathscr{N}} and x,yMx,y\in M,

PtP_{t} has a unique invariant probability measure μ\mu and the log-Sobolev inequality

holds for some constant C>0C>0. Consequently, PtP_{t} is hypercontractive.

There exist constants c,λ>0c,\lambda>0 such that

To illustrate this result, we present below a consequence with explicit curvature conditions in the spirit of . These conditions allow RicZ\text{\rm{Ric}}_{Z} to be negative everywhere, for instance, when C1RicC2-C_{1}\leq\text{\rm{Ric}}\leq-C_{2} and C2>ZδC_{2}>-\nabla Z\geq\delta for some constants C1>C2>δ>0C_{1}>C_{2}>\delta>0. As indicated in Introduction that (1.8) implies RicZ(K1+K2),\text{\rm{Ric}}_{Z}\geq-(K_{1}+K_{2}), so in the following corollary we assume that RicZ\text{\rm{Ric}}_{Z} is bounded below.

Assume that RicZ\text{\rm{Ric}}_{Z} is bounded below. Let ρo=ρ(o,)\rho_{o}=\rho(o,\cdot) for a fixed point oMo\in M. If there exist constants σ>0\sigma>0 and δ>σ(1+2)d1\delta>\sigma(1+\sqrt{2})\sqrt{d-1} such that

Next, we introduce sufficient conditions for (1.7) which allow RicZ\text{\rm{Ric}}_{Z} to be negative. Due to technical reason, we will need the ultracontractivity of PtP_{t}, which is essentially stronger than the hypercontractivity. We call PtP_{t} ultracontractive if PtL1(μ)L(μ)<\|P_{t}\|_{L^{1}(\mu)\rightarrow L^{\infty}(\mu)}<\infty for all t>0.t>0. The ultracontractivity implies that PtP_{t} has a density pt(x,y)p_{t}(x,y) with respect to μ\mu (called heat kernel) and

In references (see e.g. ), the ultracontractivity is also defined by PtL2(μ)L(μ)<\|P_{t}\|_{L^{2}(\mu)\rightarrow L^{\infty}(\mu)}<\infty for t>0t>0. When PtP_{t} is symmetric in L2(μ)L^{2}(\mu) we have

so that these two definitions are equivalent. However, when PtP_{t} is non-symmetric, the former might be stronger than the latter. The appearance of the ultracontractivity in our study is very nature: by Theorem 2.3(1) we already have (1.7) for Φ=Φ1\Phi=\Phi_{1} (the weakest case), and by the ultracontractivity we are able to deduce the inequality from Φ1\Phi_{1} to Φ\Phi_{\infty} (the strongest case). On the other hand, the result also indicates that (1.7) implies the hypercontractivity of PtP_{t}.

Assume that RicZ\text{\rm{Ric}}_{Z} is bounded below.

If PtP_{t} is ultracontractive, then there exist constants c,λ>0c,\lambda>0 such that for any ΦNˉ\Phi\in\bar{\mathscr{N}},

Consequently, for any p[1,],t0p\in[1,\infty],t\geq 0 and μ1,μ2P(M)\mu_{1},\mu_{2}\in\mathscr{P}(M),

On the other hand, if there exist constants c,λ>0c,\lambda>0 such that

then the log-Sobolev inequality \eqrefLS\eqref{LS} holds for c=2c2λc=\frac{2c^{2}}{\lambda}, so that PtP_{t} is hypercontractive.

We note that in Theorem 2.3(1) we have ρLp(μ×μ)<\|\rho\|_{L^{p}(\mu\times\mu)}<\infty for p[1,)p\in[1,\infty). Indeed, since RicZ\text{\rm{Ric}}_{Z} is bounded below, by [23, Theorem 2.1] the ultracontractivity implies the super log-Sobolev inequality (3.3) below, so that due to Herbst we have (μ×μ)(erρ2)<(\mu\times\mu)(\text{\rm{e}}^{r\rho^{2}})<\infty for all r>0r>0 (see e.g. ). Therefore, GΦ(t)<G_{\Phi}(t)<\infty for t>0t>0 and ΦN\Phi\in\mathscr{N} satisfying

In the symmetric case (i.e. Z=VZ=\nabla V for some VC2(M)V\in C^{2}(M)), explicit sufficient conditions for the ultracontractivity have been introduced in by using the dimension-free Harnack inequality in the sense of . Together with a suitable exponential estimate on the diffusion process, this inequality implies PtL2(μ)L(μ)<\|P_{t}\|_{L^{2}(\mu)\rightarrow L^{\infty}(\mu)}<\infty for t>0t>0 and thus, PtP_{t} is ultracontractive due to (2.5). The conditions can be formulated as

where Ψ1,Ψ2:(0,)(0,)\Psi_{1},\Psi_{2}:(0,\infty)\rightarrow(0,\infty) are increasing functions such that

and for some constants θ(0,1/(1+2))\theta\in(0,1/(1+\sqrt{2})) and C>0,C>0,

When Ric is bounded below, (2.11) as well as the second inequality in (2.9) hold for Ψ2\Psi_{2} being a large enough constant. In general, since 0rΨ1(s)ds20r/2Ψ1(s)ds\int_{0}^{r}\Psi_{1}(s)\text{\rm{d}}s\geq 2\int_{0}^{r/2}\Psi_{1}(s)\text{\rm{d}}s, (2.11) with θ=14<11+2\theta=\frac{1}{4}<\frac{1}{1+\sqrt{2}} follows from

Since (2.5) fails for non-symmetric semigroups, we apply the inequality

due to the semigroup property. So, to ensure the ultracontractivity, we need an additional condition implying PtL1(μ)L2(μ)<\|P_{t}\|_{L^{1}(\mu)\rightarrow L^{2}(\mu)}<\infty (see Corollary 2.4(2) below).

To estimate GΦ(t)G_{\Phi}(t) in (2.6) using Ψ1\Psi_{1}, we introduce

Obviously, the inverse function Λ21\Lambda_{2}^{-1} exists on (0,)(0,\infty), and since Λ1\Lambda_{1} is increasing with Λ1()=\Lambda_{1}(\infty)=\infty, we have

Assume that \eqref4.3\eqref{4.3} and \eqref4.4\eqref{4.4} hold for some constants θ(0,1/(1+2))\theta\in(0,1/(1+\sqrt{2})) and C>0.C>0.

If PtP_{t} is symmetric, i.e. Z=VZ=\nabla V for some VC2(M)V\in C^{2}(M), then there exist constants c,λ>0c,\lambda>0 such that \eqrefLL0\eqref{LL0} and \eqrefLL1\eqref{LL1} hold for

If PtP_{t} is non-symmetric but there exists continuous hC(;[0,))h\in C(;[0,\infty)) with h(r)>0h(r)>0 for r>0r>0 such that 01h(r)rdr<\int_{0}^{1}\frac{h(r)}{r}\text{\rm{d}}r<\infty and

then there exist constants c,λ>0c,\lambda>0 such that \eqrefLL0\eqref{LL0} holds for

To conclude this part, we present a simple example to illustrate Corollary 2.4.

Let MM have non-positive sectional curvatures and a pole oMo\in M. Let Z=Z0δρo2+εZ=Z_{0}-\delta\nabla\rho_{o}^{2+\varepsilon} outside a compact domain, where δ,ε>0\delta,\varepsilon>0 are constants and Z0Z_{0} is a C1C^{1} vector field with

Let Ψ2:(0,)(0,)\Psi_{2}:(0,\infty)\rightarrow(0,\infty) be increasing such that

By (2.13), (2.14) and the Hessian comparison theorem, we see that (2.9), (2.10) and (2.12) hold with Ψ1(r)=c1rε\Psi_{1}(r)=c_{1}r^{\varepsilon} for some constant c1>0c_{1}>0. According to Corollary 2.4, there exist constants c,λ>0c,\lambda>0 such that for any p1p\geq 1,

2 SDEs with multiplicative noise

We intend to investigate the WpW_{p}-exponential contraction for p[1,)p\in[1,\infty). As mentioned in Introduction that existing results only apply to p=1p=1 and σ=I\sigma=I, and as mentioned in that there is essential difficulty to prove (1.3) for p>1p>1 even for σ=I\sigma=I. So, the present study is non-trivial.

Corresponding to that (1.1) implies (1.2) in the Riemannian setting, we have the following assertion.

Note that this result does apply to p=p=\infty when σ\sigma is non-constant. Next, as in the Riemannian case, we intend to prove the exponential contraction in WpW_{p} when (2.16) only holds for some negative constant KpK_{p}. To this end, we need the SDE to be non-degenerate. The following result contains analogous assertions in Theorems 2.1 and 2.3, where the first assertion extends (1.5) to the multiplicative noise setting.

Assume that σσλ02I\sigma\sigma^{*}\geq\lambda_{0}^{2}I for some constant λ0>0\lambda_{0}>0.

If there exist constants K1,K2,r0>0K_{1},K_{2},r_{0}>0 such that ZZ and σ0:=σσλ02I\sigma_{0}:=\sqrt{\sigma\sigma^{*}-\lambda_{0}^{2}I} satisfy

then there exist constants c,λ>0c,\lambda>0 such that

Let PtP_{t} have a unique invariant probability measure μ\mu such that the log-Sobolev inequality

holds for some constant C>0C>0. If there exists a constant K>0K>0 such that

Combining this with HS2d2\|\cdot\|_{HS}^{2}\leq d\|\cdot\|^{2}, we see that (2.17) follows from the following more explicit condition:

Note that conditions in Theorem 2.5 and Theorem 2.6(1) are explicit. To illustrate Theorem 2.6(2)-(3), we present below sufficient conditions for the log-Sobolev inequality (2.18) and the ultracontractivity of PtP_{t}. For a:=σσa:=\sigma\sigma^{*} and (gij)1i,jd:=a1(g_{ij})_{1\leq i,j\leq d}:=a^{-1}, we introduce the Christoffel symbols

for some constant K0K_{0}. If there exist constants c1,c2>0c_{1},c_{2}>0 and δ>1\delta>1 such that

then PtP_{t} has a unique invariant probability measure μ\mu and there exists a constant c>0c>0 such that

We now introduce a simple example to illustrate Theorem 2.6.

then (2.22) holds for some constant K0K_{0}. Moreover, it is easy to see that

holds for some constants c1,c2>0c_{1},c_{2}>0. By Proposition 2.7 and Theorem 2.6(3), for any p[1,)p\in[1,\infty), there exist constants λ,c>0\lambda,c>0 such that

Preparations

This section includes some propositions which will be used to prove the results introduced in Section 2. We first recall a link between the Wasserstein distance and gradient estimates due to , then deduce the hyperboundedness and the exponential convergence in entropy from the log-Sobolev inequality for non-symmetric diffusion semigroups, and finally prove the exponential contraction in gradient for ultracontractive semigroups in a general framework including both diffusion and jump Markov semigroups.

Let (E,ρ)(E,\rho) be a geodesic Polish space, i.e. it is a Polish space and for any two different points x,yEx,y\in E, there exists a continuous curve γ:E\gamma:\rightarrow E such that γ0=x,γ1=y\gamma_{0}=x,\gamma_{1}=y and ρ(γs,γt)=stρ(x,y)\rho(\gamma_{s},\gamma_{t})=|s-t|\rho(x,y) for s,t.s,t\in. Then for any fLipb(E)f\in{\rm Lip}_{b}(E), the class of bounded Lipschitz functions on EE, the length of gradient

is measurable. Moreover, let P(x,dy)P(x,\text{\rm{d}}y) be a Markov transition kernel and define the Markov operator

For any ΦNˉ{Φ}\Phi\in\bar{\mathscr{N}}\setminus\{\Phi_{\infty}\}, consider the Young norm induced by Φ\Phi with respect to PP

and set fLΦ(P)(x)=Pf(x).\|f\|_{L_{*}^{\Phi_{\infty}}(P)}(x)=P|f|(x). Then LΦp=LΦq\|\cdot\|_{L_{*}^{\Phi_{p}}}=\|\cdot\|_{L^{\Phi_{q}}} for p[1,],q=pp1.p\in[1,\infty],q=\frac{p}{p-1}. The following result follows from [16, Theorem 2.2, Remark 2 and Remark 3].

For any constant C>0C>0 and ΦNˉ\Phi\in\bar{\mathscr{N}}, the following statements are equivalent to each other:

PfCfLΦ(P)|\nabla Pf|\leq C\|\nabla f\|_{L_{*}^{\Phi}(P)} for fLipb(E).f\in{\rm Lip}_{b}(E).

WΦ(δxP,δyP)Cρ(x,y),  x,yE.W_{\Phi}(\delta_{x}P,\delta_{y}P)\leq C\rho(x,y),\ \ x,y\in E.

When Φ=Φp\Phi=\Phi_{p} for p[1,]p\in[1,\infty], they are also equivalent to

Wp(μ1P,μ2P)CWp(μ1,μ2),  μ1,μ2P(E).W_{p}(\mu_{1}P,\mu_{2}P)\leq CW_{p}(\mu_{1},\mu_{2}),\ \ \mu_{1},\mu_{2}\in\mathscr{P}(E).

2 Hyperboundedness and exponential convergence in entropy

When PtP_{t} is symmetric, it is well known that the hyperbounddeness, exponential convergence in entropy and the log-Sobolev inequality are equivalent each other, see and references within. In the non-symmetric case, the log-Sobolev inequality implies the former two properties if the generator LL and the symmetric part of the Dirichlet form E\mathscr{E} satisfy

for some constant c0>0c_{0}>0 and a reasonable class D\mathscr{D} of non-negative bounded functions, which is stable under PtP_{t} and dense in L+p(μ):={fLp(μ):f0}L^{p}_{+}(\mu):=\{f\in L^{p}(\mu):f\geq 0\} for any p1p\geq 1, see e.g. . In applications, it may be not easy to figure out the class D\mathscr{D} such that (3.2) holds. But in general this condition can be replaced by the following approximation formula Lemma 3.2 in the spirit of .

Now, consider the (Neumann) semigroup PtP_{t} generated by L:=Δ+ZL:=\Delta+Z for a local bounded vector field ZZ such that PtP_{t} has a unique invariant probability measure μ\mu. Let

Then (L,D0)(L,\mathscr{D}_{0}) is dissipative (thus, closable) in L1(μ)L^{1}(\mu) with closure (L,D1(L))(L,\mathscr{D}_{1}(L)) generating PtP_{t} in L1(μ)L^{1}(\mu), see e.g. and references within. Let

Let fDf\in\mathscr{D} and ψCb([essμinff,))\psi\in C_{b}^{\infty}([{\rm ess}_{\mu}\inf f,\infty)). There exists a sequence {fn}n1D0\{f_{n}\}_{n\geq 1}\subset\mathscr{D}_{0} with inffn=inff\inf f_{n}=\inf f such that fnff_{n}\rightarrow f in Lm(μ)L^{m}(\mu) for any m1m\geq 1, LfnLfLf_{n}\rightarrow Lf in L1(μ)L^{1}(\mu), and

Since fDD1(L)L(μ)f\in\mathscr{D}\subset\mathscr{D}_{1}(L)\cap L^{\infty}(\mu), there exists a uniformly bounded sequence {fn}n1D0\{f_{n}\}_{n\geq 1}\subset\mathscr{D}_{0} such that inffn=essμinff\inf f_{n}={\rm ess}_{\mu}\inf f and fnf,LfnLff_{n}\rightarrow f,Lf_{n}\rightarrow Lf in L1(μ)L^{1}(\mu). By the uniform boundedness, fnff_{n}\rightarrow f in Lm(μ)L^{m}(\mu) for any m1m\geq 1. Since ψCb([inffn,))\psi\in C_{b}^{\infty}([\inf f_{n},\infty)),

This implies μ(Lgn)=0\mu(Lg_{n})=0 since μ\mu is PtP_{t}-invariant. So, by the dominated convergence theorem,

Let ZZ be a locally bounded vector field such that the (Neumann) semigroup PtP_{t} generated by L:=Δ+ZL:=\Delta+Z has a unique invariant probability measure μ\mu.

holds for some βC((0,);(0,))\beta\in C((0,\infty);(0,\infty)), then for any constants q>p1q>p\geq 1 and γC((p,q);(0,))\gamma\in C((p,q);(0,\infty)) such that t:=pqγ(r)rdr<,t:=\int_{p}^{q}\frac{\gamma(r)}{r}\text{\rm{d}}r<\infty, there holds

(1) According to Lemma 3.2, for any fDf\in\mathscr{D} and p>1p>1, there exists {fn}n1D0\{f_{n}\}_{n\geq 1}\subset\mathscr{D}_{0} such that fnfp2f_{n}\rightarrow f^{\frac{p}{2}} in Lm(μ)L^{m}(\mu) for all m1m\geq 1, and

Applying (3.3) to fnf_{n} and using (3.5), we obtain

for γ(p):=β(4c(p)(1p1))pc(p).\gamma(p):=\frac{\beta(4c(p)(1-p^{-1}))}{pc(p)}. Noting that D\mathscr{D} is PtP_{t}-invariant (i.e. PtDDP_{t}\mathscr{D}\subset\mathscr{D}) and dense in L+p(μ)L_{+}^{p}(\mu) for any p1p\geq 1, the desired assertion follows from the proof of [13, Corollary 3.13].

(2) It suffices to prove for gDg\in\mathscr{D} with infg>0.\inf g>0. Applying Lemma 3.2 to f=Ptgf=P_{t}g and ψ(s)=1+logs\psi(s)=1+\log s, and using (3.4), we obtain

This implies the desired exponential estimate. ∎

3 Exponential contraction in gradient

In this part, we consider a general framework including both diffusion and jump processes. Let (E,F,μ)(E,\mathscr{F},\mu) be a separable complete probability space, and let PtP_{t} be a Markov semigroup on L2(μ)L^{2}(\mu) with μ\mu as invariant probability measure. Let (L,D(L))(L,\mathscr{D}(L)) be the generator of PtP_{t} in L2(μ)L^{2}(\mu). We assume that there exists an algebra AD(L)\mathscr{A}\subset\mathscr{D}(L) such that

1A1\in\mathscr{A}, A\mathscr{A} is dense in L2(μ)L^{2}(\mu) and the algebra induced by

Γ(f,g):=12(L(fg)fLggLf)\Gamma(f,g):=\frac{1}{2}(L(fg)-fLg-gLf) gives rise to a non-degenerate positive definite bilinear form on D×D\mathscr{D}\times\mathscr{D}; i.e., for any fDf\in\mathscr{D}, Γ(f,f)0\Gamma(f,f)\geq 0 and it equals to if and only if ff is constant.

In particular, when PtP_{t} is the (Neumann) semigroup generated by L:=Δ+ZL:=\Delta+Z on MM with RicZ\text{\rm{Ric}}_{Z} bounded below, the assumption holds for

is closable and the closure (E,D(E))(\mathscr{E},\mathscr{D}(\mathscr{E})) is a conservative symmetric Dirichlet form. Although PtP_{t} is not associated to (E,D(E))(\mathscr{E},\mathscr{D}(\mathscr{E})) when it is non-symmetric, we have

If PtL1(μ)L(μ)<,\|P_{t}\|_{L^{1}(\mu)\rightarrow L^{\infty}(\mu)}<\infty, then PtP_{t} has a heat kernel pt(x,y)p_{t}(x,y) with respect to μ\mu, i.e.

We consider the ``gradient” length Γf=Γ(f,f)|\nabla_{\Gamma}f|=\sqrt{\Gamma(f,f)} induced by Γ\Gamma. Note that for jump processes the length is non-local and thus essentially different from the usual gradient length. As shown below that estimates of ΓPt|\nabla_{\Gamma}P_{t}| have a close link to functional inequalities of the associated Dirichlet form.

Assume that there exist t1>0t_{1}>0 and ηC([0,);(0,))\eta\in C([0,\infty);(0,\infty)) such that

Then there exist constants c,λ,t2>0c,\lambda,t_{2}>0 such that for any q1q\geq 1 and ηqC([0,);(0,))\eta_{q}\in C([0,\infty);(0,\infty)), the gradient estimate

for some constants C,λ>0C,\lambda>0. By the second inequality in (3.7), for any t>0t>0 and fDf\in\mathscr{D} we have

Integrating both sides over [0,t][0,t] leads to

Taking t=t1t=t_{1} and noting that μ\mu is the invariant probability measure of PtP_{t}, we obtain

Since D(E)\mathscr{D}(\mathscr{E}) is the closure of D\mathscr{D} under the E1\mathscr{E}_{1}-norm, this inequality also holds for fD(E).f\in\mathscr{D}(\mathscr{E}). By condition (ii), the symmetric Dirichlet form is irreducible. So, according to [38, Corollary 1.2] the defective Poincaré inequality (3.11) implies the Poincaré inequality

for some constant λ>0\lambda>0. By (3.6) and that D\mathscr{D} is dense in L2(μ)L^{2}(\mu), the Poincaré inequality is equivalent to

On the other hand, by the second inequality in (3.7), for any t>0t>0 and fDf\in\mathscr{D} we have

Using Ptfμ(f)P_{t}f-\mu(f) to replace ff and integrating with respect to μ\mu, we obtain

Combining this with (3.13) and (3.12) we arrive at

for some constant c1>0c_{1}>0; that is, (3.10) holds for t>1.t>1. Finally, (3.7) implies (3.10) for t.t\in.

(b) Next, we intend to find out a constant t0t1t_{0}\geq t_{1} such that

Indeed, by (3.13) and the first inequality in (3.7), we obtain

where c0:=Pt1L1(μ)L(μ).c_{0}:=\|P_{t_{1}}\|_{L^{1}(\mu)\rightarrow L^{\infty}(\mu)}. This implies the desired assertion for t0>0t_{0}>0 such that c02eλt012c_{0}^{2}\text{\rm{e}}^{-\lambda t_{0}}\leq\frac{1}{2}.

(c) Finally, combining (3.7), (3.14), (3.10) and (3.12), we obtain

for some constants c1,c2,c3>0c_{1},c_{2},c_{3}>0. Then (3.9) holds for t2=2t0.t_{2}=2t_{0}.

Proof of Theorem 2.1

The proofs of the other two assertions are based on the log-Sobolev inequality and the log-Harnack inequality derived in and respectively for bounded below RicZ\text{\rm{Ric}}_{Z}.

(a) For two different points x,yMx,y\in M, let Px,y:TxMTyMP_{x,y}:T_{x}M\rightarrow T_{y}M be the parallel displacement along the minimal geodesic γ:[0,ρ(x,y)]M\gamma:[0,\rho(x,y)]\rightarrow M from xx to yy, and let Mx,y:=Px,y2,γ˙0γ˙ρ(x,y):TxMTyMM_{x,y}:=P_{x,y}-2\langle\cdot,\dot{\gamma}_{0}\rangle\dot{\gamma}_{\rho(x,y)}:T_{x}M\rightarrow T_{y}M be the mirror reflection. Both maps are smooth in (x,y)(x,y) outside the cut-locus Cut(M){\rm Cut}(M). According to and , the appearance of the cut-locus and/or a convex boundary helps for the success of coupling, i.e. it makes the distance between two marginal processes smaller. So, for simplicity, we may and do assume that both the cut-locus and the boundary are empty, see [2, Section 3] or [33, Chapter 2] for details.

where dI\text{\rm{d}}_{I} denotes the Itô differential introduced in on Riemannian manifolds, BtB_{t} is the dd-dimensional Brownian motion, and utu_{t} is the horizontal lift of XtX_{t} to the frame bundle O(M)O(M). Then XtX_{t} is a diffusion process generated by LL. To construct the coupling by reflection for short distance and parallel displacement for long distance, we introduce a cut-off function hC1([0,))h\in C^{1}([0,\infty)) which is decreasing such that h(r)=1h(r)=1 for rr0,r\leq r_{0}, h(r)=0h(r)=0 for rr0+1r\geq r_{0}+1, and 1h2\sqrt{1-h^{2}} is also in C1C^{1}, see e.g. [40, (3.1)] for a concrete example. To construct the coupling in the above spirit, we split the noise into two parts, i.e. to replace dBt\text{\rm{d}}B_{t} by h(ρ(Xt,Yt))dBt+1h(ρ(Xt,Yt))2dBth(\rho(X_{t},Y_{t}))\text{\rm{d}}B_{t}^{\prime}+\sqrt{1-h(\rho(X_{t},Y_{t}))^{2}}\text{\rm{d}}B_{t}^{\prime\prime} for two independent Brownian motions BtB_{t}^{\prime} and BtB_{t}^{\prime\prime}, then make reflection for the BtB_{t}^{\prime} part and parallel displacement for the BtB_{t}^{\prime\prime} part. More precisely, let (Xt,Yt)(X_{t},Y_{t}) solve the following SDE on M×MM\times M for (X0,Y0)=(x,y)(X_{0},Y_{0})=(x,y):

Since the coefficients of the SDE are at least C1C^{1} outside the diagonal {(z,z):zM}\{(z,z):z\in M\}, it has a unique solution up to the coupling time

We then let Xt=YtX_{t}=Y_{t} for tTt\geq T as usual. By the second variational formula and the index lemma (see e.g. the proof of [34, Lemma 2.3] and [29, (2.4)]), the process ρt:=ρ(Xt,Yt)\rho_{t}:=\rho(X_{t},Y_{t}) satisfies

for some one-dimensional Brownian motion btb_{t}. Thus, by condition (1.8),

Since h(ρt)=0h(\rho_{t})=0 for ρtr0+1\rho_{t}\geq r_{0}+1 while dρt<0\text{\rm{d}}\rho_{t}<0 when ρtr0+1,\rho_{t}\geq r_{0}+1, this implies

On the other hand, since h(ρt)=1h(\rho_{t})=1 for ρtr0\rho_{t}\leq r_{0}, as observed in we have

for some constants c,λ>0c,\lambda>0. Indeed, let

which proves (2.1). Therefore, the proof of (1) is finished since the second inequality therein is a simple consequence of (2.1).

(b) According to the proofs of [34, Proposition 3.1 and Theorem 1.1], our conditions imply that PtP_{t} is hyperbounded; that is, Pt24<\|P_{t}\|_{2\rightarrow 4}<\infty holds for some t>0t>0. Since (1.8) implies RicZ(K1+K2)\text{\rm{Ric}}_{Z}\geq-(K_{1}+K_{2}), by the hyperboundedness and [23, Theorem 2.1], we have the defective log-Sobolev inequality

for some constants C1,C2>0C_{1},C_{2}>0. Since the symmetric Dirichlet form E(f,g):=μ(f,g)\mathscr{E}(f,g):=\mu(\langle\nabla f,\nabla g\rangle) with domain H1,2(μ)H^{1,2}(\mu) is irreducible, according to (see also ), the log-Sobolev inequality (3.4) holds for some constant C>0C>0, so that (2) is proved.

(c) According to [25, Theorem 1.10] (see for the case without boundary), the log-Sobolev inequality implies the Talagrand inequality

Next, let PtP_{t}^{*} be the adjoint of PtP_{t} in L2(μ)L^{2}(\mu). By Proposition 3.3 for PtP_{t}^{*} in place of PtP_{t}, the log-Sobolev inequality implies

Moreover, according to [36, Theorem 1.1], the curvature condition RicZ(K1+K2)=:K\text{\rm{Ric}}_{Z}\geq-(K_{1}+K_{2})=:-K is equivalent to the log-Harnack inequality

By [39, Proposition 1.4.4(3)], this implies

Combining (4.4), (4.5) and (4.6), we obtain

for some constant c1>0c_{1}>0. Noting that RicZK\text{\rm{Ric}}_{Z}\geq-K implies PtfeKtPtf|\nabla P_{t}f|\leq\text{\rm{e}}^{Kt}P_{t}|\nabla f| (see e.g. ), by Proposition 3.1 we have

for some constants c,λ>0c,\lambda>0. Therefore, the proof of (3) is finished. ∎

Proof of Theorem 2.3 and Corollary 2.4

(1) Since RicZK\text{\rm{Ric}}_{Z}\geq-K for some constant K0K\geq 0, we have (see e.g. )

Combining this with Proposition 3.4 for q=1q=1 and noting that PtfP_{t}|\nabla f| is continuous, we obtain

for some constants c0,λ,t0>0c_{0},\lambda,t_{0}>0. Obviously, (3.1) implies

According to Proposition 3.1, this is equivalent to

Combining this with (5.1) and the semigroup property, we arrive at

This together with (5.1) implies (2.6) for some constants c,λ>0.c,\lambda>0. Moreover, (2.7) follows from (2.6) according to Proposition 3.1.

Then using the standard semigroup calculation of Bakry-Emery, this implies

Since limtPtg=μ(g)\lim_{t\rightarrow\infty}P_{t}g=\mu(g) for gBb(M)g\in\mathscr{B}_{b}(M) due to the ergodicity, by letting tt\rightarrow\infty we prove the log-Sobolev inequality for (3.4) for C=2c2λ.C=\frac{2c^{2}}{\lambda}.

We first observe that the proof of [34, Theorem 4.2] works also for the non-symmetric case with Z\nabla Z in place of HessV\text{\rm{Hess}}_{V}, so that

Since in the symmetric case we have PtL1(μ)L(μ)Pt/2L2(μ)L(μ)2\|P_{t}\|_{L^{1}(\mu)\rightarrow L^{\infty}(\mu)}\leq\|P_{t/2}\|_{L^{2}(\mu)\rightarrow L^{\infty}(\mu)}^{2}, the first assertion follows immediately from Theorem 2.3.

by Theorem 2.3 and (5.2) it suffices to prove

for some constant c>0.c^{\prime}>0. According to [23, Theorem 2.1], (5.2) implies the super log-Sobolev inequality (3.3) for

for some (possibly different) constant c>0c>0. Then Proposition 3.3 with p=1,q=2p=1,q=2 and γ(r):=trh(r1)(r1)01s1h(s)ds\gamma(r):=\frac{trh(r-1)}{(r-1)\int_{0}^{1}s^{-1}h(s)\text{\rm{d}}s} implies (5.3).

Proofs of Theorems 2.5-2.6 and Proposition 2.7

Let Xt(x)X_{t}(x) solve (2.15) with initial point xx. By Itô’s formula and condition (2.16) we obtain

for some martingale MtM_{t}. This implies

Then the desired assertion follows from Proposition 3.1. ∎

where BtB_{t}^{\prime} and BtB_{t}^{\prime\prime} are independent dd-dimensional Brownian motions. For any xyx\neq y, let XtX_{t} solve this SDE with X0=xX_{0}=x, and let YtY_{t} solve the following coupled SDE with Y0=yY_{0}=y:

That is, under the flat metric we have made coupling by reflection for BtB_{t}^{\prime\prime} and coupling by parallel displacement for BtB_{t}^{\prime}. Obviously, the coupled SDE has a unique solution up to the coupling time

We set Yt=XtY_{t}=X_{t} for tTx,yt\geq T_{x,y} as usual. Then by (2.17) and Itô’s formula, we obtain

By repeating the argument leading to (4.3), it is easy see that (6.3) and (6.4) imply

for some constants c,λ>0c,\lambda>0 independent of x,yx,y. Therefore,

so that the first assertion follows from Proposition 3.1.

(2) According to [37, Theorem 1.1], aαIa\geq\alpha I and (2.19) imply the log-Harnack inequality

for some constants c1,c2>0c_{1},c_{2}>0. Combining this with the log-Sobolev inequality, we prove the second assertion as in (c) in the proof of Theorem 2.1.

(3) According to the proof of Theorem 2.5, the condition (2.16) implies the gradient estimate (6.1). Next, by Proposition 3.4, the ultracontractivity and (6.1) imply

for some c(p)>0c(p)>0 and λ>0\lambda>0 independent of pp. Then the proof if finished by Proposition 3.1. ∎

We will apply results in and . To this end, we introduce the Riemannian metric

and let Δg,g,Hessg\Delta^{g},\nabla^{g},\text{\rm{Hess}}^{g} be the corresponding Laplacian, gradient and Hessian tensor respectively. Then L=Δg+ZL=\Delta^{g}+Z for some C1C^{1} vector field ZZ. We first verify the Bakry-Emery curvature condition (1.1) for some constant KK. Using the Christoffel symbols, the intrinsic Hessian tensor induced by gg is formulated as

Thus, by Bochner-Weitzenböck formula and (2.22), at point xx there holds

for some constant K1K_{1}. Then (1.1) hold for some constant KK.

Next, (2.23) implies that PtP_{t} has a unique invariant probability measure μ\mu such that μ(ec22)<\mu(\text{\rm{e}}^{c_{2}|\cdot|^{2}})<\infty for some c2>K2αc_{2}>\frac{K}{2\alpha}. By our assumption on aa, the Riemannian distance ρ\rho induced by the metric gg is equivalent to the Euclidian metric:

Then we may repeat the proof of [23, Corollary 2.5] with γ(r)=c2rδ\gamma(r)=c_{2}r^{\delta} and ρ=\rho=|\cdot| to prove

for some constant c3>0.c_{3}>0. Combining this with the curvature condition (1.1), we obtain from [23, Theorem 2.1] for p=2p=2 and q=q=\infty that

holds for some constant c4>0c_{4}>0. Applying Proposition 3.3 below for p=1,q=2p=1,q=2 and γ(r)=c5t(r1)δ12δ1\gamma(r)=c_{5}t(r-1)^{\frac{\delta-1}{2\delta}-1} for constant c5>0c_{5}>0 such that t=12γ(r)rdrt=\int_{1}^{2}\frac{\gamma(r)}{r}\text{\rm{d}}r, we obtain

for some constant c6>0c_{6}>0. Combining this with (6.6) we arrive at

The author would like to thank Jian Wang for helpful comments.

References