Error analysis of the transport properties of Metropolized schemes

Max Fathi, A. -A. Homman, G. Stoltz

Description of the model

Standard results (see for instance the references in [13, Section 2.2]) show that (1) admits the Boltzmann-Gibbs measure

as its unique invariant probability measure (note that ZZ is finite since the position space M\mathcal{M} is compact and VV is smooth hence bounded). In fact, (1) is ergodic with respect to this measure, where ergodicity is understood both as (i) the long-time (almost-sure) convergence of averages along trajectories

for any initial condition q0Mq_{0}\in\mathcal{M} and all observables fL1(μ)f\in L^{1}(\mu); or as (ii) the convergence of the law ψ(t,q)dq\psi(t,q)\,dq of the process (1), happening here at an exponential rate, for instance in total variation: Denoting with some abuse of notation the measure ψ(t,q)dq\psi(t,q)\,dq by ψ(t)\psi(t), there exist C,λ>0C,\lambda>0 such that

where the total variation distance between two measures ν1,ν2\nu_{1},\nu_{2} is defined as

the suprema being taken over all measurable sets of M\mathcal{M} for the first one, and over all bounded, measurable functions for the second one.

For further purposes, we introduce the generator of (1), namely the operator

This operator (defined with domain D(L)=H2(μ)D(\mathcal{L})=H^{2}(\mu)) is self-adjoint on the Hilbert space L2(μ)L^{2}(\mu) endowed with the scalar product

The operator L-\mathcal{L} moreover has a positive spectral gap (see for instance [13, Section 2] and references therein). Indeed, a simple computation shows that

The Poincaré inequality φL2(μ)CM,VφL2(μ)\|\varphi\|_{L^{2}(\mu)}\leqslant C_{\mathcal{M},V}\|\nabla\varphi\|_{L^{2}(\mu)}, valid for any function belonging to

which shows that the spectral gap is larger or equal to CM,V1C_{\mathcal{M},V}^{-1}. In particular, the resolvent L1\mathcal{L}^{-1} is a well-defined operator on L~2(μ)\widetilde{L}^{2}(\mu), and the following estimate holds:

Here and in the sequel, for a given Banach space XX, we denote by B(X)\mathcal{B}(X) the Banach space of bounded operators on XX, endowed with the norm

2. Self-diffusion

where the expectation is over all realizations of the continuous dynamics (1), starting from initial conditions distributed according to the Boltzmann-Gibbs measure (2). The following result shows that the diffusion tensor (8) is well defined, and naturally arises in a diffusive time-rescaling of the dynamics (1).

Consider for ε>0\varepsilon>0 the diffusively rescaled process Qtε=εQt/ε2Q_{t}^{\varepsilon}=\varepsilon Q_{t/\varepsilon^{2}}. Then, as ε0\varepsilon\to 0, the process QtεQ_{t}^{\varepsilon} starting from a given initial condition Q0Q_{0} weakly converges on finite time intervals to an effective Brownian motion starting from Q0Q_{0} and with covariance matrix D\mathscr{D} given by (8). Moreover, D\mathscr{D} is a real, positive definite dN×dNdN\times dN matrix, satisfying

in the sense of symmetric matrices, and which can alternatively be expressed as

where the expectation is over all realizations of the continuous dynamics (1), starting from initial conditions distributed according to the Boltzmann-Gibbs measure (2).

The proof of this statement is standard, and follows from arguments presented in [1, Chapter 3] for instance. We nonetheless provide a short proof in Section 5.1 since the proofs of the discrete counterparts of Theorem 1 rely on an appropriate extension of the argument used in the continuous case (see Section 5.5).

A straightforward consequence of Theorem 1 is that the self-diffusion constant D\mathcal{D}, defined as the average mean-square displacement of the individual particles, is well defined and has two equivalent expressions:

The expression (10) is called the Einstein formula. The second expression (11) involves an integrated autocorrelation function. In accordance with the standard physics and chemistry nomenclature, we call (11) the Green-Kubo formula for the self-diffusion in the sequel.

3. Numerical estimation of the self-diffusion

In order to compute approximations of formulas such as (10) or (11), the first task is to numerically integrate realizations of the continuous dynamics (1). The most straightforward way would be to resort to a Euler-Maruyama scheme: given a time-step Δt>0\Delta t>0 and denoting by qnq^{n} an approximation of qnΔtq_{n\Delta t}, this scheme reads

where (Gn)n0(G^{n})_{n\geqslant 0} is a sequence of independent and identically distributed (i.i.d.) dNdN-dimensional standard Gaussian random variables. However, this simple scheme has been shown to fail to be ergodic when the dynamics is considered on unbounded spaces and the potential energy function is not globally Lipschitz . In simulations of Brownian dynamics for ionic solutions, potential energy functions with Coulomb-type singularities are used and it has been observed that the energy blows up along trajectories of (12) (see Section 3.2).

is the probability transition of the Markov chain (12). When the proposition is accepted, we project q~n+1\widetilde{q}^{n+1} into the periodic simulation cell M\mathcal{M}. If the proposal is rejected, the previous configuration is counted twice: qn+1=qnq^{n+1}=q^{n} (It is very important to count rejected configuration as many times as needed to ensure that the Boltzmann-Gibbs measure μ\mu is invariant). In conclusion,

where UnU^{n} are i.i.d. uniform random variables in $,and, and\mathbf{1}_{U^{n}\leqslant\alpha}isanindicatorfunctionwhosevalueis1whenis an indicator function whose value is 1 whenU^{n}\leqslant\alpha$ and 0 otherwise. The average rejection rate is

In order to avoid confusion, we call the scheme (13) “Metropolized Euler-Maruyama” in the sequel, and denote by PΔtP_{\Delta t} its evolution operator:

Of course, the fact that some configurations are rejected destroys the trajectorial accuracy of the dynamics, see for precise statements. The resulting strong errors and, as a consequence, errors on finite time correlation functions have been quantified in , with prefactors which unfortunately depend on time. The estimates provided by these works therefore do not provide error estimates on diffusion coefficients, obtained either as infinite time integrals of correlation functions as in the Green-Kubo formula (11) or as the infinite time average mean square displacement as in Einstein’s formula (10).

The next section quantifies the errors in the approximation of (10) and (11) when the Metropolized Euler-Maruyama scheme is used. Although the formulas (10) and (11) are equivalent for continuous dynamics, they lead to different numerical methods. Let us already emphasize that the errors on the diffusion coefficients are in fact determined by the expansion of the evolution operator PΔtP_{\Delta t}. This expansion is the same as the one used to establish weak error estimates. From a technical viewpoint, the techniques used in the proofs of our main results are therefore quite different from the techniques of , which are based on strong error estimates obtained with Gronwall’s lemma.

A priori error estimates on the self-diffusion

As discussed in , error bounds on transport properties in fact depend on approximation properties of the evolution operator (similar to the one used to prove weak error estimates), rather than strong error estimates. A key building block in this framework is the following expansion of the evolution operator, obtained by a slight extension of [5, Lemma 4.7] and [4, Lemma 5.5].

There exist an operator AA and Δt>0{\Delta t}^{*}>0 such that, for any 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*} and any smooth function ψ\psi,

with a remainder rψ,Δtr_{\psi,{\Delta t}} uniformly bounded for 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*}. Moreover,

where the expectation is over all possible realizations of the standard dNdN-dimensional Gaussian random variable GG.

The precise expression of the operator AA is unimportant. It is however given in the proof of this result, see Section 5.2. Note that the numerical scheme can be proved to be weakly first order accurate by relying on standard techniques , in view of the equality

Another important result which we will repeatedly use in the analysis below is the following uniform-in-Δt{\Delta t} geometric ergodicity of the Metropolized Euler-Maruyama scheme, easily obtained by adapting the results of to the case of compact position spaces (for completeness, we nonetheless provide elements of proof in Section 5.3). To state the result, we introduce the following functional space

As a consequence, there exists K>0K>0 such that

We first give error estimates on (11) by appropriately adapting the results from . The result is stated for two smooth observables ψ,φ\psi,\varphi with average 0 with respect to μ\mu. Define to this end

Error estimates for (11) are obtained by setting ψ=φ=qi,αV\psi=\varphi=\partial_{q_{i,\alpha}}V, with 1iN1\leqslant i\leqslant N and 1αd1\leqslant\alpha\leqslant d. Note indeed that a simple integration by parts shows that qi,αV\partial_{q_{i,\alpha}}V has average 0 with respect to μ\mu, so qi,αVC~(M)\partial_{q_{i,\alpha}}V\in\widetilde{C}^{\infty}(\mathcal{M}).

Consider two observables ψ,φC~(M)\psi,\varphi\in\widetilde{C}^{\infty}(\mathcal{M}), and define the modified observable

where the operator AA is defined in (15). Then, there exists Δt>0{\Delta t}^{*}>0 such that, for any 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*},

with rψ,φ,Δtr_{\psi,\varphi,{\Delta t}} uniformly bounded (with respect to Δt{\Delta t}), and where the expectation on the left hand side of the above equation is with respect to initial conditions q0μq_{0}\sim\mu and over all realizations of the dynamics (1), while the expectation on the right hand side is with respect to initial conditions q0μq^{0}\sim\mu and over all realizations of the Metropolized Euler-Maruyama scheme (13).

As a corollary, we obtain first order error bounds on the computation of the self-diffusion through (11):

where D~ΔtGK\widetilde{\mathcal{D}}^{\rm GK}_{\Delta t} is uniformly bounded for Δt{\Delta t} sufficiently small, and where the numerically computed self-diffusion reads

The expression of the correction term is obtained by replacing the modified observable by its expression:

The appearance of subleading fractional correction term in (20) (here, of order Δt3/2{\Delta t}^{3/2}) is typical of Metropolis algorithms, and is usually not encountered for standard, un-Metropolized discretizations of SDEs (compare with the results of ).

2. Error bounds on the Einstein formula

In this section, we investigate the discretization error made when using the Metropolized Euler-Maruyama scheme to approximate the self-diffusion using (10). In accordance with the definition (7), we introduce a discrete additive functional allowing to keep track of the diffuse behavior of the Markov chain: Starting from Q0=q0Q^{0}=q^{0},

where, as in Theorem 2, the expectation on the right hand side is with respect to initial conditions Q0=q0μQ^{0}=q^{0}\sim\mu and for all realizations of the Metropolized Euler-Maruyama scheme.

There exists Δt>0{\Delta t}^{*}>0 such that, for any 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*},

The proof of this result can be read in Section 5.5. In fact, a slight extension of our technique of proof would allow to show that the diffusively rescaled process generated by the Metropolized Euler-Maruyama scheme, namely εQt/(Δtε2)\varepsilon Q^{\lfloor t/({\Delta t}\,\varepsilon^{2})\rfloor} (where x\lfloor x\rfloor denote the unique integer such that xx<x+1\lfloor x\rfloor\leqslant x<\lfloor x\rfloor+1), weakly converges on finite time intervals to a Brownian motion with covariance matrix DΔt\mathscr{D}_{\Delta t}.

An immediate corollary of Theorem 3 is the following a priori error estimate on the self-diffusion:

where D~ΔtEinstein\widetilde{\mathcal{D}}^{\rm Einstein}_{\Delta t} is uniformly bounded for Δt{\Delta t} sufficiently small. Some more work would allow to prove that the subleading correction term is of order Δt3/2{\Delta t}^{3/2}, as in the Green-Kubo case (see Remark 4).

Numerical illustration

The aim of this section is to illustrate the errors bounds (20) and (24). We perform long computations so that the statistical errors are negligible. Let us mention that numerical simulations illustrating timestep errors for velocity autocorrelation functions were already presented in .

The self-diffusion coefficient DΔtEinstein\mathcal{D}_{\Delta t}^{\rm Einstein} for the Einstein approach is approximated by fitting the unnormalized self-diffusion

by a linear function DΔtEinstein,MnΔt\mathcal{D}^{{\rm Einstein},M}_{\Delta t}\,n{\Delta t}, the slope being the estimation of the self-diffusion for the time step under consideration. This is indeed confirmed by Figure 1 (Left), which presents the evolution of DnMD_{n}^{M} as a function of the physical time nΔtn{\Delta t}. The results produced in Figures 1 and 3 have been obtained with M=107M=10^{7} replicas and nEinstein=3×105n_{\rm Einstein}=3\times 10^{5} steps. Let us also note that, in accordance with (17), the rejection rate scales as Δt3/2{\Delta t}^{3/2}.

A numerical approximation of (21) requires both a discretization using finitely many replicas, but also a truncation of the integration in time with an upper bound τ\tau. We consider the following numerical estimation of the self-diffusion coefficient obtained with the Green-Kubo formula:

The correlation functions shown in Figure 2 suggest that the autocorrelation of VV^{\prime} is exponentially decreasing. It can be considered as negligible for times larger than 0.2. The numerical results reported in Figure 2 and 3 have been obtained with M=2×108M=2\times 10^{8} replicas and a time cut-off τ=0.3\tau=0.3.

The results presented in Figure 3 indeed confirm that, for small time steps Δt{\Delta t}, the error in the self-diffusion is of order Δt{\Delta t} for both methods. For larger time steps, nonlinearities appear. Note also that the errors on the coefficients computed with the Green-Kubo formula are smaller in this simple case. In any case, in accordance with the statements of Theorem 1, the self-diffusion is between 0 and 1 when Δt0{\Delta t}\to 0.

2. The more realistic case of solvated ions

We additionally consider a large ion, modeled as a fixed particle at position qionq_{\rm ion} (the center of the simulation box), whose interaction with the solvent particles is described by an attractive Yukawa potential (screened Coulomb interaction) plus some repulsive potential preventing the small particles from coming too close to the ion. More precisely, for a solvent particle at position qq, the interaction reads vion(qqion)v_{\rm ion}(|q-q_{\rm ion}|), with

The potentials vv and vionv_{\rm ion} are plotted in Figure 4.

Expectations are approximated using MM trajectories of the system. We integrate trajectories one after the other, using the Metropolized Euler scheme (13), with initial conditions for the (m+1)(m+1)th trajectory obtained by taking the last configuration of the mmth trajectory. The self-diffusion coefficient calculated with the Green-Kubo formula is obtained as in the previous section, using the estimator (26) (upon introducing the correct normalization factor 1/(3N)1/(3N) in the autocorrelation and replacing VV^{\prime} by V\nabla V). The values obtained by the Einstein formula are computed by dividing the unnormalized diffusion DnMD_{n}^{M} defined in (25) by the final time of the simulation: for a simulation time τ\tau,

The results presented in Figure 5 show that the unnormalized mean squared displacement indeed grows linearly in time, as expected. Note also that, in accordance with (17), the rejection rate scales as Δt3/2{\Delta t}^{3/2}. The results of Figure 5 and (7) have been obtained with M=105M=10^{5} trajectories, with integrations performed up to τ=20\tau=20.

The results presented in Figure 6 suggest that the decay of the force autocorrelation cannot be represented by a single exponential function. The force autocorrelation can be considered to be small in relative value for times of the order of 0.10.1. The numerical results reported in Figure 6 and 7 were obtained by averaging M=106M=10^{6} trajectories with an integration time τ=0.3\tau=0.3.

Error estimates for the diffusion coefficients are gathered in Figure 7. For small time steps Δt\Delta t, the error in the self-diffusion is linear in Δt\Delta t, while nonlinearities appear for larger time steps. The continuous lines are linear fits obtained over the values corresponding to the 10 smallest time steps. As in the simple example discussed in the previous section, estimates obtained with Green-Kubo’s formula seem more reliable than those obtained with Einstein’s formula. Note also that, in accordance with Theorem 1, the self-diffusion is between 0 and 1 in all cases for sufficiently small time-steps.

Possible work tracks to reduce the error on the estimation of the self-diffusion

Both the Green-Kubo and the Einstein approaches lead to discretization errors of order Δt{\Delta t}, as proved theoretically and verified numerically. A natural question is how to reduce this error. In the chemistry literature, it was proposed in to renormalize the time in Einstein’s method by replacing the simulation time nΔtn{\Delta t} appearing in the denominator of (23) by some effective time θΔtnΔt\theta_{\Delta t}n{\Delta t}, where θΔt\theta_{\Delta t} is the average acceptance rate of the Metropolis algorithm for a given time step Δt{\Delta t}. However, since the average acceptance rate is of order 1CΔt3/21-C{\Delta t}^{3/2} for small time steps (see (17)), such a correction cannot possibly cancel out the Δt{\Delta t}-error in the diffusion coefficient.

A more promising work track, which we started working on at the end of our stay at CIRM, is to modify the proposed move (instead of simply considering the Euler-Maruyama scheme (12)), and possibly the invariant measure as well, in order to increase the weak and strong orders of the associated Metropolized scheme. Some steps in this direction have already been pursued in . In fact, the proofs of Theorem 2 and 3 show that, in order to gain accuracy on the computation of transport coefficients, it is sufficient to find a numerical scheme such that

A key element to obtain such equalities is to decrease the rejection rate for small time steps.

Proof of the results

In all the proofs, the constants C>0C>0 and the critical time steps Δt{\Delta t}^{*} may change from line to line. Upon changing VV into βV\beta V, we may also assume that β=1\beta=1.

The idea is to rewrite the part of the additive functional involving V(qt)-\nabla V(q_{t}) as an approximate martingale. To this end, we introduce the solutions Φ0=(Φ0,1,..,Φ0,dN)\Phi_{0}=(\Phi_{0,1},..,\Phi_{0,dN}) of the following Poisson equations

In view of the resolvent estimate (6), the functions Φ0,j\Phi_{0,j} are well defined elements of L~2(μ)\widetilde{L}^{2}(\mu) since

In addition, by elliptic regularity, the functions Φ0,j\Phi_{0,j} are smooth. By Ito’s lemma, we therefore obtain

so that the integrated displacement from the origin can be rewritten as

The ergodic properties of the diffusion process qtq_{t} allow to prove that

Since L-\mathcal{L} has a positive spectral gap on L2(μ)L^{2}(\mu) (see (5)), we can write the following operator equality on L~2(μ)\widetilde{L}^{2}(\mu) :

Therefore, for general functions ψ,φ\psi,\varphi with vanishing average with respect to μ\mu,

Combining this result with (32) leads to the expression (9) of the diffusion matrix D\mathscr{D}. To prove the convergence of the processes, two arguments should be made precise (see for an elementary account):

the convergence of the finite-dimensional laws, which can be obtained very simply here by considering the exponential martingales

which are such that the conditional expectations converge to those of a Brownian motion as ε0\varepsilon\to 0:

Fs/ε2\mathcal{F}_{s/\varepsilon^{2}} denoting the filtration of events until the time s/ε2s/\varepsilon^{2}. Finite-dimensional laws are then obtained by a simple induction, as made precise in for instance.

the tightness of the process, proved using Prohorov’s criterion (see for instance ):

This criterion is satisfied in view of the tightness of the martingale MtF,ε\mathscr{M}^{F,\varepsilon}_{t}, itself easily obtained using Doob’s inequality (see ).

2. Proof of Lemma 1

We first determine the magnitude of the acceptance probability in the Metropolis algorithm, which reads

Using the following expansions (with integral remainders)

while there exists a constant C>0C>0 such that ξ~Δt(q,G)C(1+G6)\left|\widetilde{\xi}_{\Delta t}(q,G)\right|\leqslant C(1+|G|^{6}) for any 0Δt10\leqslant{\Delta t}\leqslant 1. We next use the inequality

obtained by distinguishing the cases x0x\leqslant 0 and x0x\geqslant 0. This shows that

with ξ+(q,G)=max(0,ξ(q,G))\xi_{+}(q,G)=\max(0,\xi(q,G)) and where ξ^Δt(q,G)C(1+G12)\left|\widehat{\xi}_{\Delta t}(q,G)\right|\leqslant C(1+|G|^{12}). The estimate on the average acceptance rate (17) is obtained by taking the expectation over all possible realizations of GnG^{n}, with the definition

To obtain the action of the operator AA, we start from the expression of the Metropolis transition operator (see for instance [13, Section 2.1.2])

We now write q=qΔtV(q)+2Δtgq^{\prime}=q-{\Delta t}\,\nabla V(q)+\sqrt{2{\Delta t}}\,g, so that, using (37) to estimate the rejection rate,

where we have used for the first integral a Taylor expansion at fourth order similar to (34) to obtain (see the computations in [12, Section 4.9])

and a Taylor expansion at first order for the term involving the rejection rate to obtain

The remainder rψ,Δtr_{\psi,{\Delta t}} is uniformly bounded in L(M)L^{\infty}(\mathcal{M}) for Δt{\Delta t} sufficiently small. The conclusion follows by setting A=(L2+D1+D2)/2A=(\mathcal{L}^{2}+D_{1}+D_{2})/2.

Finally, the invariance of μ\mu by PΔtP_{\Delta t} implies

This equality, together with the expansion (15) proves (16).

3. Proof of Lemma 2

The proof below is a simplification of the argument presented in , made possible since we work on a compact state space. The idea of the proof is to compare the Metropolis dynamics to the continuous dynamics, using the standard, un-Metropolized Euler-Maruyama scheme as an intermediate. Alternatively, it would be possible to directly compare the Metropolized and un-Metropolized schemes, by proving as in [12, Section 4.2] that the standard, un-Metropolized Euler-Maruyama scheme is geometrically ergodic since it satisfies a minorization condition.

Since we work on a compact state space, it is enough to show by Harris’ theorem (see the presentation in ) that there exist α(0,1)\alpha\in(0,1) such that

uniformly in 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*} and (q,q)M2(q,q^{\prime})\in\mathcal{M}^{2}. We now introduce the transition kernel QtQ_{t} of the continuous dynamics (1), defined as

and consider it at time t=1t=1. The transition kernel is well defined and regular since the generator is elliptic. By the triangle inequality,

From [4, Lemma 2.7], since we work on a compact space, we know that there exists ε>0\varepsilon>0 such that

To control the second term in (39), we introduce the transition kernel of the standard, un-Metropolized Euler-Maruyama scheme (12), denoted by P~Δt\widetilde{P}_{\Delta t}, and write

By [4, Lemma 4.2], the transition kernel of the standard, un-Metropolized dynamics is uniformly close to the transition kernel of the continuous dynamics when the state space is compact: There exists CEM>0C_{\rm EM}>0 and Δt>0{\Delta t}^{*}>0 such that, for any 0<ΔtΔt0<{\Delta t}\leqslant{\Delta t}^{*},

It therefore remains to control the distance between the transition rate of the Metropolized and un-Metropolized dynamics. It is at this stage that the argument of [4, Lemma 4.6] can be simplified. As in the proof of this lemma, we use a coupling argument, and consider two chains qiq^{i} and q~i\widetilde{q}^{i}, corresponding respectively to the Metropolized dynamics and the standard un-Metropolized one, starting from the same initial condition q0Mq^{0}\in\mathcal{M}. The probability that qnq~nq^{n}\neq\widetilde{q}^{n} is bounded from above by the probability that qiq~iq^{i}\neq\widetilde{q}^{i} for some 1in1\leqslant i\leqslant n, i.e. at least one rejection occurred along the discrete trajectory. Since the probability to reject the move from qiq^{i} to qi+1q^{i+1} is 1RΔt(qi,qi+1)1-R_{\Delta t}(q^{i},q^{i+1}), it holds

The combination of all previous estimates finally gives (38) provided Δt{\Delta t} is sufficiently small.

from which the bound (19) immediately follows.

4. Proof of Theorem 2

We follow the strategy of [12, Section 4.8]. The proof starts by noticing that the integrated correlation function can be written using L1\mathcal{L}^{-1}, as made precise in (33). The strategy of the proof is to write an approximation of L1\mathcal{L}^{-1} using the discrete evolution operator PΔtP_{\Delta t}.

In view of (40) and (15), and since C~(M)\widetilde{C}^{\infty}(\mathcal{M}) is stable under L1\mathcal{L}^{-1}, it holds

where the sum is convergent in view of (18). In conclusion,

5. Proof of Theorem 3

We start by highlighting the martingale part of the increments δΔt(qn,Gn,Un)=Qn+1Qn\delta_{\Delta t}(q^{n},G^{n},U^{n})=Q^{n+1}-Q^{n}, similarly to the continuous case (compare (31)) :

where the expectation on the right-hand side is over the Gaussian random variable GG and the uniform variable UU (the configuration qnq^{n} being fixed). It will be useful to decompose the increment as

In view of Lemma 3 (which shows in particular that δΔtreject(q,G,U)\delta^{\rm reject}_{\Delta t}(q,G,U) can be thought of as being of order Δt2{\Delta t}^{2} by setting p=1p=1 in (43)), the first term on the right-hand side of (41) is equal to 2ΔtGn\sqrt{2{\Delta t}}\,G^{n} at dominant order in Δt{\Delta t}. This term therefore corresponds to the term 2Wt\sqrt{2}W_{t} in the decomposition (31). The second term in the right-hand side of (41) is handled by introducing an appropriate Poisson equation (see (44) below), which is the discrete analogue of (30).

The average increment has the following expansion in powers of Δt{\Delta t}:

where rδ,Δtr_{\delta,{\Delta t}} is uniformly bounded for Δt{\Delta t} sufficiently small. In addition, there exists a constant K>0K>0 such that δΔtreject(q,G,U)K(1+ΔtG)\left|\delta^{\rm reject}_{\Delta t}(q,G,U)\right|\leqslant K\left(1+\sqrt{{\Delta t}}|G|\right) and, for any p>0p>0,

There exists, for any Δt>0{\Delta t}>0, a unique function ΦΔt=(ΦΔt,1,,ΦΔt,dN)(L~(M))dN\Phi_{\Delta t}=(\Phi_{{\Delta t},1},\dots,\Phi_{{\Delta t},dN})\in\left(\widetilde{L}^{\infty}(\mathcal{M})\right)^{dN} such that

Moreover, recalling the definition (30) of Φ0\Phi_{0},

where ΨΔt\Psi_{\Delta t} is uniformly bounded for Δt{\Delta t} sufficiently small, and Φ~1\widetilde{\Phi}^{1} is the unique solution of the Poisson equation

Since ΦΔt\Phi_{\Delta t} is uniformly bounded as Δt0{\Delta t}\to 0, we obtain

We now expand MΔt0M^{0}_{\Delta t} in powers of Δt{\Delta t}. By Lemma 4, it is possible to replace the function ΦΔt\Phi_{\Delta t} in the second term on the right-hand side of (46) by Φ0+ΔtΦ~1\Phi_{0}+{\Delta t}\widetilde{\Phi}^{1}, up to a remainder of order Δt3/2{\Delta t}^{3/2}. We next use the following lemma to compute the cross correlation between the various functions of Δt{\Delta t} appearing in MΔt0MΔt0M^{0}_{\Delta t}\otimes M^{0}_{\Delta t}.

For any smooth functions f,gf,g, growing at most polynomially,

The conclusion then follows by applying this result with the functions f,gf,g replaced by xFTxx\mapsto F^{T}x, xFTΦ0(q+x)x\mapsto F^{T}\Phi_{0}(q+x) and xFTΦ~1(q+x)x\mapsto F^{T}\widetilde{\Phi}^{1}(q+x) for a given test direction FF. Indeed,

with rF,Δt/F2|r_{F,{\Delta t}}|/|F|^{2} uniformly bounded as Δt0{\Delta t}\to 0. Manipulations similar to the ones leading to (32) finally give the claimed result.

In order to characterize the leading order term in the error and prove that the subleading order term indeed is of order Δt3/2{\Delta t}^{3/2}, it would be necessary to compute correlation terms involving components of the remainder term ΨΔt\Psi_{\Delta t}. This is not possible as such because the regularity of ΨΔt\Psi_{\Delta t} is not established, and obtaining regularity result from (44) is difficult. An expansion of ΦΔt\Phi_{\Delta t} up to Δt2{\Delta t}^{2} terms (as Φ0+ΔtΦ~1+Δt3/2Φ~3/2+Δt2Ψ~Δt\Phi_{0}+{\Delta t}\widetilde{\Phi}^{1}+{\Delta t}^{3/2}\widetilde{\Phi}^{3/2}+{\Delta t}^{2}\widetilde{\Psi}_{\Delta t}) is therefore needed in order not have to treat correlations involving the remainder ΨΔt\Psi_{\Delta t}. This, in turn, would require an expansion of PΔtP_{\Delta t} up to remainders of order Δt3{\Delta t}^{3}, instead of Δt5/2{\Delta t}^{5/2} as in (15). Although this does not pose any problem in principle, we chose not to follow this path in order to keep the arguments as simple as possible.

We conclude this section with the proofs of the technical results quoted above.

The bound (43) is a straightforward consequence of the equality

while δΔtreject(q,G,U)2ΔtGΔtV(q)\left|\delta^{\rm reject}_{\Delta t}(q,G,U)\right|\leqslant\left|\sqrt{2{\Delta t}}\,G-{\Delta t}\,\nabla V(q)\right| immediately gives δΔtreject(q,G,U)K(1+ΔtG)\left|\delta^{\rm reject}_{\Delta t}(q,G,U)\right|\leqslant K\left(1+\sqrt{{\Delta t}}|G|\right). ∎

We introduce the normalized average increment, defined as the following periodic function :

Lemma 2 shows that ΦΔt\Phi_{\Delta t} is well defined provided the periodic function δΔt\overline{\delta}_{\Delta t} has a vanishing average with respect to μ\mu. To prove this statement, we start from

Since qq and qq^{\prime} play symmetric roles, I0I_{0} vanishes. For n0n\geqslant 0, we obtain, by first exchanging the names of the dummy variables qq and qq^{\prime} and then using TΔt(q+nL,q)=TΔt(q,qnL)T_{\Delta t}(q+nL,q^{\prime})=T_{\Delta t}(q,q^{\prime}-nL) as well as the invariance of μ\mu by translations of the periodic cell,

Therefore, In=InI_{n}=-I_{-n}, which allows to conclude that

To obtain an expansion of ΦΔt\Phi_{\Delta t} in terms of fractional powers of Δt{\Delta t}, we rely on (42), which implies that

This gives the result upon defining ΨΔt=Δt3/2(ΦΔtΦ0ΔtΦ~1)\Psi_{\Delta t}={\Delta t}^{-3/2}\left(\Phi_{\Delta t}-\Phi_{0}-{\Delta t}\,\widetilde{\Phi}^{1}\right). ∎

A Taylor expansion with integral remainder gives

A simple computation using (42) and (43) shows that

where rf,Δtr_{f,{\Delta t}} is uniformly bounded for Δt{\Delta t} sufficiently small. Therefore,

where the remainder rf,g,Δtr_{f,g,{\Delta t}} is uniformly bounded. ∎

References