Delay Bounds for Networks with Heavy-Tailed and Self-Similar Traffic

Jorg Liebeherr, Almut Burchard, Florin Ciucu

I Introduction

Traffic measurements in the 1990s provided evidence of self-similarity in aggregate network traffic , and heavy-tailed files sizes and bursts were found to be among the root causes . Since such traffic induces backlog and delay distributions whose tails decay slower than exponential, the applicability of analytical techniques based on Poisson or Markovian traffic models in network engineering has been called into question , thus creating a need for new approaches to teletraffic theory.

A random process XX is said to have a heavy-tailed distribution if its tail distribution is governed by a power-law Pr(X(t)>x)Kxα{Pr}(X(t)>x)\sim Kx^{-\alpha}, with a tail index α(0,2)\alpha\in(0,2) and a scaling constant KK.We write f(x)g(x)f(x)\sim g(x), if limxf(x)/g(x)=1\lim_{x\rightarrow\infty}f(x)/g(x)=1. We will consider tail indices in the range 1<α<21<\alpha<2, where the distribution has a finite mean, but infinite variance. A random process XX is said to be self-similar if a properly rescaled version of the process has the same distribution as the original process. We can write this as X(t)distaHX(at)X(t)\sim_{dist}a^{-H}X(at) for every a>0a>0. The exponent H(0,1)H\in(0,1), referred to as the Hurst parameter, specifies the degree of self-similarity.The networking literature frequently uses the weaker concept of second-order self-similarity. Since we will work with heavy-tailed distributions, for which higher moments are not available, we use the more general definition of self-similarity. We refer to a process as heavy-tailed self-similar if it satisfies both criteria.

A performance analysis of networks with heavy-tailed self-similar traffic or service, where no higher moments are available, is notoriously hard, especially an analysis of a network path across multiple nodes. Single node queueing systems with heavy-tailed processes have been studied extensively . However, there exist only few works that can be applied to analyze multi-node paths. These works generally consider an asymptotic regime with large buffers, many sources, or in the steady state. Tail asymptotics for multi-node networks have been derived for various topologies, such as feedforward networks , cyclic networks , tandem networks with identical service times , and tandem networks where packets have independent service times at nodes in the more general context of stochastic event graphs . The accuracy of some asymptotic approximations has been called into question, particularly, the quality of large buffer asymptotics for heavy-tailed service distributions was found to be lacking in , thus motivating a performance analysis in a non-asymptotic regime.

This paper presents a non-asymptotic delay analysis for multi-node networks with heavy-tailed self-similar traffic and heavy-tailed service. We derive the bounds for a flow or flow aggregate that traverses a network path and experiences cross traffic from heavy-tailed self-similar traffic at each node. Both fluid and packetized interpretations of service are supported; in the latter case, we assume that a packet maintains the same size at each traversed node. A key contribution of this paper is a probabilistic sample path bound for heavy-tailed self-similar arrival processes. The derivation of the sample path bound is made possible by a suitable envelope characterization for heavy-tailed self-similar traffic, referred to as htss envelope. We present a characterization for heavy-tailed service and show that it can express end-to-end service available on a path as a composition of the heavy-tailed service at each node. Our end-to-end service characterization enables the computation of end-to-end delay bounds using our single-node result. In an asymptotic regime, our bounds follow (up to a logarithmic correction) the same power law tail decay as asymptotic results that exist in the literature for single nodes. Finally, we show that end-to-end delays of heavy-tailed traffic and service grow polynomially with the number of nodes. For example, for a Pareto traffic source with tail index α\alpha we find that end-to-end delays are bounded by O(Nα+1α1(logN)1α1){O}(N^{\frac{\alpha+1}{\alpha-1}}(\log N)^{\frac{1}{\alpha-1}}) in the number of nodes NN.

Our analysis follows a network calculus approach where traffic is characterized in terms of envelope functions, which specify upper bounds on traffic over time intervals, and service is characterized by service curves, which provide lower bounds on the service available to a flow . An attractive feature of the network calculus is that the service available on a path can be composed from service characterizations for each node of the path. We consider a probabilistic setting that permits performance metrics to be violated with a small probability. Probabilistic extensions of the network calculus are available for traffic with exponential tail distributions , distributions that decay faster than any polynomial , and traffic distributions with an effective bandwidth . The latter two groups include certain self-similar processes, in particular, those governed by fractional Brownian motion , but do not extend to heavy-tailed distributions. There are also efforts for extending the network calculus to heavy-tailed distributions , which are discussed in more detail in the next section.

The remainder of this paper is organized as follows. In Section II and Section III, respectively, we discuss our characterization of heavy-tailed traffic and service by appropriate probabilistic bounds. In Section IV we present our main results: (1) a sample path envelope for heavy-tailed self-similar traffic, (2) probabilistic bounds for delay and backlog, (3) a description of the leftover capacity at a constant-rate link with heavy-tailed self-similar cross traffic, and (4) a composition result for service descriptions at multiple nodes. In Section V we discuss the scaling properties of the derived delay bounds in terms of power laws. We present brief conclusions in Section VI.

II The htss Traffic Envelope

In this section we present and evaluate a probabilistic envelope function, for characterizing heavy-tailed self-similar network traffic that permits the derivation of rigorous backlog and delay bounds. The proposed htss envelope further develops concepts that were previously studied in .

We consider arrivals and departure of traffic at a system, which represents a single node or a sequence of multiple nodes. We use a continuous time model where arrivals and departures of a traffic flow at the system for a time interval [0,t)[0,t) are represented by left-continuous processes A(t)A(t) and D(t)D(t), respectively. The arrivals in the time interval [s,t)[s,t) are denoted by a bivariate process A(s,t):=A(t)A(s)A(s,t):=A(t)-A(s). Backlog and delay at a node are represented by B(t)=A(t)D(t)B(t)=A(t)-D(t) and W(t)=inf{d:A(td)D(t)}W(t)=\inf\left\{d:A(t-d)\leq D(t)\right\}, respectively. When AA and DD are plotted as functions of time, BB and WW are the vertical and horizontal distance, respectively, between these functions.

A statistical envelope G{\cal G} for an arrival process AA is a non-random function which bounds arrivals over a time interval such that, for all s,t0s,t\geq 0 and for all σ>0\sigma>0 :

where ε\varepsilon is a non-increasing function of σ\sigma that satisfies ε(σ)0\varepsilon(\sigma)\to 0 as σ\sigma\to\infty. The function ε(σ)\varepsilon(\sigma) is used as a bound on the violation probability. Statistical envelopes have been developed for many different traffic types, including regulated, Markov modulated On-Off, and Gaussian self-similar traffic. A recent survey provides an overview of envelope concepts .

The computation of performance bounds, e.g., bounds on backlog delay, and output burstiness, requires a statistical envelope that bounds an entire sample path {A(s,t)}st\left\{A(s,t)\right\}_{s\leq t}. A statistical sample path envelope G\overline{{\cal G}} is a statistical envelope that satisfies for all t0t\geq 0 and for all σ>0\sigma>0 :

Clearly, a statistical sample path envelope is also a statistical envelope, but not vice versa. In fact, only few statistical envelopes (in the sense of Eq. (1)) lend themselves easily to the development of sample path envelopes (as in Eq. (2)). One of the earliest such envelopes appears in the Exponentially Bounded Burstiness (EBB) model from , which requires that Pr(A(s,t)>r(ts)+σ)Meaσ,{Pr}(A(s,t)>r(t-s)+\sigma)\leq Me^{-a\sigma}, for some constants MM, rr and aa and for all σ>0\sigma>0. If rr corresponds to the mean rate of traffic, an EBB envelope specifies that the deviation of the traffic flow from its mean rate has an exponential decay. A sample path bound for EBB envelopes in the sense of Eq. (2) is obtained via the union boundFor two events XX and YY, Pr(XY)Pr(X)+Pr(Y){Pr}(X\cup Y)\leq{Pr}(X)+{Pr}(Y). by evaluating the right-hand side of Eq. (2) as kPr(A(sk,t)>G(tsk1;σ))\sum_{k}{Pr}(A(s_{k},t)>\overline{{\cal G}}(t-s_{k-1};\sigma)) for a suitable discretization {sk}k=1,2,\{s_{k}\}_{k=1,2,\ldots}, yielding G(ts;σ)=Rt+σ\overline{{\cal G}}(t-s;\sigma)=Rt+\sigma for R>rR>r and ε(σ)=Meaσ1ea(Rr)\overline{\varepsilon}(\sigma)=\frac{Me^{-a\sigma}}{1-e^{-a(R-r)}} . The EBB envelope has been generalized to distributions with moments of all orders, referred to as Stochastically Bounded Burstiness (SBB) and corresponding sample path bounds have been developed in . SBB envelopes can characterize arrival processes that are self-similar, but not heavy-tailed. For instance, fractional Brownian motion processes can be fitted with an envelope function G(t)=rt+σ{\cal G}(t)=rt+\sigma with a Weibullian bound on the violation probability of the form ε(σ)=Ke(σ/a)α\varepsilon(\sigma)=Ke^{-(\sigma/a)^{\alpha}} for some a>0a>0 and 0<b<10<b<1.

A statistical envelope for general self-similar arrival processes can be expressed as

Note that A(s,t)r(ts)distX(ts)A(s,t)-r(t-s)\leq_{dist}X(t-s), where XX is a process satisfying the self-similar property given in the introduction. For self-similar traffic, it is natural to allow a heavy-tailed violation probability, since self-similarity can arise from heavy-tailed arrival processes with independent increments.This is evident in Eqs. (13)–(14) below in an application of the Generalized Central Limit Theorem for a Pareto traffic source. This consideration leads to our proposed extension of the EBB and SBB concepts that capture characteristics of heavy-tailed and self-similar traffic. We define a heavy-tail self-similar (htss) envelope as a bound that satisfies for all σ>0\sigma>0 that

where KK and rr are constants, and HH and α\alpha, respectively, indicate the Hurst parameter and the tail index. We generally assume that α(1,2)\alpha\in(1,2), that is, arrivals have a finite mean but infinite variance, and H(0,1)H\in\left(0,1\right). In the htss envelope the probability of deviating from the average rate rr follows a power law. Moreover, due to self-similarity, these deviations may increase as a function of time. Since the htss envelope specifies a bound, it can be used to describe any type of traffic, but the characterizations will be loose unless the traffic has some heavy-tailed self-similar properties. In terms of Eq. (1), the htss envelope is a statistical envelope with

In Section IV, we will derive a sample path envelope for the htss envelope, which is necessary for the computation of probabilistic upper bounds on backlog and delay of heavy-tailed self-similar traffic at a network node.

Characterizations of self-similar and heavy-tailed traffic by envelopes have been presented before, generally, by exploiting specific properties of α\alpha-stable processes . An envelope for α\alpha-stable processes in takes the same form G(t;σ)=rt+σtH{\cal G}(t;\sigma)=rt+\sigma t^{H} as the htss envelope, but specifies a fixed violation probability rather than a bound on the distribution. An issue with such a characterization is that it does not easily lead to sample path envelopes. For H=0H=0, a sample path version of Eq. (4) has been obtained in by applying an a-priori bound on the backlog process of an α\alpha-stable self-similar process from . Since the backlog bound given in Eq. (24) of is a lower bound (and not an upper bound) on the tail distribution of the buffer occupancy, the envelope in does not satisfy Eq. (2). In Section IV it will become evident that sample path envelopes for arrivals and backlog bounds are interchangeable, in that the availability of one can be used to derive the other. Thus, the sample path bound derived in this paper for heavy-tailed self-similar processes satisfying the htss envelope from Eq. (4) also provides the first rigorous backlog bounds for this general class of processes.

In the remainder of this section, we show how to construct htss envelopes for relevant distributions, as well as for measurements of packet traces. Ever since traffic measurements at Bellcore from the late 1980s discovered long-range dependence and self-similarity in aggregate network traffic , many studies have supported, refined, sometimes also repudiated (e.g., ) these findings. This report does not participate in the debate whether aggregate network traffic is best characterized as short-range or long-range dependent, self-similar or multi-fractal, short-tailed or heavy-tailed, and so on. Rather we wish to provide tools for evaluating the performance of networks that may see heavy-tailed self-similar traffic, and shed light on the opportunities and pitfalls of envelope descriptions for heavy-tailed traffic.

Stable distributions provide well-established models for non-Gaussian processes with infinite variance. The potential of applying stable processes to data networking was demonstrated in by fitting traces of aggregate traffic (i.e., the Bellcore traces studied in ) to an α\alpha-stable self-similar process.

The point of departure for our characterization of α\alpha-stable processes with htss envelopes is the α\alpha-stable process proposed in which takes the form

Here, rr is the mean arrival rate and bb is a parameter that describes the dispersion around the mean.

Remark: We can use Eq. (6) to observe the statistical multiplexing gain of α\alpha-stable processes. By the defining property of SαS_{\alpha}, the superposition of NN i.i.d. processes as in Eq. (6), denoted by A\mbox\slmuxA_{\mbox{\small\sl mux}}, yields

Since 1/α<11/\alpha<1 in the considered range α(1,2)\alpha\in(1,2), the aggregate of a set of flows increases slower than linearly in the number of flows, thus, giving clear evidence of multiplexing gain. The multiplexing gain diminishes as α1\alpha\rightarrow 1.

We can obtain an htss envelope for Eq. (6) from the tail approximation for α\alpha-stable distributions

where cα=(2Γ(α)sinπα2π)1αc_{\alpha}=\left(\frac{2\Gamma(\alpha)\sin{\frac{\pi\alpha}{2}}}{\pi}\right)^{-\frac{1}{\alpha}} and Γ()\Gamma(\cdot) is the Gamma function. With Eq. (6) we can write

Matching this expression with Eq. (4) we obtain the remaining parameter KK of the htss envelope by setting

By Eq. (7), this envelope only holds for large σ\sigma, or, equivalently, low violation probabilities. An alternative method to obtain an htss envelope for all values of σ\sigma is to take advantage of the quantiles of SαS_{\alpha}. Since the density of SαS_{\alpha} is not available in a closed form, the quantiles must be obtained numerically or by a table lookup. Let the quantile z(ε)z(\varepsilon) be the value satisfying

We obtain a statistical envelope by setting G(t;σ)=rt+z(ε)σtH{\cal G}(t;\sigma)=rt+z(\varepsilon)\sigma t^{H} with a fixed ε\varepsilon. In fact, this is the envelope for α\alpha-stable processes from . However, since z(ε)z(\varepsilon) does not follow a power law, it is not an htss envelope. To obtain an htss envelope from the quantiles, we express Eq. (4) in terms of Eq. (9), which can be done by setting

In Fig. 1, we present envelopes for a process satisfying Eq. (6) with

We show statistical envelopes with fixed violation probabilities ε=101,102,103\varepsilon=10^{-1},10^{-2},10^{-3}. The graph compares the htss envelopes constructed with the tail approximation using Eq. (7) to those obtained from the quantiles via Eq. (10). As can be expected, the envelopes computed from the asymptotic tail approximation are smaller than the quantile envelopes. We add that envelopes computed from the quantiles for a fixed ε\varepsilon, as described in Eq. (9), are very close to the tail approximation envelopes. If the corresponding envelopes were included in the figure, they would appear almost indistinguishable, suggesting that Eq. (7) provides reasonable bounds for all values of σ\sigma.

II-B Pareto Packet Distribution

As second case study, we present an htss envelope construction for a packet source with a Pareto arrival distribution. Packets arrive evenly spaced at rate λ\lambda and packet sizes are i.i.d. described by a Pareto random variable XiX_{i} for the ii-th packet with tail distribution

where α(1,2)\alpha\in(1,2). XX has finite mean E[X]=bαα1E\left[X\right]=\frac{b\alpha}{\alpha-1} and infinite variance. We will construct an htss envelope for the compound arrival process

where N(t)=λtN(t)=\lfloor\lambda t\rfloor denotes the number of packets which arrive by time tt. This arrival process is asymptotically self-similar with a Hurst parameter of H=1/αH=1/\alpha.

For the htss envelope construction of the Pareto source, we take advantage of the generalized central limit theorem (GCLT) , which states that the α\alpha-stable distribution SαS_{\alpha} appears as the limit of normalized sums of i.i.d. random variables. For the independent Pareto random variables XiX_{i}, the GCLT yields

in distribution. Since the GCLT is an asymptotic limit, envelopes derived with the GCLT are approximate, with higher accuracy for larger values of nn. Using that N(t)λtN(t)\approx\lambda t for suitable large values, we can write the arrival function in Eq. (12) with Eq. (13) as

Since this expression takes the same form as Eq. (6), we can now use the tail estimate of Eq. (7) to obtain an htss envelope with parameters

The same parameters are valid when N(t)N(t) is a Poisson process, according to Theorem 3.1 in .

Similar techniques can yield htss envelopes for other heavy-tailed processes. For example, an aggregation of independent On-Off periods, where the duration of ‘On’ and ‘Off’ periods is governed by independent Pareto random variables yields an α\alpha-stable process in the limit of many flows (N)(N\rightarrow\infty) and large time scales (t)(t\rightarrow\infty). This aggregate process is particularly interesting since dependent on the order in which the limits of NN and tt are taken, one obtains processes that are self-similar, but not heavy-tailed (fractional Brownian motion), processes that are heavy-tailed, but not self-similar (α\alpha-stable Lévy motion), or a general α\alpha-stable process. An approximation by an α\alpha-stable process followed by an estimation of htss parameters can also be reproduced for the M/G/\infty arrival model .

Example. We next compare envelope constructions for a Pareto source with evenly spaced packet arrivals with a size distribution given by Eq. (11). The parameters are

With these values, the average packet size is 400400 Byte. We evaluate the following types of envelopes:

htss GCLT envelope. This refers to the envelope constructed with the GCLT according to Eq. (14). The value of σ\sigma of the htss envelope is set so that the right hand side of Eq. (4) satisfies a violation probability of ε=103\varepsilon=10^{-3}.

Deterministic trace envelope. This envelope is computed from a simulation of a packet trace with 1 million packets drawn from the given Pareto distribution. We compute the smallest envelope for the trace that satisfies Eq. (1) with ε(σ)=0\varepsilon(\sigma)=0 for all σ>0\sigma>0. The deterministic trace envelope, which is computed by G(t)=supτ{A(t+τ)A(τ)}{\cal G}(t)=\sup_{\tau}\{A(t+\tau)-A(\tau)\} , provides the smallest envelope of a trace that is never violated.

htss trace envelope. This is an htss envelope created from the same Pareto packet trace. We assume that the values of α\alpha and H=1/αH=1/\alpha are given, but that the distribution is not known. The envelope is created directly from Eq. (4) by inspecting the relative frequency at which subintervals of the trace violate the htss envelope. First, KK is selected as the smallest number that satisfies the right hand side of Eq. (4) for all values of σ\sigma. Then σ\sigma is found by fixing the violation probability ε=103\varepsilon=10^{-3}.

Average rate. For reference, we also include the average rate of the data in the figures, which is obtained from the same packet trace as in the trace envelopes.

The resulting envelopes are plotted in Fig. 3. The discrete steps of the deterministic trace envelope around t=0t=0 ms, t=460t=460 ms, and t=680t=680 ms are due to arrivals of very large packets at certain times in the simulated trace. The htss GCLT envelope is quite close to the data of the average rate. However, since the GCLT is an asymptotic result, this envelope is possibly too optimistic. On the other hand, the safely conservative htss trace envelope is much larger than the corresponding deterministic trace envelope. The reason is that the construction of this envelope performs a heavy-tailed extrapolation of the data trace, and thus amplifies the variability of the underlying trace.

To investigate the variability of htss trace envelopes as a function of the length of the trace, we present envelopes obtained from subintervals of the trace used for Fig. 3. We use non-overlapping subintervals of 100,000 packets from the trace and compute htss trace envelopes for the subintervals. In Fig. 3, we show the envelopes normalized by the values of the htss GCLT envelope. We plot the maximum and the minimum values of the computed htss envelopes for the subintervals. The figure makes drastically clear that the range of values of the htss envelopes for the shorter intervals cover a wide range. This illustrates an inherent problem with generating a traffic characterization for heavy-tailed traffic from limited data sets.

II-C Measured Packet Traces

We next show how to obtain an htss envelope from measured traffic traces. The trace data was collected in October 2005 at the 1 Gbps uplink of the Munich Scientific Network, a network with more than 50,000 hosts, to the German research backbone network. The complete trace contains more than 6 billion packets, collected over a 24-hour time period. Further details on the data trace and the collection methodology can be found in . From this data, we select the first 10910^{9} packets corresponding to 2.752.75 hours worth of data, with an overall average rate of r=465r^{*}=465 Mbps. We refer to this data as the Munich trace.

To extract the tail index and the Hurst parameter from the trace, we take advantage of parameter estimation methods for stable processes from ,We use source code provided to us by the authors of . which yields the following parameters for the Munich trace:

The remaining parameter KK needed for the htss envelope can now be obtained in the same way as described for the htss trace envelope.

To provide a sense of the trace data, we present in Fig. 5 a normalized log-log plot of the Munich trace data in terms of the normalized random variable

Since Pr(Y>σ)=Pr(A(s,t)>G(ts;σ)){Pr}(Y>\sigma)={Pr}(A(s,t)>{\cal G}(t-s;\sigma)) where G{\cal G} is given in Eq. (5), the distribution of YY corresponds to that of violations of the htss envelope. In the figure, we show the log-log plot of YY for different values of (ts)(t-s), namely, ts=10,100,1000t-s=10,100,1000 ms. If the trace data was self-similar with the exact Hurst parameter HH^{*}, the log-log data curves should match perfectly for all values of (ts)(t-s). (We note that by reducing the value of the Hurst parameter slightly, the curves for different values of (ts)(t-s) can be made to match up almost perfectly). Since the decay of the log-log plots is obviously not linear, the distribution of the Munich trace does not appear to be heavy-tailed. We will see that a characterization of such a non-heavy-tailed process by an htss envelope leads to a pessimistic estimation.

We can also use Fig. 5 to graphically construct an htss envelope for the Munich trace. Since we already have determined the tail index α\alpha^{*} and the Hurst parameter HH^{*} as given above, we only need to find KK. The value of this parameter can be obtained by taking the logarithm of Eq. (4). Using the definition of YY, this yields

Applying this relationship to Fig. 5, we should select KK as the smallest value such that the linear function logKαlogσ\log K-\alpha^{*}\log\sigma lies above the log-log plots of Pr(Y>σ){Pr}(Y>\sigma) in the figure. In Fig. 5, we include the linear segment with K=1225K=1225 as a thick line. Clearly, any other selection of KK and α\alpha^{*} providing an upper bound of the log-log plots of the Munich trace also yields a valid htss envelope for all values of σ\sigma. An htss envelope for a fixed violation probability ε\varepsilon can be obtained from Fig. 5 by finding the value of σ\sigma that corresponds to the desired violation probability of the linear segment. Finally, we can use Fig. 5 to assess the accuracy of the htss envelope. The linear segment (the thick black line) is close to the trace data when Pr(Y>σ)101{Pr}(Y>\sigma)\approx 10^{-1}. Otherwise, the linear segment is quite far apart from the plots of the trace. This indicates that the htss envelopes developed with the parameter settings used for the linear segment are accurate only when the violation probability is around 10110^{-1}. If the data trace was truly heavy-tailed, the data curves would maintain a linear rate of decline at a rate around α\alpha^{*}, and would remain close to the linear segment for any σ\sigma sufficiently large.

In Fig. 5, we show htss envelopes for the Munich trace obtained with the linear segment from Fig. 5 for ε=101\varepsilon=10^{-1} and ε=103\varepsilon=10^{-3}. For comparison, we include in Fig. 5 the average rate of the traffic trace, as well as a deterministic envelopes of the 1 sec subinterval of the trace that generates the most traffic. (Since the computation of a deterministic trace envelope as defined in Subsection II-B grows quadratically in the size of the trace, the computation time to construct a deterministic envelope for the complete Munich trace is prohibitive. The included deterministic envelope for a subinterval of the trace is a lower bound for the deterministic envelope of the complete trace. However, for the depicted time intervals, the deterministic envelope for the subinterval is a good representation of the deterministic envelope of the entire trace, for several reasons. First, by selection of the subinterval, at t=1000t=1000 ms the envelope of the subinterval and the envelope of the complete trace are identical. Second, since any deterministic envelope is a subadditive function, the slope of the envelope decreases for larger values of time. Now, any function that satisfies these properties cannot vary significantly from the depicted envelope of the selected subinterval.) Comparing the htss envelopes with the reference curves confirms our earlier discussion on the accuracy of the htss envelopes: For ε=101\varepsilon=10^{-1}, the htss envelope is close to the plot of the average rate. On the other hand, the envelope for ε=103\varepsilon=10^{-3} is quite pessimistic, and lies well above the deterministic envelope.

III Service Guarantees with Heavy Tails

We next formulate service guarantees with a power-law decay. In the network calculus, service guarantees are expressed in terms of functions that express for a given arrival function a lower bound on the departures. In general, a statistical service curve is a function S(t;σ){\cal S}(t;\sigma) such that for all t0t\geq 0 and for all σ>0\sigma>0

denotes the min-plus convolution of the arrivals with the service curve S(t;σ){\cal S}(t;\sigma), and ε\varepsilon is a non-increasing function that satisfies ε(σ)0\varepsilon(\sigma)\to 0 as σ\sigma\to\infty.

We define a heavy-tailed (ht) service curve as a service curve of the form

for some β\beta with 0<β<20<\beta<2 and some constant LL. In analogy to the formulation of traffic envelopes in Section II, the ht service curve specifies that the deviation from the service rate guarantee RR has a heavy-tailed decay. The rationale for not including a Hurst parameter in the definition of the ht service guarantees is that the form of Eq. (15) facilitates the computation of service bounds over multiple nodes. In this paper, we consider two types of ht service curves, one characterizing the available capacity at a link with cross-traffic, the other modeling a packetizer.

\bullet Service at link with cross traffic (Leftover Service): This service curve seeks to describe the service available to a selected flow at a constant-rate link with capacity CC, where the competing traffic at the link, referred as cross traffic, is given by an htss envelope. By considering the pessimistic case that the selected flow receives a lower priority than the cross traffic, we will obtain a lower bound for the service guarantees for most workconserving multiplexers . Since the service guarantee of the selected flow consists of the capacity that is left unused by cross traffic, we refer to the service interpretation as leftover service. Since the derivation of an ht service curve for such a leftover service requires a sample path bound for the htss cross traffic, we defer the derivation to Subsection IV-B.

\bullet Packetizer: We will also use the ht service model to express a packetized view of traffic with a heavy-tailed packet size distribution. We model discrete packet sizes by a service element that delays traffic until all bits belonging to the same packet have arrived, and then releases all bits of the packet at once. Such an element is referred to as a packetizer. By investigating packetized traffic we can relate our bounds to a queuing theoretic analysis with a packet-level interpretation of traffic (see Section V). We now derive a service curve for a packetizer. For a packet-size distribution satisfying {Pr}\bigl{(}X>\sigma\bigr{\}}\leq L\sigma^{-\alpha}, we show that a constant-rate workconserving link of capacity CC provides an ht service curve with rate R=CR=C and a suitable function ε(σ)\varepsilon(\sigma).

Denote by X(t)X^{*}(t) the part of the packet in transmission at time tt that has already been transmitted. We can view X(t)X^{*}(t) as the current lifetime of a renewal process. It is known from the theory of renewal processes (see , pp. 194) that

The bound on the tail probability holds for all times tt, provided that the arrival time of the first packet after the network is started with empty queues at t=0t=0 is properly randomized. Using X(t)X^{*}(t), the departures of a packetizer are given by

where t\underline{t} is the beginning of the busy period of tt. Set S(t;σ)=[Ctσ]+{\cal S}(t;\sigma)=[Ct-\sigma]_{+}. If ρ\rho is the utilization of AA as a fraction of the link rate CC, that is,

IV Network Calculus with htss Envelopes

We consider a network as in Fig. 6. A flow traverses NN nodes in series. Its traffic is referred to as through traffic. At each node, the through traffic is multiplexed with arrivals from competing flows, called cross traffic. Both through and cross traffic are described by htss envelopes. We will present results that yield bounds on the end-to-end delay and backlog of the through traffic.

The network calculus for heavy-tailed traffic is enabled by a statistical sample path envelope for traffic with htss envelopes. To motivate the relevance of the sample path bound, let us consider the backlog of a flow at a workconserving link that operates at a constant rate CC. The backlog at time tt is given by

Notice that the backlog expression depends on the entire arrival sample path {A(s,t)}st\left\{A(s,t)\right\}_{s\leq t}. To compute an upper bound for the tail probability {Pr}\bigl{(}B(t)>\sigma\bigr{)}, in many places in the literature, in particular, in all prior works attempting a network calculus analysis with heavy-tailed traffic , the tail distribution is approximated by

However, the right hand side is generally smaller than the left hand side. Applying to the right hand side a statistical envelope that only satisfies Eq. (1) but not Eq. (2) does not yield an upper bound but rather an upper bound to a lower bound. The derivation of rigorous upper bounds requires a sample path bound for the arrivals. To derive such bounds, we discretize time by setting xk=τγkx_{k}=\tau\gamma^{k}, where τ>0\tau>0 and γ>1\gamma>1 are constants that will be chosen below. If txks<txk1t-x_{k}\leq s<t-x_{k-1}, then

If the arrivals satisfy an htss envelope G(t)=rt+σtH{\cal G}(t)=rt+\sigma t^{H} with ε(σ)=Kσα\varepsilon(\sigma)=K\sigma^{-\alpha}, we obtain with the union bound

In the second line we have used Lemma 6 from the appendix to evaluate the sum. Writing C=r+μC=r+\mu and minimizing over γ\gamma gives the constant

We remark that, typically, we have 1<α<21<\alpha<2 and α1H<1\alpha^{-1}\leq H<1, so that α(1H)<1\alpha(1-H)<1. This means that the backlog is almost surely finite, but cannot be expected to have finite mean.

The main technical ingredient of the above proof of the backlog bound is the discretization of time by the geometric sequence xk=γkτx_{k}=\gamma^{k}\tau. This is an instance of under-sampling, where not every time step is used in probabilistic estimates. Commonly in the literature, time is discretized by dividing it into equal units with xk=kτx_{k}=k\tau. In , the choice is described as a general optimization problem over arbitrary sequences x0x1tx_{0}\leq x_{1}\leq\ldots\leq t, but not applied, since all examples in only optimize over τ\tau in uniformly spaced sequences. Note that using a uniform discretization in the derivation of Eq. (17) would cause the infinite sum to become unbounded.

An immediate consequence of the backlog bound is a sample path bound for htss envelopes.

ht sample path Envelope. If arrivals to a flow are bounded by an htss envelope

The proof follows immediately from Eq. (17) by replacing CC with the relaxed arrival rate r+μr+\mu. The ht sample path envelope is reminiscent of a leaky-bucket constraint with a single burst and rate, and does not reflect the self-similar scaling of the htss envelope.

We note that a small modification of the proof would yield a sample path envelope of the form

which retains the self-similar scaling properties of the htss envelope. The constant LL depends on the parameters α,H,r,μ\alpha,H,r,\mu and on the choice of M>0M>0. The reason we prefer the simpler envelope given by Lemma 1 is that it facilitates the estimation of the service provided to a flow across multiple nodes.

IV-B Heavy-Tailed Leftover Service Curve

With a sample path envelope for heavy-tailed traffic at hand, we can now derive a service curve for the heavy-tailed leftover service from Section III at a constant-rate link with capacity CC, as illustrated in Fig. 7. We denote arrivals of the through flow by AA and cross traffic arrivals by AcA_{c}. Departures are denoted by DD and DcD_{c}, respectively. Assuming that AcA_{c} is characterized by an htss envelope of the form Gc(t)=rc(ts)+σ(ts)Hc{\cal G}_{c}(t)=r_{c}(t-s)+\sigma(t-s)^{H_{c}} with ε(σ)=Kcσαc\varepsilon(\sigma)=K_{c}\sigma^{-\alpha_{c}} where the bound on the arrival rate satisfies rc<Cr_{c}<C, we will show that the through flow is guaranteed a ht service curve S(t;σ)=[Rtσ]+{\cal S}(t;\sigma)=[Rt-\sigma]_{+} with rate R=CrcμR=C-r_{c}-\mu, and a violation probability ε(σ)\varepsilon(\sigma) that can be estimated explicitly. Here, μ\mu is a free parameter.

Let t\underline{t} be the beginning of the busy period of tt at the link. Then, the aggregate departures in [t,t)[\underline{t},t) satisfy (D+Dc)(t,t)=C(tt)(D+D_{c})(\underline{t},t)=C(t-\underline{t}), and departures for the cross traffic satisfy Dc(t,t)min{C(tt),Ac(t)Ac(t)}D_{c}(\underline{t},t)\leq\min\{C(t-\underline{t}),A_{c}(t)-A_{c}(\underline{t})\}. With this we can derive

The description of the leftover service in Eq. (19) can be combined with Eq. (16) to characterize the leftover service available to a packetized through flow at a node. The result (which we state without proof) is that at a link that operates at rate C>rcC>r_{c}, the through flow receives a service guarantee given by the ht service curve

where β=min{αp1,αc(1Hc)}\beta=\min\{\alpha_{p}-1,\alpha_{c}(1-H_{c})\}, αp\alpha_{p} is the tail decay in Eq. (16), and μ>0\mu>0 is a free parameter. The violation probability is given by

IV-C Single Node Delay Analysis

We next present a delay bound at a single node where arrivals are described by htss envelopes and service is described by an ht service curve.

Single Node Delay Bound. Consider a flow that is characterized by an htss envelope with G(t,σ)=rt+σ(ts)H{\cal G}(t,\sigma)=rt+\sigma(t-s)^{H} and ε(σ)=Kσα\varepsilon(\sigma)=K\sigma^{-\alpha}, and that receives an ht service curve at a node given by S(t;σ)=[Rtσ]+{\cal S}(t;\sigma)=[Rt-\sigma]_{+} and ε(σ)=Lσβ\varepsilon(\sigma)=L\sigma^{-\beta}. If r<Rr<R, then the delay WW satisfies

where MM is a constant that depends on α\alpha, HH, rr, μ=Rr\mu=R-r, and β\beta.

Proof. Let A(t)A(t) and D(t)D(t) denote the arrival and departures of the (tagged) flow at the node. The delay is given by

Fix σ1,σ2>0\sigma_{1},\sigma_{2}>0 with R(σ1+σ2)=wR(\sigma_{1}+\sigma_{2})=w. Suppose that on a particular sample path,

If the infimum is assumed for some stws\leq t-w, then

If, on the other hand, the infimum is assumed for some stws\geq t-w, then

by monotonicity. In both cases, we see that W(t)wW(t)\leq w. It follows with union bound that

Example: We compute the delay experienced by a Pareto traffic source at a 100 Mbps link. The parameters are

With these values, the average data unit has a size of 400400 Byte, and the link utilization is 75%. The service curve is computed from Eq. (16). The reason for selecting this example (which does not have cross traffic) is that it permits a comparison with a queueing theoretic result in , which presents a lower bound on the quantiles of a Pareto source in a tandem network with NN nodes and no cross traffic, wN(z)w_{N}(z), as

In Fig. 8 we show a log-log plot of the delay distribution. The graph illustrates the power-law decay for the upper bound and the lower bound from . We also show the results of four simulation runs of an initially empty system with 106,107,10810^{6},10^{7},10^{8} and 10910^{9} packets. The simulation traces indicate that the actual delays may be closer to the lower bounds. Note that the fidelity of the simulations deteriorates at smaller violation probabilities. Since even long simulations runs do not contain sufficiently many events with large delays, they violate analytical lower bounds. Even the simulation run of 1 billion arrivals does not maintain the power-law decay for violation probabilities below ε=103\varepsilon=10^{-3}.

IV-D Multi-Node Delay Analysis

We turn to the computation of end-to-end delays for a complete network path. As in the deterministic version of the network calculus we express the service given by all nodes on the path in terms of a single service curve, and then apply single-node delay bounds. We start with a network of two nodes. We denote by A1A_{1} the arrivals of the analyzed flow at the first node, and by D1D_{1} or A2A_{2} the departures of the first node that arrive to the second node.

Concatenation of two ht service curves. Consider an arrival flow traversing two nodes in series. The first node offers an ht service curve with S1(t,σ)=[R1tσ]+{\cal S}_{1}(t,\sigma)=[R_{1}t-\sigma]_{+} and ε1(σ)=L1σβ1\varepsilon_{1}(\sigma)=L_{1}\sigma^{-\beta_{1}}, and the second node offers a service curve S2(t;σ)=[R2tσ]+{\cal S}_{2}(t;\sigma)=[R_{2}t-\sigma]_{+} and an arbitrary function ε2(σ)\varepsilon_{2}(\sigma). Then for any γ>1\gamma>1, the two nodes offer the combined service curve given by

The service rate R=min{R1,R2/γ}R=\min\{R_{1},R_{2}/\gamma\} in the expression for the service curve is the result of a min-plus convolution of the service curves at the individual nodes. The logarithmic term can be removed at the expense of lowering the exponent using Eq. (43) from Lemma 5. If the second node also offers an ht service curve, with ε2(σ)=L2σβ2\varepsilon_{2}(\sigma)=L_{2}\sigma^{-\beta_{2}}, then for every choice of β\beta with β<β1\beta<\beta_{1} and ββ2\beta\leq\beta_{2} there exists a constant L=L(β,R2,γ)L=L(\beta,R_{2},\gamma) such that ε(σ)Lσβ\varepsilon(\sigma)\leq L\sigma^{-\beta}. The value of the constant LL can be computed from Lemma 5 (Eqs. (42) and (45)).

Proof. We proceed by inserting the service guarantee for D1=A2D_{1}=A_{2} at the first node into the service guarantee at the second node. Similar to the backlog and delay bounds, this requires an estimate for an entire sample path of the service at the first node.

Fix t0t\geq 0. We consider discretized time points tykt-y_{k}, where y0=0y_{0}=0 and yk=τ+γyk1y_{k}=\tau+\gamma^{\prime}y_{k-1} for some τ>0\tau>0 and γ>1\gamma^{\prime}>1 to be chosen below. For tyks<tyk1t-y_{k}\leq s<t-y_{k-1}, we have

Set R=min{R1,R2γ}R=\min\left\{R_{1},\frac{R_{2}}{\gamma}\right\} and let γ>1\gamma^{\prime}>1 and δ>0\delta>0 be chosen so that R2γδ=R\frac{R_{2}}{\gamma^{\prime}}-\delta=R. Also fix σ1,σ2>0\sigma_{1},\sigma_{2}>0 and set σ=σ1+σ2\sigma=\sigma_{1}+\sigma_{2}. If for a given sample path

and, for all k1k\geq 1 with ykty_{k}\leq t,

then we can insert the lower bound for D1=A2D_{1}=A_{2} from Eq. (26) into Eq. (24). After collecting terms, the result is D2(t)A1S(t;σ)D_{2}(t)\geq A_{1}*{\cal S}(t;\sigma).

The violation probability of Eq. (25) is given by ε2(σ2)\varepsilon_{2}(\sigma_{2}). Assume for the moment that σδτγ1\sigma\geq\frac{\delta\tau}{\gamma^{\prime}-1}. We estimate the violation probability of Eq. (26) by

In the first step, we have used the union bound and the ht service curve S1{\cal S}_{1}. In the second step, we have used Lemma 7 to evaluate the sum (with γ\gamma^{\prime} in place of γ\gamma, and a=δa=\delta), and recalled that R2/γδ=RR_{2}/\gamma^{\prime}-\delta=R. (Here, we have used the assumption on σ\sigma given before the equation). We eliminate the shift with Lemma 5 (Eq. (44)), and insert the optimal choice τ=R1(L1β1logγ)1β1\tau=R^{-1}\left(\frac{L_{1}}{\beta_{1}\log\gamma^{\prime}}\right)^{\frac{1}{\beta_{1}}}. Taking γ=γ\gamma^{\prime}=\sqrt{\gamma} and δ=R(γ1)\delta=R(\gamma^{\prime}-1), we arrive at

Iterating the lemma results in the following end-to-end service guarantee, referred to as network service curve. To keep the statement of the theorem simple, we have assumed that each node offers an ht service guarantee with the same rate RR, the same constant LL, and the same power law β\beta. The general case can be reduced to this with the help of Lemma 5 (Eqs. (42) and (45)).

ht Network Service Curve. Consider an arrival flow traversing NN nodes in series, and assume that the service at each node n=1,,Nn=1,\dots,N satisfies an ht service curve

Then, for every choice of γ>1\gamma>1, the network provides the service guarantee

and the claim follows by collecting the factors of NN. \Box

Example: We perform a multi-node delay analysis for a sequence of homogeneous nodes with the same parameters used for Fig. 8. The reason for using this example, is that it permits us to draw a comparison with the lower bound for multi-node networks from given in Eq. (23). In Fig. 10 we show lower and upper bounds for networks with N=1,2,4,8N=1,2,4,8 nodes. For reference, we also include the results of individual simulation runs with 10810^{8} packets. The difference between lower and upper bounds is more pronounced than in the single-node analysis, and increases with the number of nodes NN. For both lower and upper bounds, the straight lines make the power-law decay in ww apparent. The growth of the bounds in NN suggests a power-law growth in NN, where the larger spacing for the upper bounds indicates a higher power. As before, we see that simulations violate analytical lower bounds. Since simulations of heavy-tailed traffic have little predictive values for larger delays, our analytical bounds provide more reliable estimates, even with the significant gap between upper and lower bounds.

V Scaling of Delay Bounds

We now explore the scaling properties of the delay bounds from the previous section. Throughout this section, we consider a network as in Fig. 6. We assume that the network is homogeneous, in the sense that all nodes have the same capacity CC, and all traffic is bounded by htss envelopes as in Eq. (4) with the same power α\alpha and Hurst parameter HH. The cross traffic at each node has rate rcr_{c} and constant KcK_{c}, and the through flow has rate r0r_{0} and constant K0K_{0}. Traffic can be either fluid-flow or packetized. In the latter case, the packet-size distribution of the through flow satisfies

We assume the stability condition r0+rc<Cr_{0}+r_{c}<C holds at each node.

Single node, large delays (ww\to\infty). Our first result concerns the power-law decay of the delay distribution at a single node. We choose a relaxation of μ=12(Crcr0)\mu=\frac{1}{2}(C-r_{c}-r_{0}), and use the leftover service curve from Eq. (20), given by

and LL is an explicitly computable constant. (For fluid-flow traffic, that is, without a packetizer, the first term does not appear, and we have β=α(1H)\beta=\alpha(1-H).) We then apply the delay bound of Theorem 1 with R=r0+μR=r_{0}+\mu to obtain

The constant MM is determined by Eq. (22) of Theorem 1 with β=β\beta^{\prime}=\beta. This shows that the delay decays with the same power law as the backlog bound in Eq. (17).

Multiple nodes, large delays (ww\to\infty). Now we consider scaling in networks with N>1N>1 nodes. We choose μ=13(Crcr0)\mu=\frac{1}{3}(C-r_{c}-r_{0}). We obtain at each node the service curve in Eq. (27), with β\beta given by Eq. (28). We next choose γ=Crcμr0+μ\gamma=\frac{C-r_{c}\mu}{r_{0}+\mu} and obtain from Theorem 2 the network service curve

where Rnet=r0+μR_{net}=r_{0}+\mu, and with violation probability bounded by

The constants M1M_{1}, M2M_{2}, and M3M_{3} are again explicitly computable, and do not depend on NN. The tail of the delay distribution, i.e., when ww\to\infty, is dominated by the first summand in the brackets, thus, we have the asymptotic upper bound

Multiple nodes, long paths (NN\to\infty). For long paths, i.e., NN\to\infty, the second summand dominates. The quantiles of the delay, defined by

Comparison of scaling bounds. We next compare these upper bounds with scaling results from the literature for a Pareto service time distribution and no cross traffic, where traffic arrives in the form of evenly spaced packets XiX_{i}, with an i.i.d. Pareto packet-size distribution, as characterized in Section II. We assume that service times of packets are identical at each node in the sense of . By scaling the units of time and traffic, we may assume an average packet size of E[X]=1E[X]=1 and a link rate C=1C=1, resulting in a rate λ=ρ\lambda=\rho, where ρ\rho is the utilization.

For this model, it is known from queueing theory that the delay at a single node decays with a power law with exponent α1\alpha-1 . Theorem 1 from yields for the queueing time of the kk-th packet in the steady state Q=limkQkQ=\lim_{k\rightarrow\infty}Q_{k} that

The delay of the kkth packet is the sum of its queueing time QkQ_{k} and its processing time XkX_{k}. This per-packet delay is related with the delay W(t)W(t) at a given time by

where k(t)k(t) is the number of the packet being processed at time tt, and X(t)X^{*}(t) is the lifetime of the current packet, as defined in Section III. Since packets are i.i.d., Wk(t)W_{k(t)} is independent of X(t)X^{*}(t) and its distribution agrees with WkW_{k}, and we can compute

where c(α)c(\alpha) is a constant that depends on the tail index.

If we compare this asymptotic exact result with our bound from Eq. (29) and(30), we see that β=α1\beta=\alpha-1, and so Eq. (30) provides (up to a logarithmic correction) the same power-law decay as Eq. (32). The constant MM in Eq. (29) is of order O\bigl{(}(1-\rho)^{-2}\bigr{)}, while the right hand side of Eq. (32) is of order (1ρ)1(1-\rho)^{-1}, which indicates that our delay bound becomes pessimistic as ρ1\rho\to 1.

Exploring the scaling in a tandem network, we first note that Eq. (17) states that for a single node, the tail probability of the delays decays with O\bigl{(}w^{-(\alpha-1)}\log w\bigr{)}. Since end-to-end delays exceed the delay at a single node, Eq. (32) guarantees that W(t)=\Omega\bigl{(}w^{-(\alpha-1)}\bigr{)}. Thus, our upper and lower bound differ by at most a logarithmic factor. Eq. (31) implies furthermore that delay quantiles are bounded by O\bigl{(}N^{\frac{\alpha+1}{\alpha-1}}(\log N)^{\frac{1}{\alpha-1}}\bigr{)} as NN\to\infty. From the lower bound from given in Eq. (23) we can obtain that quantiles of the end-to-end delay grow at least as fast as w_{net}(\varepsilon)=\Omega\bigl{(}N^{\frac{\alpha}{\alpha-1}}\bigr{)}.

Lastly, we note that end-to-end delays are expected to grow more slowly if service times are independently regenerated at each node. A large buffer asymptotic from for multi-node networks could be used to obtain the scaling properties of such a network.

VI Conclusions

We have presented an end-to-end analysis of networks with heavy-tailed and self-similar traffic. Working within the framework of the network calculus, we developed envelopes for heavy-tailed self-similar traffic and service curves for heavy-tailed service models. By presenting new sample path bounds for arrivals and service, we were able to derive non-asymptotic performance bounds on backlog and delay, as well as network-wide service characterizations. We explored the scaling behavior of the derived bounds and showed that, for single nodes, the tail probabilities of our delay bounds observe the same power-law decay as known results for G/G/1 systems. We also described the scaling behavior of end-to-end delays. Our paper may motivate further study of the conditions under which performance bounds in a heavy-tailed regime can be tightened. A useful, possibly difficult extension is the derivation of a multi-node service curve that accounts for self-similarity, in addition to heavy-tails.

Acknowledgments

We thank Prof. Anja Feldmann at TU Berlin for giving us access to a collection of data trace. J. Liebeherr thanks Deutsche Telekom Laboratories for hospitality during a sabbatical. We thank Prof. Stephen McLaughlin for sharing a software library for alpha-stable distributions. We thank Yashar Ghiassi for comments on the manuscript.

References

Appendix A Self-Similar Traffic with Gaussian Tails

While self-similar traffic is expected to be heavy tailed, some self-similar traffic types appearing in network analysis exhibit a faster decay. In the following we present a derivation of a network service curve for traffic with a Gaussian bound on the violation probability. More precisely, will consider processes that satisfy for all σ>0\sigma>0

with some constants K>0K>0, a>0a>0, and α0\alpha\geq 0. In terms of Eq. (1), this corresponds to a statistical envelope

To motivate that envelopes given by Eq. (34) can provide useful traffic models, consider first the fractional Brownian motion (fBm) traffic model. In this model, arrivals are given by

where N0,1N_{0,1} is the standard normal distribution, and bb is a parameter that relates to the standard deviation. Eq. (35) is the special case of the α\alpha-stable process from Eq. (6) with α=2\alpha=2. Using that the density of N0,1N_{0,1} is given by ϕ(x)=12πe12x2\phi(x)=\frac{1}{\sqrt{2\pi}}e^{-\frac{1}{2}x^{2}}, we see from Eq. (35) that

i.e., Eq. (34) with K=1/2K=1/2 is a statistical envelope.

An alternative description of arrival processes with Gaussian tails is through their effective bandwidth, defined by

The effective bandwidth of the self-similar process in Eq. (35) is given by eb(θ,t)=ρ+12b2θt2H1eb(\theta,t)=\rho+\frac{1}{2}b^{2}\theta t^{2H-1}. Let us consider more generally processes whose effective bandwidth satisfies the self-similar bound

By the Chernoff bound, Eq. (37) implies that

i.e., Eq. (34) with K=1K=1 is a statistical envelope. Note that for H>12H>\frac{1}{2} and any given choice of θ\theta, Eq. (37) does not define a linear bounded envelope process as defined in .

sample path envelope for self-similar processes with Gaussian tail. Assume that arrivals to a flow are bounded by a statistical envelope, given by

Then, for any choice of μ>0\mu>0, a statistical sample path envelope for the arrival process is given by

with parameters β=2(1H)\beta=2(1-H), c=(2bμH)11Hc=\left(\frac{2b}{\mu^{H}}\right)^{\frac{1}{1-H}}, and L=emax{1,4HKr/μ+2HH(1H)}L=e\cdot\max\left\{1,4^{H}K\frac{r/\mu+2-H}{H(1-H)}\right\}.

Proof. Fix μ>0\mu>0, let 1<γ<1+μr1<\gamma<1+\frac{\mu}{r}, and set b=r+μγr>0b=\frac{r+\mu}{\gamma}-r>0. As done for the backlog bound from Eq. (17), we argue that

where z=bHσ1HγH(1H)z=\frac{b^{H}\sigma^{1-H}}{\gamma^{H(1-H)}}. Here, the first inequality follows by discretization. In the second step, we have used Lemma 6 and replaced the integration variable xx with x/bx/b. In the third step, we have evaluated the integral.

We want to replace the variable zz by y=μHσ1H/by=\mu^{H}\sigma^{1-H}/b, so that 12y2=(σ/c)β\frac{1}{2}y^{2}=(\sigma/c)^{\beta}. Suppose for the moment that y22y^{2}\geq 2, and choose γ\gamma such that logγ=(r/μ+2H)1y2\log\gamma=(r/\mu+2-H)^{-1}y^{-2}. Then γ<1+r/μ\gamma<1+r/\mu, as required. Moreover,

This proves the claim for y22y^{2}\geq 2. On the other hand, for y2<2y^{2}<2, the claim holds trivially since the right hand side exceeds 11. \Box

As in Section IV-B, the sample path envelope immediately yields a leftover service curve. If fluid-flow traffic arrives to a link of constant rate CC, where it is subject to cross traffic that has a statistical envelope with Gaussian tails, as in Eq. (34), then for every choice of μ>0\mu>0,

provides a statistical service curve for the through flow with a Weibullian bound on the violation probability. Here, the parameters β\beta, cc, and LL are given by Lemma 3. Next, we show how to concatenate such service curves:

Concatenation of Weibull-tailed service curves. Consider two consecutive nodes on the path of a flow through a network. Assume that the first node offers a statistical service curve S1(t;σ)=[R1tσ]+{\cal S}_{1}(t;\sigma)=[R_{1}t-\sigma]_{+} with ε1(σ)=L1e(σ/c1)β1\varepsilon_{1}(\sigma)=L_{1}e^{-(\sigma/c_{1})^{\beta_{1}}}, and the second node offers a statistical service curve S2(t;σ)=[R2tσ]+{\cal S}_{2}(t;\sigma)=[R_{2}t-\sigma]_{+} with an arbitrary function ε2(σ)\varepsilon_{2}(\sigma). Then, for every choice of γ>1\gamma>1, the two nodes offer the combined service curve given by

Proof. We follow the proof of Lemma 2, with the Weibullian function ε1(σ)=L1e(σ/c1)β1\varepsilon_{1}(\sigma)=L_{1}e^{-(\sigma/c_{1})^{\beta_{1}}} in place of the power law. Set R=min{R1,R2/γ}R=\min\{R_{1},R_{2}/\gamma\}, δ=R2R\delta=R_{2}-R, and consider the event

where yky_{k} is a sequence of discretization time steps. We argue as in the proof of Lemma 2 that S{\cal S} is a service curve, with violation probability given by

We could now use Lemma 7 to bound the violation probability of Eq. (41). However, since ε1(σ1)\varepsilon_{1}(\sigma_{1}) is an integrable function, we can obtain a slightly stronger bound by using the arithmetic sequence yk=kτy_{k}=k\tau, as in . The resulting estimate is

We have first used the union bound and the ht service curve, then replaced the sum by an integral, and finally evaluated the integral. Suppose for the moment that (σ1/c1)β12(\sigma_{1}/c_{1})^{\beta_{1}}\geq 2. We choose τ=c1β1β1R2σ1β11\tau=\frac{c_{1}^{\beta_{1}}}{\beta_{1}R_{2}\sigma_{1}^{\beta_{1}-1}} and use that

But this inequality clearly also holds for (σ1/c1)β1<2(\sigma_{1}/c_{1})^{\beta_{1}}<2. The lemma follows by combining this with the service guarantee at the second node. \Box

With this result, we finally present a network service curve that concatenates an arbitrary number of Weibullian service curves.

Weibullian Network Service Curve. Consider an arrival flow traversing NN nodes in series, and assume that the service at each node n=1,,Nn=1,\dots,N satisfies a service curve

Then, for every choice of γ>1\gamma>1, the network provides the service curve

Proof. See the proof of Theorem 2. We use Lemma 4 to iteratively derive the service guarantee for the last nn nodes. In each step, we reduce the rate by a factor of γ1N1\gamma^{\frac{1}{N-1}}. To simplify the formula, we have replaced N1N-1 with NN whenever it appeared convenient, and used that Nlogγγ1Nγ1N11+Nlogγ\frac{N}{\log\gamma}\leq\frac{\gamma^{\frac{1}{N}}}{\gamma^{\frac{1}{N}}-1}\leq 1+\frac{N}{\log\gamma}. \Box

Consider now a network as in Figure 6 where all flows have envelopes with a Gaussian tail, given by Eq. (33). Assume that the network is homogeneous, i.e., all nodes have the same link rate, and the cross traffic has the same parameters at each node. Combining the results from this section, we obtain, similarly to the analysis in Section V, the end-to-end delay bound

where MM and w0w_{0} are constants that do not depend on NN, and β=2(1H)\beta=2(1-H). It follows that, over long paths, the quantiles of the delays scale as

For β=1\beta=1, this bounds recovers the O(NlogN)O(N\log N) bound for exponential traffic from .

Appendix B Technical Lemmas

In our derivations, we frequently use properties of the function ε(σ)=Kσα\varepsilon(\sigma)=K\sigma^{-\alpha} that appears in the definition of the htss envelope. The properties are summarized in the following lemma, and presented without proof.

(Lower power.) We can lower the power by using that for Kσα1K\sigma^{-\alpha}\leq 1 and α<α\alpha^{\prime}<\alpha

(Eliminate logarithmic factor.) For β<β\beta^{\prime}<\beta,

(Remove shift.) For α>0\alpha>0, σ0>0\sigma_{0}>0, and K(σσ0)α1K(\sigma-\sigma_{0})^{-\alpha}\leq 1, we can remove a negative shift by

(Minimize sum.) We can minimize sums of such functions by

where K=1n(K1+K2++Kn)\overline{K}=\frac{1}{n}(K_{1}+K_{2}+\ldots+K_{n}).

The following derives auxiliary estimates for two sums that involve geometric sequences.

Assume that ε(x)\varepsilon(x) is a nonincreasing nonnegative function. Fix γ>1\gamma>1 and τ>0\tau>0, and set xk=τγkx_{k}=\tau\gamma^{k}. Then, for every σ0\sigma\geq 0 and every c>0c>0,,

Proof. Consider first the case where c=τ=1c=\tau=1, i.e., xk=γkx_{k}=\gamma^{k}. Each summand in the series satisfies

Since the first term on the right hand side increases with kk while the second term decreases, we can bound the series by the sum of two integrals

where the overlap between the intervals of integration compensates for the change of monotonicity. The optimal choice for the limit of integration is T=H+logσlogγT=-H+\frac{\log\sigma}{\log\gamma}, so that σγH(T+1)=γ(1H)t\sigma\gamma^{-H(T+1)}=\gamma^{(1-H)t}. In the first integral, the change of variables x=σγHtx=\sigma\gamma^{-Ht} yields

where z=σ1HγH(1H)z=\sigma^{1-H}\gamma^{-H(1-H)}. In the second integral, the change of variables x=γ(1H)tx=\gamma^{(1-H)t} yields

Assume that ε(x)\varepsilon(x) is a nonincreasing nonnegative function. Fix τ>0\tau>0 and γ>1\gamma>1, and define recursively y0=0y_{0}=0, yk=τ+γyk1y_{k}=\tau+\gamma y_{k-1}. Then, for every c>0c>0 and σcτγ1\sigma\geq\frac{c\tau}{\gamma-1},

Proof. Consider first the case where c=1c=1 and τ=γ1\tau=\gamma-1, i.e., yk=γk1y_{k}=\gamma^{k}-1, and set z=σ+γ1z=\sigma+\gamma-1. For σ1\sigma\geq 1, each summand is bounded by

Since both terms are nonincreasing, we can bound the series by

We choose T=log(σ+γ1)logγ1T=\frac{\log(\sigma+\gamma-1)}{\log\gamma}\geq 1, so that γT=σ+γ1\gamma^{T}=\sigma+\gamma-1, and change variables x=γtx=\gamma^{t} in the second integral to obtain