Federated Learning for Ultra-Reliable Low-Latency V2V Communications

Sumudu Samarakoon, Mehdi Bennis, Walid Saad, Merouane Debbah

I Introduction

Vehicle-to-vehicle (V2V) communication is a key enabler for autonomous driving and intelligent transportation systems . The ability to extend vehicles’ field-of-view improves traffic safety and enhances the driving experience. Furthermore, V2V communication helps to increase the efficiency of transportation systems via platooning . However, the performance of autonomous applications such as real-time navigation, collision avoidance, and platooning heavily rely on the ability to communicate with extremely low errors and delays. In this regard, achieving ultra-reliable low-latency communication (URLLC) is instrumental in enabling this vision . Since over-the-air latency and queuing latency are coupled, ensuring low queuing latency is required to achieve the target end-to-end latency of 1 ms calling for efficient radio resource management (RRM) techniques . Towards this end, the majority of existing works including and focus on improving the average latency of vehicular networks while a handful such as and impose a probabilistic constraint to maintain small queue lengths.

Although a probabilistic constraint on the queue length improves network reliability, it fails to control rare events in which large queue lengths occur with low probability. Therefore, some of the vehicular users (VUEs) may experience unacceptable latencies yielding degraded performance . Extreme value theory (EVT), a powerful tool that characterizes the occurrences of extreme events with low probability, can be used to model the tail distribution in terms of three parameters known as location, scale, and shape . EVT asymptotically characterizes the statistics of extreme events by providing analytical models to analyze network traffic, worst-case delays, peak-to-average-power ratios, and ultra-reliable V2V communication in wireless systems . The challenge behind using EVT lies in gathering sufficient samples of extreme events that occur rarely to accurately model the tail distributions. In V2V communication, VUEs may have access to limited number of queue length samples locally exceeding a high threshold, and hence they are unable to estimate the tail distribution of network-wide queue lengths. Therefore, roadside units (RSUs) can assist in gathering samples over the network at the cost of additional data exchange overheads. Furthermore, VUEs may be unwilling to share their individual queue state information (QSI) with an RSU and other VUEs as it may exhaust resources available for V2V communication.

This shortcoming warrants a collaborative learning model that does not rely on sharing individual QSI. Recently, federated learning (FL) was proposed as a decentralized learning technique where training datasets are unevenly distributed over learners, instead of centralizing all the data . FL allows building learning models by sharing local models (set of learning parameters) with a central entity within few communication iterations. Another notable advantage of FL is that it does not rely on synchronization among learners (e.g., VUEs). Henceforth, even during a loss of connectivity between VUEs and RSUs, VUEs can still build their local models and navigate; this feature is crucial for highly dynamic and mission critical V2V communication. To the best of our knowledge, no prior work has studied the use of federated learning in the context of ultra-reliable and low-latency communication.

The main contribution of this paper is to propose a distributed joint transmit power and resource allocation framework for enabling ultra-reliable and low-latency vehicular communication. We formulate a network wide power minimization problem while ensuring low latency and high reliability in terms of probabilistic queue lengths. To model reliability, we first obtain the statistics of the queue lengths exceeding a high threshold by using the EVT concepts of a generalized Pareto distribution (GPD) . Using the statistics of the GPD, we impose a local constraint on the excess queues for each VUE. Here, the parameters of the GPD known as scale and shape, are obtained by using maximum likelihood estimation (MLE). In contrast to the classical design of MLE which requires a central controller (e.g., RSU) collecting samples of queues exceeding a threshold from all VUEs in the network, leveraging principles of FL, we propose a distributed MLE technique that does not require sharing the queue length samples with an RSU or VUEs. Specially, over time, every vehicle builds and sends its own local model (GPD parameters) to the RSU, which in turn aggregates, does model averaging across vehicles, and feeds back the global model to VUEs. Leveraging different time scales, each VUE learns its GPD parameters locally in a short time scale while the model averaging (global learning) takes place in a longer time scale. Finally, Lyapunov optimization is used to decouple and solve the network-wide optimization problem per VUE. Simulation results show that the proposed approach estimates GPD parameters as accurate as a centralized learning module and yields significant gains in terms of reducing VUEs with large queue lengths and improving power consumption.

The rest of the paper is organized as follows. Section II describes the system model and the problem formulation. The distributed solution based on EVT and Lyapunov optimization is presented in Section III. In Section IV, estimation of the extreme value distribution using FL is discussed. Section V evaluates the proposed solution by extensive set of simulations. Finally, conclusions are drawn in Section VI.

II System Model and Problem Definition

Consider a vehicular network consisting a set of U\mathcal{U} of UU VUE pairs exchanging information, with the aid of an RSU that allocates a set N\mathcal{N} of resource blocks (RBs) over a partition of the network Z\mathcal{Z} defined as zones. Here, a zone consists of VUE pairs that are located far from each other. Moreover, RBs are orthogonally allocated over the zones to reduce the interference among nearby VUE pairs. Hence, a VUE transmitter-receiver (vTx-vRx) pair uu is only allowed to use the set Nz(t,u)\mathcal{N}_{z(t,u)} of RBs allocated for its corresponding zone z(t,u)z(t,u) at time tt. The system layout is illustrated in Fig. 1.

Let pu(t)=[pun(t)]nNz(t,u)\boldsymbol{p}_{u}(t)=[p_{u}^{n}(t)]_{n\in\mathcal{N}_{z(t,u)}} and huu(t)=[huun(t)]nNz(t,u)\boldsymbol{h}_{uu^{\prime}}(t)=[h_{uu^{\prime}}^{n}(t)]_{n\in\mathcal{N}_{z(t,u)}} be, respectively, the transmit power of VUE uu and the channel gain between vTx uu and vRx uu^{\prime} over the RBs at time tt. Depending on whether the vTx and vRx are located in the same lane or separately in perpendicular lanes, the channel model is categorized into three types: i) Line-of-sight (LOS): both vTx uu and vRx uu^{\prime} are located in the same lane, ii) Weak-line-of-sight (WLOS): vTx uu and vRx uu^{\prime} are in perpendicular lanes and at least one of them is located at a distance of no more than d0d_{0} from the corresponding intersection, and iii) Non-line-of-sight (NLOS), otherwise. Let (xu,yu)(x_{u},y_{u}) and (xu,yu)(x_{u^{\prime}},y_{u^{\prime}}) be the Cartesian coordinates of vTx uu and vRx uu^{\prime}, respectively. As such, the path loss model of channel huuh_{u^{\prime}u} is based on the path loss model for urban areas using 5.9 GHz carrier frequency as follows :

where Iun(t)=uU{u}huun(t)pun(t)I_{u}^{n}(t)=\sum_{u^{\prime}\in\mathcal{U}\setminus\{u\}}h_{u^{\prime}u}^{n}(t)p_{u^{\prime}}^{n}(t) is the interference from other vTxs, WW is the bandwidth of each RB, and N0N_{0} is the noise power spectral density. At each time tt, au(t)a_{u}(t) data bits are randomly generated with a mean of au\overline{a}_{u} at vTx uu that must be delivered to its corresponding vRx. Thus, at the vTx, a data queue is maintained which has the following dynamics:

Since most vehicular applications rely on exchanging information with guaranteed low end-to-end latency (mission critical), such networks demand higher network resources. As such, our objective is to minimize the network-wide power consumption while ensuring URLLC. Here, the reliability is achieved by ensuring queue stability for each vTx while maintaining outages in terms of exceeding a pre-defined queue threshold q0q_{0} below a certain probability ϵ\epsilon. The reliability conditions are defined as follows:

Note that the above reliability constraints have no control over the extreme cases with qu(t)>q0q_{u}(t)>q_{0}. This impacts the worst case network queuing latency (as well as end-to-end latency ) and thus, needs to be properly addressed. In this regard, let M(t)M(t) be a sample of excess value of the queue of any VUE over the threshold q0q_{0} at time tt and M{M(t)}tM\in\{M(t)\}_{\forall t}. By imposing a constraint,

Here, (7c) ensures queue dynamics and reliability while controlling the worst-case latency over all VUEs and p0p_{0} is the transmit power budget of a VUE. Solving (7) which implies finding the optimal transmission control policy over time, is challenging due to two reasons: i) A decision at time tt relies on future network states, and ii) The characteristics of the distribution of MM for constraint (6) are unavailable. Furthermore, in the current context, solving (7) relies on a centralized controller. This requires exchanging channel state information (CSI) and QSI over the whole network yielding unacceptable signaling overheads. Therefore, new analytical tools are required to build a tractable model and to provide a distributed solution.

III Proposed Distributed Framework using EVT and Lyapunov Optimization

Proposing a distributed solution for network-wide power minimization problem relies on the ability to decouple (7) over VUE pairs. Therefore, the rest of the discussion investigates techniques to decouple the objective function (7a) and the constraint (6) based on the statistics of excess queues over the network.

The excess queues {M(t)}t\{M(t)\}_{\forall t} are known as extreme statistics of the system, and can be characterized using EVT. Assume that the individual queues at a given time [qu(t)]uU[q_{u}(t)]_{u\in\mathcal{U}} are samples of independent and identical distributions (i.i.d.) and the queue threshold q0q_{0} is large. Then, the distribution of MM can be modeled as a generalized Pareto distribution (GPD) using [12, Theorem 3.2.5]. This theorem mainly shows that as q0sup{qPr(M(t)>q)>0}q_{0}\to\sup\{q|\text{Pr}(M(t)>q)>0\}, the conditional probability distribution of M(t)=q(t)q0M(t)=q(t)-q_{0} is given by,

Assisted by the RSU, each VUE pair estimates ξ\xi and σ\sigma locally without sharing the QSI allowing to decouple the constraint (6) and impose it locally as in (9).

III-B Lyapunov Optimization

By using EVT to model MM and its expected value, we recast the original problem into an equivalent form:

Let Ξu(t)=[qu(t),Υu(t),Au(t))]\Xi_{u}(t)=[q_{u}(t),\Upsilon_{u}(t),A_{u}(t))] be the combined queue with Ξ(t)=[Ξu(t)]uU\boldsymbol{\Xi}(t)=[\Xi_{u}(t)]_{u\in\mathcal{U}} and its quadratic Lyapunov function L(Ξ(t))=12Ξ(t)Ξ(t)L(\boldsymbol{\Xi}(t))=\frac{1}{2}\boldsymbol{\Xi}^{\dagger}(t)\boldsymbol{\Xi}(t). The one-slot drift of the Lyapunov function is defined as ΔLt=L(Ξ(t+1))L(Ξ(t))\Delta L_{t}=L(\boldsymbol{\Xi}(t+1))-L(\boldsymbol{\Xi}(t)). Given that,

the Lyapunov drift can be simplified and bounded as follows:

IV Learning the Parameters of the Maximum Queue Distribution

The proposed distributed control mechanism relies on the characteristics of the excess queue distribution GMd(m)G_{M}^{\boldsymbol{d}}(m). Therefore, the estimation of the parameters σ\sigma and ξ\xi with high accuracy using QSI samples gathered at each VUE is crucial. In this regard, modeling the excess queue distribution requires a central controller (e.g., the RSU) to compute and communicate with all VUEs at each time tt. However, this RSU-centric approach is impractical due to the fact that: i) The overhead needed for frequent communications with all the VUEs in a highly dynamic network will degrade the network-wide performance, and ii) VUEs may prefer not to share their QSI with other vehicles, in which warrants collaborative learning techniques . Therefore, next, we propose a distributed solution based on FL that allows each VUE to learn the GPD parameters (local model) individually using local QSI observations and minimal communication with the RSU. In turn, the RSU averages the received parameters (global model) and sends them back to the VUEs, as summarized in Fig. 2.

IV-B Estimating the GPD parameters

As shown in Section III-A, the distribution of the excess queue length samples is characterized by two parameters d=[σ,ξ]\boldsymbol{d}=[\sigma,\xi] which need to be estimated. For this purpose, we use MLE . The goal of MLE is to find the best set of parameters d\boldsymbol{d} that fits the GPD GXd()G^{\boldsymbol{d}}_{X}(\cdot) to the samples via maximizing the log likelihood function, or minimizing the negative of it, as follows:

where D(Q)={[σ,ξ]2σ>0,ξ<1,1+ξQ/σ)0 for all QQ}\mathcal{D}(\mathcal{Q})=\{[\sigma,\xi]\in\Re^{2}|\sigma>0,\xi<1,1+\xi Q/\sigma)\geq 0\text{~{}for all~{}}Q\in\mathcal{Q}\} is the feasible set and Q={Qu}uU\mathcal{Q}=\{\mathcal{Q}_{u}\}_{u\in\mathcal{U}} is the set of network queue length samples. Ideally, solving (15) requires a central processor (RSU in our scenario) with queue length samples of all the VUEs. Note that the likelihood function is a smooth function of d\boldsymbol{d} and a summation over all the samples in Q\mathcal{Q}, and thus, its gradient over a sample QQ is given as follows:

The derivative coefficient of the negative log-likelihood function of GPD at queue length sample QQ w.r.t. d\boldsymbol{d} is,

Therefore, any gradient descent technique-based algorithm can be used to determine the optimal d\boldsymbol{d}^{\star} in an iterative manner. However, in our model, this is not applicable due to; i) Presence of large number of VUEs and thus, increased overhead of sharing large amount of samples, and ii) VUE unwillingness for sharing their QSI by exhausting network resources that may incur additional latencies. Therefore, a mechanism to estimate the parameters locally while leveraging the RSU for model aggregation is needed. For this purpose, we adopt FL as discussed below.

To this end, we rewrite the likelihood function as follows:

where κu=QuQ=KuuKu\kappa_{u}=\frac{|\mathcal{Q}_{u}|}{|\mathcal{Q}|}=\frac{K_{u}}{\sum_{u^{\prime}}K_{u^{\prime}}}. Here, the likelihood function of the network is presented as a weighted sum of likelihood functions per VUE. Hereinafter, for simplicity, we use fdf^{\boldsymbol{d}} and fudf^{\boldsymbol{d}}_{u} instead of fd(Q)f^{\boldsymbol{d}}(\mathcal{Q}) and fd(Qu)f^{\boldsymbol{d}}(\mathcal{Q}_{u}), respectively. The idea behind FL is to use fudf^{\boldsymbol{d}}_{u} to evaluate dfud\nabla_{\boldsymbol{d}}f^{\boldsymbol{d}}_{u} and d\boldsymbol{d} locally, say du\boldsymbol{d}_{u}, and update the local estimations via sharing the individual learning models (dfud,du,Ku)(\nabla_{\boldsymbol{d}}f^{\boldsymbol{d}}_{u},\boldsymbol{d}_{u},K_{u}). As the local sample size can be large, the complexity of computing the gradient can be high. Therefore, it is assumed that each VUE uses the stochastic variance reduced gradient (SVRG) to evaluate the gradients with a predefined step size δ(>0)\delta(>0) . The GPD parameter estimation procedure using FL-based MLE is presented in Algorithm 1.

V Numerical Results

In order to evaluate the performance of the proposed solution, a network based on a 250 m×\times250 m Manhattan mobility model with nine intersections is considered. Therein, a road consists of two lanes with 4 m width for each direction. VUE pairs are uniformly located within each lane with a fixed gap of 50 m and vRx always follows vTx with 60 kmph speed. VUEs share 60 RBs and have a maximum transmit power of p0=10p_{0}=10 W. The RB allocation per zone is adopted from and . The rest of the parameter values are presented in Table I.

Here, the proposed FL approach is compared with a centralized solution based on the SVRG method at the RSU to estimate the GPD parameters. Therein, all VUEs upload their local excess queue length samples to the RSU which estimates and shares the GPD parameters with all VUEs. Fig. 3 illustrates the estimated GPDs for both FL and centralized approaches with the corresponding excess queue length samples for different VUEs, U=U= 20, 60, and 100. It can be noted that the FL estimations are almost equivalent to the centralized estimations.

In Fig. 4, we compare the amount of data exchange and the achieved reliability in terms of maintaining the queue length below the threshold q0q_{0} for different VUE densities. As the reliability decreases with increasing the number of VUEs, our FL-based approach achieves a reliability that is nearly equal to the one resulting from the centralized approach. Note that the centralized method requires all VUEs to upload all their queue length samples to the RSU and to receive the estimated GPD parameters. In contrast, in the proposed method, VUEs upload their locally estimated model (GPD parameters, gradient, and sample size) and receive the global estimation of the model. For fewer number of VUEs, U=20U=20, the sample size of the network is small, and thus the centralized method can operate efficiently using very few data samples. In contrast, in FL, VUEs must upload and download both parameters and gradients yielding higher data exchange compared to the centralized method. However, as the number of VUEs increases (beyond 28), the sample size grows, and thus, the centralized method incurs higher amount of data exchanged between the RSU and VUEs compared to the proposed method. The reductions of the exchanged data in the proposed method compared to the centralized model is about 27% for U=28U=28 and improves up to 79% when U=100U=100. Since the overall performance in terms of reliability of both methods is similar, the proposed method becomes efficient due to the reductions in data exchange overhead.

V-B Performance evaluation

Next, the proposed approach is compared with a “fixed power” model using fixed transmit power and two other baseline models focusing on minimizing the total power consumption namely: i) Baseline 1: a V2V network with the only interest of queue stability (3)-(4), and ii) Baseline 2: a V2V network with the probabilistic constraint on average queue length (3)-(5).

Fig. 5 compares the transmit power and excess queue length tradeoff for VUEs exceeding the queue threshold q0q_{0} for the aforementioned methods. In Table II, the statistics of average transmit power, queuing latency, excess queue length, and unreliability defined as the probability that the queue length grows larger than q0q_{0} corresponding to the scenarios used in Fig. 5 are presented. Note that the fixed power method has no control on queue stability, and hence, it yields VUEs with large queues. Therefore, the excess queues for the fixed power model is intentionally neglected from Fig. 5. From Table II, we can see that the fixed power model yields the highest excess queue lengths on average, as well as the highest number of VUEs exceeding q0q_{0} for all three cases with different VUE densities.

From Fig. 5 and Table II, we can see that Baseline 1 uses the lowest transmit powers in all three cases. The reason is that Baseline 1’s only concern is to minimize the transmit power while ensuring queue stability without a constraint on queue lengths. However, Baseline 1 results in larger excess queue lengths as shown in Fig. 5. Moreover, Table II shows that Baseline 1 exhibits higher excess queue lengths on average, increased average latencies, and more VUEs exceeding q0q_{0} compared to Baseline 2 and the proposed method. It can be noted that this performance gap increases with the increasing VUE densities.

The advantage of Baseline 2 over Baseline 1 is in terms of the reduced excess queue lengths as shown in Fig. 5. Its improvements are close to the proposed method due to the additional probabilistic constraint on the queue length compared to Baseline 1. Although the difference between the proposed approach and Baseline 2 method in Fig. 5 is hard to distinguish as UU increases, from Table II, we can observe that the reductions of excess queue lengths on average increase up to 3% in the proposed method over Baseline 2 for U=100U=100 VUEs. However, the average latency of Baseline 2 outperforms the proposed for U=U= 20 and 60 while being slightly higher when U=100U=100. The main advantage of the proposed method is the reduction in the extreme cases where VUEs exceed the queue threshold. The proposed method reduces the fraction of VUEs exceeding queue threshold by 60%, 8%, and 9% compared to Baseline 2 without additional transmit power on average for U=U= 20, 60, and 100, respectively. Compared to the fixed power and Baseline 1 models, the reductions in the fraction of VUEs exceeding queue threshold are as high as 97-99%.

VI Conclusions

In this paper, we have formulated the problem of joint power control and resource allocation for V2V communication network as a network-wide power minimization problem subject to ultra reliability and low latency constraints. The constraints on URLLC are characterized using extreme value theory and modeled as the tail distribution of the network-wide queue lengths over a predefined threshold. Leveraging concepts of federated learning, a distributed learning mechanism is proposed where VUEs estimate the tail distribution locally with the assistance of a RSU. Here, FL enables VUEs to learn the tail distribution of the network-wide queues locally without sharing the actual queue length samples reducing unnecessary overheads. Combining both EVT and FL approaches, we have proposed a Lyapunov-based distributed transmit power and resource allocation procedure for VUEs. Using simulations, we have shown that the proposed method learns the statistics of the network-wide queues with high accuracy. Furthermore, the proposed method shows considerable gains in reducing extreme events where the queue lengths grow beyond a predefined threshold compared to systems that account for reliability by imposing probabilistic constraints on the average queue lengths.

Appendix A Proof of Proposition 1

Let gd(Q)=(1+ξQ/σ)1/ξg^{\boldsymbol{d}}(Q)=(1+\xi Q/\sigma)^{-1/\xi}. Since gd(Q)eQ/σg^{\boldsymbol{d}}(Q)\to e^{-Q/\sigma} as ξ0\xi\to 0, the distribution can be rewritten as GXd(Q)=1σgd(Q)ξ+1G^{\boldsymbol{d}}_{X}(Q)=\frac{1}{\sigma}g^{\boldsymbol{d}}(Q)^{\xi+1}. Using the above notation, it can be noted that,

Hence, dfd(Q)=1QQQdfd(Q)\nabla_{\boldsymbol{d}}f^{\boldsymbol{d}}(\mathcal{Q})=\frac{1}{|\mathcal{Q}|}\sum_{Q\in\mathcal{Q}}\nabla_{\boldsymbol{d}}f^{\boldsymbol{d}}(Q) is held.

First, the gradient of gd(Q)g^{\boldsymbol{d}}(Q) is found by,

Thus, the gradient of fd(Q)f^{\boldsymbol{d}}(Q) can be calculated as follows:

References