Optimal Quantization for Compressive Sensing under Message Passing Reconstruction

Ulugbek Kamilov, Vivek K Goyal, Sundeep Rangan

I Introduction

By exploiting signal sparsity and smart reconstruction schemes, compressive sensing (CS) can enable signal acquisition with fewer measurements than traditional sampling. In CS, an nn-dimensional signal xx is measured through mm random linear measurements. Although the signal may be undersampled (m<nm<n), it may be possible to recover xx assuming some sparsity structure.

So far, most of the CS literature has considered signal recovery directly from linear measurements. However, in many practical applications, measurements have to be discretized to a finite number of bits. The effect of such quantized measurements on the performance of the CS reconstruction has been studied in . In the authors adapt CS reconstruction algorithms to mitigate quantization effects. In , high-resolution functional scalar quantization theory was used to design quantizers for LASSO reconstruction .

The contribution of this paper to the quantized CS problem is twofold: First, for quantized measurements, we propose reconstruction algorithms based on Gaussian approximations of belief propagation (BP). BP is a graphical model-based estimation algorithm widely used in machine learning and channel coding that has also received significant recent attention in compressed sensing . Although exact implementation of BP for dense measurement matrices is generally computationally difficult, Gaussian approximations of BP have been effective in a range of applications . We consider a recently developed Gaussian-approximated BP algorithm, called relaxed belief propagation , that extends earlier methods to nonlinear output channels. We show that the relaxed BP method is computationally simple and, with quantized measurements, provides significantly improved performance over traditional CS reconstruction based on convex relaxations.

Our second contribution concerns the quantizer design. With linear reconstruction and mean-squared error distortion, the optimal quantizer simply minimizes the mean squared error (MSE) of the transform outputs. Thus, the quantizer can be optimized independently of the reconstruction method. However, when the quantizer outputs are used as an input to a nonlinear estimation algorithm, minimizing the MSE between quantizer input and output is not necessarily equivalent to minimizing the MSE of the final reconstruction. To optimize the quantizer for the relaxed BP algorithm, we use the fact that the MSE under large random transforms can be predicted accurately from a set of simple state evolution (SE) equations . Then, by modeling the quantizer as a part of the measurement channel, we use the SE formalism to optimize the quantizer to asymptotically minimize distortions after the reconstruction by relaxed BP.

II Background

A common method for recovering the signal from the measurements is basis pursuit. This involves solving the following optimization problem:

II-B Scalar Quantization

where minimization is done over all NN-level regular scalar quantizers. One standard way of optimizing QQ is via the Lloyd algorithm, which iteratively updates the decision boundaries and output levels by applying necessary conditions for quantizer optimality .

However, for the CS framework finding the quantizer that minimizes MSE between s\mathbf{s} and s^\mathbf{\hat{s}} is not necessarily equivalent to minimizing MSE between the sparse vector x\mathbf{x} and its CS reconstruction from quantized measurements x^\mathbf{\hat{x}} . This is due to the nonlinear effect added by any particular CS reconstruction function. Hence, instead of solving (3), it is more interesting to solve

where minimization is performed over all NN-level regular scalar quantizers and x^\mathbf{\hat{x}} is obtained through a CS reconstruction method like relaxed BP or AMP. This is the approach taken in this work.

II-C Relaxed Belief Propagation

where ZZ is the normalization constant and za=(Ax)az_{a}=(Ax)_{a}. By marginalizing this distribution it is possible to estimate each xix_{i}. Although direct marginalization of pxy(xy)p_{\mathbf{x}\mid\mathbf{y}}(x\mid y) is computationally intractable in practice, we approximate marginals through BP . BP is an iterative algorithm commonly used for decoding of LDPC codes . We apply BP by constructing a bipartite factor graph G=(V,F,E)G=(V,F,E) from (5) and passing the following messages along the edges EE of the graph:

where \propto means that the distribution is to be normalized so that it has unit integral and integration is over all the elements of xx except xix_{i}. We refer to messages {pia}(i,a)E\{p_{i\rightarrow a}\}_{(i,a)\in E} as variable updates and to messages {p^ai}(i,a)E\{\hat{p}_{a\rightarrow i}\}_{(i,a)\in E} as measurement updates. We initialize BP by setting pia0(xi)=px(xi)p_{i\rightarrow a}^{0}(x_{i})=p_{\mathbf{x}}(x_{i}).

Earlier works on BP reconstruction have shown that it is asymptotically MSE optimal under certain verifiable conditions. These conditions involve simple single-dimensional recursive equations called state evolution (SE), which predicts that BP is optimal when the corresponding SE admits a unique fixed point . Nonetheless, direct implementation of BP is still impractical due to the dense structure of AA, which implies that the algorithm must compute the marginal of a high-dimensional distribution at each measurement node. However, as mentioned in Section I, BP can be simplified through various Gaussian approximations, including the relaxed BP method and approximate message passing (AMP) . Recent theoretical work and extensive numerical experiments have demonstrated that, in the case of certain large random measurement matrices, the error performance of both relaxed BP and AMP can also be accurately predicted by SE. Hence the optimal quantizers can be obtained in parallel for both of the methods, however in this paper we concentrate on design for relaxed BP, while keeping in mind that identical work can be done for AMP as well.

Due to space limitations, in this paper we will limit our presentation of relaxed BP and SE equations to the setting in Figure 1. See for more general and detailed analysis.

III Quantized Relaxed BP

for a=1,2,,ma=1,\,2,\,\ldots,\,m and where ϕ()\phi(\cdot) is Gaussian function

Relaxed BP can be implemented by replacing probability densities in (6) and (7) by two scalar parameters each, which can be computed according to the following rules:

where σ2\sigma^{2} is the variance of the components ηa\eta_{a}. Additionally, at each iteration we estimate the signal via

We refer to messages {x^ia,τ^ia}(i,a)E\{\hat{x}_{i\rightarrow a},\hat{\tau}_{i\rightarrow a}\}_{(i,a)\in E} as variable updates and to messages {uai,τai}(i,a)E\{u_{a\rightarrow i},\tau_{a\rightarrow i}\}_{(i,a)\in E} as measurement updates. The algorithm is initialized by setting x^ia0=x^init\hat{x}^{0}_{i\rightarrow a}=\hat{x}_{\textrm{init}} and τ^ia0=τ^init\hat{\tau}^{0}_{i\rightarrow a}=\hat{\tau}_{\textrm{init}} where x^init\hat{x}_{\textrm{init}} and τ^init\hat{\tau}_{\textrm{init}} are the mean and variance of the prior px(xi)p_{\mathbf{x}}(x_{i}). The nonlinear functions FinF_{\textrm{in}} and Ein\mathcal{E}_{\textrm{in}} are the conditional mean and variance

where q=x+v\mathbf{q}=\mathbf{x}+\mathbf{v}, xpx(xi)\mathbf{x}\sim p_{\mathbf{x}}\left(x_{i}\right), and vN(0,ν)\mathbf{v}\sim\mathcal{N}\left(0,\nu\right). Note that these functions admit closed-form expressions and can easily be evaluated for the given values of qq and ν\nu. Similarly, the functions D1D_{1} and D2D_{2} can be computed via

where the functions FoutF_{\textrm{out}} and Eout\mathcal{E}_{\textrm{out}} are the conditional mean and variance

IV State Evolution for Relaxed BP

The equations (11)–(15) are easy to implement, however they provide us no insight into the performance of the algorithm. The goal of SE equations is to describe the asymptotic behavior of relaxed BP under large measurement matrices. The SE for our setting in Figure 1 is given by the recursion

where t0t\geq 0 is the iteration number, β=n/m\beta={{n}/{m}} is a fixed number denoting the measurement ratio, and σ2\sigma^{2} is the variance of the AWGN components which is also fixed. We initialize the recursion by setting νˉ0=τ^init\bar{\nu}_{0}=\hat{\tau}_{\textrm{init}}, where τ^init\hat{\tau}_{\textrm{init}} is the variance of xix_{i} according to the prior px(xi)p_{\mathbf{x}}(x_{i}). We define the function Eˉin\bar{\mathcal{E}}_{\textrm{in}} as

where the expectation is taken over the scalar random variable q=x+v\mathbf{q}=\mathbf{x}+\mathbf{v}, with xpx(xi)\mathbf{x}\sim p_{\mathbf{x}}(x_{i}), and vN(0,ν)\mathbf{v}\sim\mathcal{N}(0,\nu). Similarly, the function Eˉout\bar{\mathcal{E}}_{\textrm{out}} is defined as

where D2D_{2} is given by (19) and the expectation is taken over pyz(yaza)p_{\mathbf{y}\mid\mathbf{z}}(y_{a}\mid z_{a}) and (z,z^)N(0,Pz(ν))(\mathbf{z},\mathbf{\hat{z}})\sim\mathcal{N}(0,P_{z}(\nu)), with the covariance matrix

One of the main results of , which we present below for completeness, was to demonstrate the convergence of the error performance of the relaxed BP algorithm to the SE equations under large sparse measurement matrices. Denote by dmd\leq m the number of nonzero elements per column of AA. In the large sparse limit analysis, first let nn\rightarrow\infty with m=βnm=\beta n and keeping dd fixed. This enables the local-tree properties of the factor graph GG. Then let dd\rightarrow\infty, which will enable the use of a central limit theorem approximation.

Consider the relaxed BP algorithm under the large sparse limit model above with transform matrix AA and index ii satisfying the Assumption 1 of for some fixed iteration number tt. Then the error variances satisfy the limit

where νˉt\bar{\nu}_{t} is the output of the SE equation (22).

Another important result regarding SE recursion in (22) is that it admits at least one fixed point. It has been showed that as tt\rightarrow\infty the recursion decreases monotonically to its largest fixed point and if the SE admits a unique fixed point, then relaxed BP is asymptotically mean-square optimal .

Although in practice measurement matrices are rarely sparse, simulations show that SE predicts well the behavior of relaxed BP. Moreover, recently more sophisticated techniques were used to demonstrate the convergence of approximate message passing algorithms to SE under large i.i.d. Gaussian matrices .

V Optimal Quantization

We now return to the problem of designing MSE-optimal quantizers under relaxed BP presented in (4). By modeling the quantizer as part of the channel and working out the resulting equations for relaxed BP and SE, we can make use of the convergence results to recast our optimization problem to

where minimization is done over all NN-level regular scalar quantizers. In practice, about 10 to 20 iterations are sufficient to reach the fixed point of νˉt\bar{\nu}_{t}. Then by applying Theorem 1, we know that the asymptotic performance of QQ^{*} will be identical to that of QSEQ^{\textrm{SE}}. It is important to note that the SE recursion behaves well under quantizer optimization. This is due to the fact that SE is independent of actual output levels and small changes in the quantizer boundaries result in only minor change in the recursion (see (21)). Although closed-form expressions for the derivatives of νˉt\bar{\nu}_{t} for large tt’s are difficult to obtain, we can approximate them by using finite difference methods. Finally, the recursion itself is fast to evaluate, which makes the scheme in (27) practically realizable under standard optimization methods like sequential quadratic programming (SQP).

VI Experimental Results

We now present experimental validation for our results. Assume that the signal x\mathbf{x} is generated with i.i.d. elements from the Gauss-Bernoulli distribution

where ρ\rho is the sparsity ratio that represents the average fraction of nonzero components of x\mathbf{x}. In the following experiments we assume ρ=0.1\rho=0.1. We form the measurement matrix AA from i.i.d. Gaussian random variables, i.e., AaiN(0,1/m)A_{ai}\sim\mathcal{N}(0,1/m); and we assume that AWGN with variance σ2=105\sigma^{2}=10^{-5} perturbs measurements before quantization.

Now, we can formulate the SE equation (22) and perform optimization (27). We compare two CS-optimized quantizers: Uniform and Optimal. We fix boundary points b0=b_{0}=-\infty and bN=+b_{N}=+\infty, and compute the former quantizer through optimization of type (3). In particular, by applying the central limit theorem we approximate elements sas_{a} of s\mathbf{s} to be Gaussian and determine the Uniform quantizer by solving (3), but with an additional constraint of equally-spaced output levels. To determine Optimal quantizer, we perform (27) by using a standard SQP optimization algorithm for nonlinear continuous optimization.

Figure 2 presents an example of quantization boundaries. For the given bit rate RxR_{x} over the components of the input vector x\mathbf{x}, we can express the rate over the measurements s\mathbf{s} as Rs=βRxR_{s}=\beta R_{x}, where β=n/m\beta=n/m is the measurement ratio. To determine the optimal quantizer for the given rate RxR_{x} we perform optimization for all β\betas and return the quantizer with the least MSE. As we can see, in comparison with the uniform quantizer obtained by merely minimizing the distortion between the quantizer input and output, the one obtained via SE minimization is very different; in fact, it looks more concentrated around zero. This is due to the fact that by minimizing SE we are in fact searching for quantizers that asymptotically minimize the MSE of the relaxed BP reconstruction by taking into consideration the nonlinear effects due to the method. The trend of having more quantizer points near zero is opposite to the trend shown in for quantizers optimized for LASSO reconstruction.

Figure 3 presents a comparison of reconstruction distortions for our two quantizers and confirms the advantage of using quantizers optimized via (22). To obtain the results we vary the quantization rate from 11 to 22 bits per component of x\mathbf{x}, and for each quantization rate, we optimize quantizers using the methods discussed above. For comparison, the figure also plots the MSE performance for two other reconstruction methods: linear MMSE estimation and the widely-used LASSO method , both assuming a bounded uniform quantizer. The LASSO performance was predicted by state evolution equations in , with the thresholding parameter optimized by the iterative approach in . It can be seen that the proposed relaxed BP algorithm offers dramatically better performance—more that 1010 dB improvement at low rates. At higher rates, the gap is slightly smaller since relaxed BP performance saturates due to the AWGN at the quantizer input. Similarly we can see that the MSE of the quantizer optimized for the relaxed BP reconstruction is much smaller than the MSE of the standard one, with more than 4 dB difference for many rates.

VII Conclusions

We present relaxed belief propagation as an efficient algorithm for compressive sensing reconstruction from the quantized measurements. We integrate ideas from recent generalization of the algorithm for arbitrary measurement channels to design a method for determining optimal quantizers under relaxed BP reconstruction. Although computationally simpler, experimental results show that under quantized measurements relaxed BP offers significantly improved performance over traditional reconstruction schemes. Additionally, performance of the algorithm is further improved by using the state evolution framework to optimize the quantizers.

References