Research

Federated Learning

Federated learning faces huge challenges from model overfitting due to the lack of data and statistical diversity among clients. To address these challenges, this paper proposes a novel personalized federated learning method via Bayesian variational inference named pFedBayes. To alleviate the overfitting, weight uncertainty is introduced to neural networks for clients and the server. To achieve personalization, each client updates its local distribution parameters by balancing its construction error over private data and its KL divergence with global distribution from the server. Theoretical analysis gives an upper bound of averaged generalization error and illustrates that the convergence rate of the generalization error is minimax optimal up to a logarithmic factor. Experiments show that the proposed method outperforms other advanced personalized methods on personalized models, e.g., pFedBayes respectively outperforms other SOTA algorithms on MNIST, FMNIST and CIFAR-10 under non-i.i.d. limited data. 

Related publications

1. Personalized Federated Learning via Variational Bayesian Inference

Xu Zhang, Yinchuan Li, Wenpeng Li, Kaiyang Guo, Yunfeng Shao, International Conference on Machine Learning (ICML),  July 2022. [Paper] 

2. Sparse Personalized Federated Learning 

Xiaofeng Liu ,  Yinchuan Li,  Qing Wang ,  Xu Zhang*, Yunfeng Shao ,  and Yanhui Geng , IEEE Transactions on Neural Networks and Learning Systems, March 2023. [Paper]

3. Federated Learning via Variational Bayesian Inference: Personalization, Sparsity and Clustering

Xu Zhang, Wenpeng Li, Yunfeng Shao, Yinchuan Li*, submitted to IEEE Transactions on  Pattern Analysis and Machine Intelligence, 2023. [paper]


Distributed Remote Estimation over the Collision Channel

Internet of Things networks are large-scale distributed systems consisting of a massive number of simple devices communicating, typically, over a shared wireless medium. This new paradigm requires novel ways of coordinating access to limited communication resources without introducing unreasonable delays. Herein, the optimal design of a remote estimation system with $n$ sensors communicating with a fusion center via a collision channel of limited capacity k < n is considered.  In particular, for independent and identically distributed observations with a symmetric probability density function, we show that the problem of minimizing the mean-squared error with respect to a threshold strategy is quasi-convex. When coordination among sensors via a local communication network is available, the on-line learning of possibly unknown parameters of the probabilistic model is possible, enabling each sensor to optimize its own threshold autonomously. We propose two strategies for remote estimation with local communication: One strategy swiftly reaches the performance of the optimal decentralized threshold policy, whereas a second strategy approaches the performance of the optimal centralized scheme with a slower convergence rate. A hybrid scheme that combines the best of both approaches is proposed, offering fast convergence and excellent performance.

Related publications

1. An optimal symmetric threshold strategy for remote estimation over the collision channel

Xu Zhang, Marcos M. Vasconcelos, Wei Cui, and Urbashi Mitra, International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2020. [Paper]

2. Distributed remote estimation over the collision channel with and without local communication

Xu Zhang, Marcos M. Vasconcelos, Wei Cui, and Urbashi Mitra,  IEEE Transactions on  Control of Network System, June 2021. [Paper]

3. Robust remote estimation over the collision channel in the presence of an intelligent jammer  

Xu Zhang  and Marcos M. Vasconcelos*, accepted in IEEE Conference on Decision and Control, Dec. 2022. [Paper]

4. Robust one-shot estimation over shared networks in the presence of denial-of-service attacks 

Xu Zhang and Marcos Vasconcelos*,  submitted to IEEE Transactions on  Automatic Control , 2023. [Paper]


Recovery of Structured Signals with Prior Information via Maximizing Correlation

This research considers the problem of recovering a structured signal from a relatively small number of noisy measurements with the aid of a similar signal which is known beforehand. We propose a new approach to integrate prior information into the standard recovery procedure by maximizing the correlation between the prior knowledge and the desired signal. We then establish performance guarantees (in terms of the number of measurements) for the proposed method under sub-Gaussian measurements. Specific structured signals including sparse vectors, block-sparse vectors, and low-rank matrices are also analyzed. Furthermore, we present an interesting geometrical interpretation for the proposed procedure. Our results demonstrate that if prior information is good enough, then the proposed approach can (remarkably) outperform the standard recovery procedure.

Related Publications

1. Recovery of Structured Signals With Prior Information via Maximizing Correlation

Xu Zhang, Wei Cui, and Yulong Liu, IEEE Transactions on Signal Processing, May 2018. [Paper]

2. Compressed Sensing with Prior Information via Maximizing Correlation

Xu Zhang, Wei Cui, and Yulong Liu,  IEEE International Symposium on Information Theory (ISIT), June 2017. [Paper]

3. Matrix Completion with Prior Subspace Information via Maximizing Correlation 

Xu Zhang, Wei Cui, and Yulong Liu, 2020. [Paper]

4. Spectrally Sparse Signal Recovery via Hankel Matrix Completion with Prior Information 

Xu Zhang, Yulong Liu, and Wei Cui, IEEE Transactions on Signal Processing, March 2021. [Paper]


Covariance Matrix Estimation from Correlated Samples

Covariance matrix estimation concerns the problem of estimating the covariance matrix from a collection of samples, which is of extreme importance in many applications. Classical results have shown that O(n) samples are sufficient to accurately estimate the covariance matrix from n -dimensional independent Gaussian samples. However, in many practical applications, the received signal samples might be correlated, which makes the classical analysis inapplicable. In this paper, we develop a non-asymptotic analysis for the covariance matrix estimation from linearly-correlated Gaussian samples. Our theoretical results show that the error bounds are determined by the signal dimension n , the sample size m , and the shape parameter of the distribution of the correlated sample covariance matrix. Particularly, when the shape parameter is a class of Toeplitz matrices (which is of great practical interest), O(n) samples are also sufficient to faithfully estimate the covariance matrix from correlated samples. Simulations are provided to verify the correctness of the theoretical results. 

Related publications

1. Covariance Matrix Estimation from Correlated Sub-Gaussian Samples

Xu Zhang, Wei Cui, and Yulong Liu, 2019. [Paper]

2. Covariance Matrix Estimation from Linearly-Correlated Gaussian Samples

Wei Cui, Xu Zhang, and Yulong Liu, IEEE Transactions on Signal Processing, March 2019. [Paper]