Distributed Stein Variational Gradient Descent (DSVGD) (Kassab & Simeone, 2020) is a non-parametric, generalized Bayesian inference framework for federated learning. DSVGD maintains a number of non-random, interacting particles at a central server to represent the current iterate of the global posterior over the model parameters. The particles are iteratively downloaded and updated by one of the agents at a time, with the end goal of minimizing the global free energy. By varying the number of particles, DSVGD enables a flexible trade-off between per-iteration communication load and number of communication rounds. DSVGD is shown to compare favorably with benchmark frequentist and Bayesian federated learning strategies, while also providing well-calibrated, and hence trustworthy, predictions.
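Each scheduled agent updates the downloaded particles by running SVGD steps locally. As a rough illustration of the underlying particle transport (not the exact DSVGD local update, which targets the agent's tilted distribution so as to reduce the global free energy), here is a minimal NumPy sketch of a single SVGD step with an RBF kernel; `grad_log_target`, the step size, and the bandwidth are placeholders.

```python
import numpy as np

def svgd_step(particles, grad_log_target, step_size=1e-2, bandwidth=1.0):
    """One SVGD step: move particles along the kernelized Stein direction."""
    n = particles.shape[0]
    # Pairwise differences and RBF kernel matrix between particles (n, d).
    diffs = particles[:, None, :] - particles[None, :, :]                 # (n, n, d)
    k = np.exp(-np.sum(diffs ** 2, axis=-1) / (2.0 * bandwidth ** 2))     # (n, n)
    grad_k = -diffs * k[:, :, None] / bandwidth ** 2                      # grad of k wrt its first argument
    scores = grad_log_target(particles)                                   # (n, d)
    # phi(x_i) = (1/n) * sum_j [ k(x_j, x_i) * score(x_j) + grad_{x_j} k(x_j, x_i) ]
    phi = (k @ scores + grad_k.sum(axis=0)) / n
    return particles + step_size * phi

# Toy usage: 50 particles approximating a 1-D standard Gaussian target.
particles = np.random.default_rng(0).uniform(-6.0, 6.0, size=(50, 1))
for _ in range(200):
    particles = svgd_step(particles, grad_log_target=lambda x: -x)
```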
This repository contains the code for the experiments conducted in the paper Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent (Kassab & Simeone, 2020), namely Bayesian logistic regression, regression with Bayesian neural networks, and multi-label classification with Bayesian neural networks. We also include the code for the benchmarks used in each experiment, namely:
- Federated Averaging (FedAvg) (McMahan et al., 2017).
- Distributed Stochastic Gradient Langevin Dynamics (DSGLD) (Ahn et al., 2014).
- Stochastic Gradient Langevin Dynamics (SGLD) (Welling & Teh, 2011).
The code URLs for the remaining benchmarks can be found in their corresponding papers; for reference, a sketch of the basic SGLD update used by the Langevin baselines follows.
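This is a rough NumPy illustration of the Welling & Teh (2011) update, not the exact benchmark code used in the experiments; the gradient functions, step size, and minibatch sizes are placeholders.

```python
import numpy as np

def sgld_step(theta, grad_log_prior, grad_log_lik_batch, n_total, n_batch,
              step_size=1e-3, rng=None):
    """One SGLD step: noisy stochastic-gradient ascent on the log-posterior.

    grad_log_prior(theta)     -> gradient of log p(theta)
    grad_log_lik_batch(theta) -> sum of gradients of log p(x_i | theta)
                                 over a minibatch of size n_batch
    """
    rng = np.random.default_rng() if rng is None else rng
    # Unbiased estimate of the full-data gradient of the log-posterior.
    grad = grad_log_prior(theta) + (n_total / n_batch) * grad_log_lik_batch(theta)
    # Injected Gaussian noise with variance equal to the step size.
    noise = rng.normal(scale=np.sqrt(step_size), size=theta.shape)
    return theta + 0.5 * step_size * grad + noise
```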
In the animated figure below, we show DSVGD's operation with two agents. The green and orange dashed curves correspond to the local posteriors at agents 1 and 2, scheduled during odd and even global iteration indices i, respectively. The shaded area represents the optimal normalized global posterior (obtained by normalizing the product of the two local posteriors) to be approximated at both agents. The blue solid line represents a Kernel Density Estimate (KDE) over the particles of the scheduled agent after L local SVGD iterations. A uniform U(-6, 6) prior is used. Notice that at i=1, agent 1 obtains an approximation of its local posterior (green dashed line), while at i=2, agent 2 integrates the knowledge from agent 1's approximation with its own local posterior (orange dashed line) to obtain a bimodal distribution. This process continues until the particle distribution (blue solid line) converges to an approximation of the optimal normalized global posterior (shaded area).
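The quantities shown in the figure are straightforward to compute: the shaded curve is the pointwise product of the two local posteriors, renormalized over the U(-6, 6) support, and the blue curve is a Gaussian KDE fitted to the current particles. The sketch below uses two arbitrary Gaussian local posteriors and random placeholder particles purely for illustration; it does not reproduce the exact setting of the figure.

```python
import numpy as np
from scipy.stats import gaussian_kde, norm

# Grid over the support of the uniform U(-6, 6) prior.
grid = np.linspace(-6.0, 6.0, 1000)

# Hypothetical 1-D local posteriors of agents 1 and 2 (illustration only).
local_post_1 = norm.pdf(grid, loc=-1.5, scale=1.0)
local_post_2 = norm.pdf(grid, loc=1.5, scale=1.0)

# Optimal normalized global posterior: product of local posteriors, renormalized.
global_post = local_post_1 * local_post_2
global_post /= np.trapz(global_post, grid)

# Kernel Density Estimate over the current particles (random placeholders here).
particles = np.random.default_rng(0).uniform(-6.0, 6.0, size=30)
kde_curve = gaussian_kde(particles)(grid)   # corresponds to the blue solid curve
```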
We use NumPy v1.18.1, PyTorch v1.3.1, and Theano v1.0.4 for the Bayesian neural network experiments.
Each experiment folder contains one main .py file per algorithm. When executed, it prints the performance of the corresponding algorithm as a function of the number of communication rounds. The algorithms use general-purpose functions from the Library/general_functions.py file and experiment-specific functions from the corresponding Library/experiment_name.py file.
If you find this repository helpful, please consider citing our paper using the following BibTeX entry:
@article{kassab2020federated,
  title={Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent},
  author={Kassab, Rahif and Simeone, Osvaldo},
  journal={arXiv preprint arXiv:2009.06419},
  year={2020}
}
If you have any questions or suggestions, or would like to discuss or collaborate, feel free to drop me an email.