Improved Information-Theoretic Generalization Bounds for Distributed, Federated, and Iterative Learning

Leighton Pate Barnes; Alex Dytso; Harold Vincent Poor

doi:10.3390/e24091178

Entropy (Aug 2022)

Improved Information-Theoretic Generalization Bounds for Distributed, Federated, and Iterative Learning

Leighton Pate Barnes,
Alex Dytso,
Harold Vincent Poor

Affiliations

Leighton Pate Barnes: Department of Electrical and Computer Engineering, Princeton University, Princeton, NJ 08544, USA
Alex Dytso: Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ 07102, USA
Harold Vincent Poor: Department of Electrical and Computer Engineering, Princeton University, Princeton, NJ 08544, USA

DOI: https://doi.org/10.3390/e24091178
Journal volume & issue: Vol. 24, no. 9
p. 1178

Abstract

Read online

We consider information-theoretic bounds on the expected generalization error for statistical learning problems in a network setting. In this setting, there are K nodes, each with its own independent dataset, and the models from the K nodes have to be aggregated into a final centralized model. We consider both simple averaging of the models as well as more complicated multi-round algorithms. We give upper bounds on the expected generalization error for a variety of problems, such as those with Bregman divergence or Lipschitz continuous losses, that demonstrate an improved dependence of 1/K on the number of nodes. These “per node” bounds are in terms of the mutual information between the training dataset and the trained weights at each node and are therefore useful in describing the generalization properties inherent to having communication or privacy constraints at each node.

Published in Entropy

ISSN: 1099-4300 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Astronomy: Astrophysics; Science: Physics
Website: http://www.mdpi.com/journal/entropy

About the journal

Abstract

Keywords