ISFL: Federated Learning for Non-i.i.d. Data With Local Importance Sampling

Zheqi Zhu, Yuchen Shi, Pingyi Fan*, Chenghui Peng, Khaled B. Letaief

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

18 Citations (Scopus)

Abstract

As a promising learning paradigm integrating computation and communication, federated learning (FL) proceeds the local training and the periodic sharing from distributed clients. Due to the non-i.i.d. data distribution on clients, FL model suffers from the gradient diversity, poor performance, bad convergence, etc. In this work, we aim to tackle this key issue by adopting importance sampling (IS) for local training. We propose importance sampling FL(ISFL), an explicit framework with theoretical guarantees. First, we derive the convergence theorem of ISFL to involve the effects of local IS. Then, we formulate the problem of selecting optimal IS weights and obtain the theoretical solutions. We also employ a water-filling method to calculate the IS weights and develop the ISFL algorithms. The experimental results on CIFAR-10 fit the proposed theorems well and verify that ISFL reaps better performance, convergence, sampling efficiency, as well as explainability on the non-i.i.d. data. To the best of our knowledge, ISFL is the first non-i.i.d. FL solution from the local sampling aspect which exhibits theoretical compatibility with neural network models. Furthermore, as a local sampling approach, ISFL can be easily migrated into the other emerging FL frameworks.

Original languageEnglish
Pages (from-to)27448-27462
Number of pages15
JournalIEEE Internet of Things Journal
Volume11
Issue number16
DOIs
Publication statusPublished - 2024

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Keywords

  • Federated learning (FL)
  • importance sampling (IS)
  • non i.i.d data
  • water-filling optimization

Fingerprint

Dive into the research topics of 'ISFL: Federated Learning for Non-i.i.d. Data With Local Importance Sampling'. Together they form a unique fingerprint.

Cite this