IEEE Access (Jan 2021)

ST-BFL: A Structured Transparency Empowered Cross-Silo Federated Learning on the Blockchain Framework

  • Umer Majeed,
  • Latif U. Khan,
  • Abdullah Yousafzai,
  • Zhu Han,
  • Bang Ju Park,
  • Choong Seon Hong

DOI
https://doi.org/10.1109/ACCESS.2021.3128622
Journal volume & issue
Vol. 9
pp. 155634 – 155650

Abstract

Read online

Federated Learning (FL) relies on on-device training to avoid the migration of devices’ data to a centralized server to address privacy leakage. Moreover, FL is feasible for scenarios (e.g., autonomous cars) where an enormous amount of data is generated every day. Transferring only local model updates in the case of FL is highly communication-efficient compared to transferring all data in the case of centralized machine learning (ML). Although FL offers many advantages, it also has some challenges. A malicious aggregation server can infer device information via local model updates. Another downside of FL is the centralized aggregation server that can malfunction due to an attack or physical damage. To address these issues, we propose a novel Structured Transparency empowered cross-silo Federated Learning on the Blockchain (ST-BFL) framework. In ST-BFL, homomorphic encryption, FL-aggregators, FL-verifiers, and smart contract are employed, which satisfy various structured transparency components, such as input privacy, output privacy, output verification, and flow governance. We present the framework architecture, algorithms, and sequence diagram of our ST-BFL framework to show how different entities interact in ST-BFL for the FL process. We also present a simplified class diagram of ST-BFL’s smart contract for an FL task. Finally, we perform a simulation to analyze our framework from the perspective of aggregation time, accuracy, and storage size. The qualitative and quantitative evaluation shows that ST-BFL has the same accuracy as traditional FL. However, ST-BFL provides input privacy, output privacy, input verification, output verification, and flow governance at the expense of relatively higher computation and communication costs than traditional FL.

Keywords