A FPGA Accelerator of Distributed A3C Algorithm with Optimal Resource Deployment

Fen Ge; Guohui Zhang; Ziyu Li; Fang Zhou

doi:10.1049/2024/7855250

IET Computers & Digital Techniques (Jan 2024)

Affiliations

Fen Ge: College of Integrated Circuits
Guohui Zhang: College of Integrated Circuits
Ziyu Li: College of Integrated Circuits
Fang Zhou: College of Integrated Circuits

Journal volume & issue: Vol. 2024

Abstract

The asynchronous advantage actor-critic (A3C) algorithm is widely regarded as one of the most effective deep reinforcement learning algorithms. However, its distributed and asynchronous nature increases algorithmic complexity and computational requirements, which not only raises training cost but also makes the algorithm harder to deploy on resource-limited field programmable gate array (FPGA) platforms. In addition, accelerator design must carefully account for two problems: the resource wastage caused by the distributed training of A3C, and the resource allocation problem arising from the imbalance between the computational loads of inference and training. In this paper, we introduce a deployment strategy for distributed algorithms that improves the resource utilization of hardware devices. We then construct an FPGA architecture specifically for accelerating the inference and training processes of the A3C algorithm. Experimental results show that the proposed deployment strategy reduces resource consumption by 62.5% and decreases the number of agents waiting for training by 32.2%, and that the proposed A3C accelerator achieves speedups of 1.83× and 2.39× over a CPU (Intel i9-13900K) and a GPU (NVIDIA RTX 4090), respectively, with lower power consumption. Furthermore, our design shows superior resource efficiency compared to existing works.

Published in IET Computers & Digital Techniques

ISSN: 1751-8601 (Print); 1751-861X (Online)
Publisher: Hindawi-IET
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.hindawi.com/journals/ietcdt/
