Toward Near-Real-Time Training With Semi-Random Deep Neural Networks and Tensor-Train Decomposition

Humza Syed; Ryan Bryla; Uttam Majumder; Dhireesha Kudithipudi

doi:10.1109/JSTARS.2021.3096195

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2021)

Toward Near-Real-Time Training With Semi-Random Deep Neural Networks and Tensor-Train Decomposition

Humza Syed,
Ryan Bryla,
Uttam Majumder,
Dhireesha Kudithipudi

Affiliations

Humza Syed: ORCiD; Neuromorphic AI Lab, Rochester Institute of Technology, Rochester, NY, USA
Ryan Bryla: Neuromorphic AI Lab, Rochester Institute of Technology, Rochester, NY, USA
Uttam Majumder: ORCiD; Air Force Research Lab, Rome, NY, USA
Dhireesha Kudithipudi: ORCiD; Neuromorphic AI Lab, University of Texas at San Antonio, San Antonio, TX, USA

DOI: https://doi.org/10.1109/JSTARS.2021.3096195
Journal volume & issue: Vol. 14
pp. 8171 – 8179

Abstract

Read online

In recent years, deep neural networks have shown to achieve state-of-the-art performance on several classification and prediction tasks. However, these networks demand undesirable lengthy training times coupled with high computational resources (memory, I/O, processing time). In this work, we explore semi-random deep neural networks to achieve near real-time training and less computational resource usage. Although many works enhance the underlying hardware for real-time training, this work focuses on algorithmic optimization. It is shown that random projection networks with additional skipped connectivity and randomly weighted layers can boost the overall network performance while enabling for real-time training. Additionally, a tensor-train decomposition technique is leveraged to further reduce the model complexity of these networks. Our investigation accomplishes the following: 1) Tensor-train decomposition decreases the complexity of random projection networks, 2) compression of the fully connected hidden layer leads to a minimum $\sim40\times$ decrease in memory size, and 3) training under random projection networks can be achieved in near-real time.

Published in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ISSN: 1939-1404 (Print); 2151-1535 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Ocean engineering; Science: Physics: Geophysics. Cosmic physics
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4609443

About the journal

Abstract

Keywords