Marginal Deep Architecture: Stacking Feature Learning Modules to Build Deep Learning Models

Guoqiang Zhong; Kang Zhang; Hongxu Wei; Yuchen Zheng; Junyu Dong

doi:10.1109/ACCESS.2019.2902631

IEEE Access (Jan 2019)

Affiliations

Guoqiang Zhong: Department of Computer Science and Technology, Ocean University of China, Qingdao, China
Kang Zhang: Department of Computer Science and Technology, Ocean University of China, Qingdao, China
Hongxu Wei: Department of Computer Science and Technology, Ocean University of China, Qingdao, China
Yuchen Zheng: Department of Advanced Information Technology, Kyushu University, Fukuoka, Japan
Junyu Dong: Department of Computer Science and Technology, Ocean University of China, Qingdao, China

DOI: https://doi.org/10.1109/ACCESS.2019.2902631
Journal volume & issue: Vol. 7, pp. 30220–30233

Abstract

Recently, many deep models have been proposed in different fields, such as image classification, object detection, and speech recognition. However, most of these architectures require a large amount of training data and rely on random initialization. In this paper, we propose stacking feature learning modules to design deep architectures. Specifically, marginal Fisher analysis (MFA) is stacked layer by layer to initialize the network, and we call the resulting deep architecture the marginal deep architecture (MDA). When implementing MDA, the MFA weight matrices are updated layer by layer; this is a supervised pre-training method that does not need large-scale data. In addition, several deep learning techniques, such as backpropagation, dropout, and denoising, are applied to fine-tune the model. We compare MDA with several feature learning and deep learning models on practical applications, including handwritten digit recognition, speech recognition, historical document understanding, and action recognition. Extensive experiments show that MDA outperforms both shallow feature learning models and related deep learning models on these tasks.
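
The layer-by-layer initialization described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `mfa_projection` and `init_mda`, the simplified per-sample neighbour graphs, the `tanh` nonlinearity between layers, and the regularisation term `reg` are all assumptions made for illustration. The fine-tuning stage (backpropagation, dropout, denoising) is omitted.

```python
import numpy as np

def mfa_projection(X, y, out_dim, k1=3, k2=5, reg=1e-3):
    """Simplified marginal Fisher analysis (illustrative, not the paper's code).

    X: (n, d) samples as rows; y: (n,) integer labels.
    Returns a (d, out_dim) projection matrix.
    """
    n, d = X.shape
    if out_dim > d:
        raise ValueError("out_dim must not exceed the input dimension")
    # Pairwise squared Euclidean distances.
    sq = np.sum(X ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    same = y[:, None] == y[None, :]

    W_in = np.zeros((n, n))   # intrinsic graph: nearest same-class neighbours
    W_pen = np.zeros((n, n))  # penalty graph: nearest other-class neighbours
    for i in range(n):
        d_same = np.where(same[i], dist[i], np.inf)
        d_same[i] = np.inf  # exclude the sample itself
        d_diff = np.where(same[i], np.inf, dist[i])
        for j in np.argsort(d_same)[:k1]:
            if np.isfinite(d_same[j]):
                W_in[i, j] = W_in[j, i] = 1.0
        for j in np.argsort(d_diff)[:k2]:
            if np.isfinite(d_diff[j]):
                W_pen[i, j] = W_pen[j, i] = 1.0

    # Graph Laplacians and the two scatter matrices.
    L_in = np.diag(W_in.sum(axis=1)) - W_in
    L_pen = np.diag(W_pen.sum(axis=1)) - W_pen
    S_c = X.T @ L_in @ X                     # intraclass compactness
    S_p = X.T @ L_pen @ X + reg * np.eye(d)  # interclass separability (regularised)

    # Minimise w^T S_c w / w^T S_p w, i.e. solve S_c w = lambda S_p w.
    # With S_p = C C^T, this becomes a symmetric eigenproblem in u = C^T w.
    C_inv = np.linalg.inv(np.linalg.cholesky(S_p))
    M = C_inv @ S_c @ C_inv.T
    M = (M + M.T) / 2.0  # symmetrise against round-off
    _, vecs = np.linalg.eigh(M)  # eigenvalues in ascending order
    return C_inv.T @ vecs[:, :out_dim]

def init_mda(X, y, layer_dims, **mfa_kwargs):
    """Stack MFA projections layer by layer (assumed tanh between layers).

    Returns the list of weight matrices and the top-layer representation.
    """
    weights, H = [], X
    for dim in layer_dims:
        W = mfa_projection(H, y, dim, **mfa_kwargs)
        weights.append(W)
        H = np.tanh(H @ W)  # assumed nonlinearity
    return weights, H
```

Calling `init_mda(X, y, [6, 3])` on labelled data with ten input features would return two weight matrices of shapes `(10, 6)` and `(6, 3)`, which could then serve as the starting point for supervised fine-tuning in place of random initialization.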

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
