Weight asynchronous update: Improving the diversity of filters in a deep convolutional network

Dejun Zhang; Linchao He; Mengting Luo; Zhanya Xu; Fazhi He

doi:10.1007/s41095-020-0185-5

Computational Visual Media (Oct 2020)

Affiliations

Dejun Zhang: School of Geography and Information Engineering, China University of Geosciences
Linchao He: College of Information and Engineering, Sichuan Agricultural University
Mengting Luo: College of Information and Engineering, Sichuan Agricultural University
Zhanya Xu: School of Geography and Information Engineering, China University of Geosciences
Fazhi He: School of Computer, Wuhan University

Journal volume & issue: Vol. 6, no. 4
pp. 455 – 466

Abstract

Deep convolutional networks have obtained remarkable achievements on various visual tasks due to their strong ability to learn a variety of features. A well-trained deep convolutional network can be compressed to 20%–40% of its original size by removing filters that make little contribution, as many overlapping features are generated by redundant filters. Model compression can reduce the number of unnecessary filters but does not take advantage of redundant filters, since the training phase is not affected. Modern networks with residual connections, dense connections, and inception blocks are considered able to mitigate the overlap in convolutional filters, but do not necessarily overcome the issue. To do so, we propose a new training strategy, weight asynchronous update, which helps to significantly increase the diversity of filters and enhance the representation ability of the network. The proposed method can be widely applied to different convolutional networks without changing the network topology. Our experiments show that updating a stochastic subset of filters in different iterations can significantly reduce filter overlap in convolutional networks. Extensive experiments show that our method yields noteworthy improvements in neural network performance.
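
The abstract's core idea, updating only a stochastic subset of filters in each iteration, can be illustrated with a small sketch. This is one possible reading of that sentence and not the authors' published algorithm: the function name `async_update`, the `update_fraction` hyperparameter, the plain SGD step, and the representation of filters as lists of floats are all assumptions made for illustration.

```python
import random


def async_update(filters, grads, lr, update_fraction, rng):
    """Apply a plain SGD step to a random subset of filters only.

    filters, grads: lists of equal-length lists of floats, one inner list
    per filter. update_fraction is a hypothetical hyperparameter giving the
    share of filters updated in this iteration; the rest stay frozen.
    Returns the new filters and the sorted indices that were updated.
    """
    n = len(filters)
    k = max(1, round(update_fraction * n))
    chosen = set(rng.sample(range(n), k))  # subset drawn anew each call
    new_filters = []
    for i, (w, g) in enumerate(zip(filters, grads)):
        if i in chosen:
            new_filters.append([wi - lr * gi for wi, gi in zip(w, g)])
        else:
            new_filters.append(list(w))  # frozen in this iteration
    return new_filters, sorted(chosen)


# Toy usage: four two-weight "filters" with identical gradients.
rng = random.Random(0)
filters = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0], [4.0, 4.0]]
grads = [[1.0, 1.0] for _ in filters]
updated, chosen = async_update(filters, grads, lr=0.1,
                               update_fraction=0.5, rng=rng)
```

Because a fresh subset is drawn on every call, filters that receive identical gradients no longer move in lockstep, which is the intuition behind reduced overlap. How the paper actually selects the subset and schedules updates is not stated in the abstract.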

Published in Computational Visual Media

ISSN: 2096-0433 (Print); 2096-0662 (Online)
Publisher: SpringerOpen
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41095
