CAAI Transactions on Intelligence Technology (Oct 2024)

BTSC: Binary tree structure convolution layers for building interpretable decision‐making deep CNN

  • Yuqi Wang,
  • Dawei Dai,
  • Da Liu,
  • Shuyin Xia,
  • Guoyin Wang

DOI
https://doi.org/10.1049/cit2.12328
Journal volume & issue
Vol. 9, no. 5
pp. 1331 – 1345

Abstract

Although deep convolutional neural networks (DCNNs) have achieved great success in the field of computer vision, such models are considered to lack interpretability in decision-making. One of the fundamental issues is that their decision mechanism is regarded as a "black-box" operation. The authors design the binary tree structure convolution (BTSC) module and control the activation level of particular neurons to build an interpretable DCNN model. First, the authors design the BTSC module, in which each parent node generates two independent child layers, and then integrate it into a normal DCNN model. The main advantages of the BTSC are as follows: (1) child nodes of different parent nodes do not interfere with each other; (2) parent and child nodes can inherit knowledge. Second, considering the activation level of neurons, the authors design an information coding objective that guides neural nodes to learn the particular information coding that is expected. Through experiments, the authors verify that: (1) the decisions made by both ResNet and DenseNet models can be explained well by the "decision information flow path" (the decision-path) formed in the BTSC module; (2) the decision-path can reasonably interpret the decision reversal (robustness) mechanism of the DCNN model; (3) the credibility of a decision can be measured by the degree to which the actual decision-path matches the expected one.
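The abstract describes the BTSC module structurally: each parent node spawns two independent child layers, children of different parents share nothing, and each child builds on its parent's output. The paper's code is not reproduced here; the following PyTorch sketch is only a minimal illustration of that tree structure under stated assumptions. The class name `BTSCNode`, the channel sizes, the kernel size, and the `depth` parameter are all hypothetical choices for illustration, not the authors' implementation, and the information coding objective is omitted.

```python
import torch
import torch.nn as nn


class BTSCNode(nn.Module):
    """One node of the binary tree (illustrative sketch, not the paper's code):
    a conv block whose output feeds two independent child nodes."""

    def __init__(self, in_ch: int, out_ch: int, depth: int):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )
        # Each parent generates two independent children down to the leaf depth.
        # Children of different parents have separate parameters and separate
        # inputs, so they do not interfere with each other.
        if depth > 0:
            self.left = BTSCNode(out_ch, out_ch, depth - 1)
            self.right = BTSCNode(out_ch, out_ch, depth - 1)
        else:
            self.left = self.right = None

    def forward(self, x: torch.Tensor) -> list[torch.Tensor]:
        # The child operates on the parent's representation, so parent and
        # child "inherit knowledge" along the branch.
        h = self.conv(x)
        if self.left is None:
            return [h]  # leaf activations; a root-to-leaf branch is one candidate decision-path
        return self.left(h) + self.right(h)


if __name__ == "__main__":
    feats = torch.randn(1, 64, 8, 8)        # e.g. a feature map from a ResNet stage
    tree = BTSCNode(in_ch=64, out_ch=32, depth=2)
    leaves = tree(feats)                    # 2**2 = 4 leaf feature maps
    print(len(leaves), leaves[0].shape)
```

In this reading, tracing which root-to-leaf branch is most strongly activated for an input would yield the "decision-path" the abstract refers to; how the paper selects and supervises that path (the information coding objective) is beyond this sketch.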

Keywords