HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units

Kai Zou; Simeng Wang; Ziqian Wang; Zhihai Zhang; Fan Yang; Fan Yang

doi:10.3389/fmolb.2023.1171429

Frontiers in Molecular Biosciences (Aug 2023)

HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units

Kai Zou,
Simeng Wang,
Ziqian Wang,
Zhihai Zhang,
Fan Yang,
Fan Yang

Affiliations

Kai Zou: School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
Simeng Wang: School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
Ziqian Wang: School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
Zhihai Zhang: School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
Fan Yang: School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang, China
Fan Yang: Artificial Intelligence and Bioinformation Cognition Laboratory, Jiangxi Science and Technology Normal University, Nanchang, China

DOI: https://doi.org/10.3389/fmolb.2023.1171429
Journal volume & issue: Vol. 10

Abstract

Read online

Introduction: Proteins located in subcellular compartments have played an indispensable role in the physiological function of eukaryotic organisms. The pattern of protein subcellular localization is conducive to understanding the mechanism and function of proteins, contributing to investigating pathological changes of cells, and providing technical support for targeted drug research on human diseases. Automated systems based on featurization or representation learning and classifier design have attracted interest in predicting the subcellular location of proteins due to a considerable rise in proteins. However, large-scale, fine-grained protein microscopic images are prone to trapping and losing feature information in the general deep learning models, and the shallow features derived from statistical methods have weak supervision abilities.Methods: In this work, a novel model called HAR_Locator was developed to predict the subcellular location of proteins by concatenating multi-view abstract features and shallow features, whose advanced advantages are summarized in the following three protocols. Firstly, to get discriminative abstract feature information on protein subcellular location, an abstract feature extractor called HARnet based on Hybrid Attention modules and Residual units was proposed to relieve gradient dispersion and focus on protein-target regions. Secondly, it not only improves the supervision ability of image information but also enhances the generalization ability of the HAR_Locator through concatenating abstract features and shallow features. Finally, a multi-category multi-classifier decision system based on an Artificial Neural Network (ANN) was introduced to obtain the final output results of samples by fitting the most representative result from five subset predictors.Results: To evaluate the model, a collection of 6,778 immunohistochemistry (IHC) images from the Human Protein Atlas (HPA) database was used to present experimental results, and the accuracy, precision, and recall evaluation indicators were significantly increased to 84.73%, 84.77%, and 84.70%, respectively, compared with baseline predictors.

Published in Frontiers in Molecular Biosciences

ISSN: 2296-889X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Biology (General)
Website: https://www.frontiersin.org/journals/molecular-biosciences

About the journal

Abstract

Keywords