CB-HVT Net: A Channel-Boosted Hybrid Vision Transformer Network for Lymphocyte Detection in Histopathological Images

Momina Liaqat Ali; Zunaira Rauf; Asifullah Khan; Anabia Sohail; Rafi Ullah; Jeonghwan Gwak

doi:10.1109/access.2023.3324383

IEEE Access (Jan 2023)

CB-HVT Net: A Channel-Boosted Hybrid Vision Transformer Network for Lymphocyte Detection in Histopathological Images

Momina Liaqat Ali,
Zunaira Rauf,
Asifullah Khan,
Anabia Sohail,
Rafi Ullah,
Jeonghwan Gwak

Affiliations

Momina Liaqat Ali: ORCiD; Department of Computer and Information Sciences, Pattern Recognition Laboratory, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan
Zunaira Rauf: PIEAS Artificial Intelligence Center (PAIC), Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan
Asifullah Khan: ORCiD; Center for Mathematical Sciences, Pakistan Institute of Engineering and Applied Sciences, Nilore, Islamabad, Pakistan
Anabia Sohail: Department of Electrical Engineering and Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
Rafi Ullah: Department of Computer and Information Sciences, Universiti Teknologi PETRONAS (UTP), Seri Iskandar, Perak, Malaysia
Jeonghwan Gwak: ORCiD; Department of Software, Korea National University of Transportation, Chungju, Republic of Korea

DOI: https://doi.org/10.1109/access.2023.3324383
Journal volume & issue: Vol. 11
pp. 115740 – 115750

Abstract

Read online

Detection of Tumor-Infiltrating Lymphocytes (TILs) has a high prognostic value in cancer diagnosis due to their ability to identify and kill cancer cells. However, this task is non-trivial due to their diverse morphology, overlapping boundaries, and presence of artifacts. Vision Transformers (ViTs) have the ability to capture long-range relationships, but they lack local correlation in the images and require large training datasets. In this work, we propose a Channel Boosted Hybrid Vision Transformer (CB-HVT) to detect lymphocytes in histopathological images. The proposed network constitutes: 1) channel generation module; 2) channel exploitation module; 3) channel merging module; 4) region-aware module; and 5) detection and segmentation head. The proposed CB-HVT exploits the learning capacity of both CNN and ViT-based architectures to capture lymphocytic diverse morphology. In addition, we developed a feature fusion block to systematically and gradually merge the diverse feature maps to improve the learning capability of the network. The attention mechanism in the fusion block retains the most contributing features. We evaluated the effectiveness of the proposed CB-HVT on two publicly available datasets for lymphocyte detection in histopathological images. The proposed network showed good results as compared to the existing architectures in terms of F-Score (LYSTO: 0.88 and NuClick: 0.82). In addition, the performance of the proposed CB-HVT on an unseen test set reveals its significance as a valuable tool for pathologists for real-time lymphocyte detection.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords