PBVit: A Patch-Based Vision Transformer for Enhanced Brain Tumor Detection

Pratikkumar Chauhan; Munindra Lunagaria; Deepak Kumar Verma; Krunal Vaghela; Ghanshyam G. Tejani; Sunil Kumar Sharma; Ahmad Raza Khan

doi:10.1109/ACCESS.2024.3521002

IEEE Access (Jan 2025)

Affiliations

Pratikkumar Chauhan: Department of Computer Engineering, Marwadi University, Rajkot, Gujarat, India
Munindra Lunagaria: Department of Computer Engineering, Marwadi University, Rajkot, Gujarat, India
Deepak Kumar Verma: Department of Computer Engineering, Marwadi University, Rajkot, Gujarat, India
Krunal Vaghela: Department of Computer Engineering, Marwadi University, Rajkot, Gujarat, India
Ghanshyam G. Tejani: Department of Industrial Engineering and Management, Yuan Ze University, Taoyuan, Taiwan
Sunil Kumar Sharma: Department of Information System, College of Computer and Information Sciences, Majmaah University, Majmaah, Saudi Arabia
Ahmad Raza Khan: Department of Information Technology, College of Computer and Information Sciences, Majmaah University, Majmaah, Saudi Arabia

DOI: https://doi.org/10.1109/ACCESS.2024.3521002
Journal volume & issue: Vol. 13
pp. 13015 – 13029

Abstract

Brain tumors have a significant impact on human health and are classified into three primary types: glioma, meningioma, and pituitary tumors. Early detection and accurate classification are vital for effective diagnosis and for lowering healthcare costs. We present a novel brain tumor detection framework, the Patch-Based Vision Transformer (PBVit). PBVit adopts a patch-based approach in which input tumor images are divided into fixed-size patches, with each patch treated as a token. These image patches are linearly projected into lower-dimensional token embeddings, and positional encodings are added to help the model understand spatial relationships within the image. PBVit enhances the detection of intricate patterns and anomalies in brain scans, improving diagnostic accuracy. We trained PBVit on the Figshare brain tumor dataset and observed notable performance improvements over traditional CNN-based models. PBVit reached an accuracy of 95.8%, a precision of 95.3%, a recall of 93.2%, and an F1-score of 92%, indicating its robustness in identifying brain tumors. These promising results demonstrate that PBVit can play an important role in facilitating early-stage diagnosis, reducing unnecessary biopsies, and ultimately enhancing patient care, while also showcasing the potential of transformer-based architectures in medical imaging.
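
The patch tokenization described in the abstract (fixed-size patches, a linear projection to lower-dimensional embeddings, and added positional encodings) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the 224x224 single-channel input, the patch size of 16, the embedding dimension of 64, the randomly initialised weights, and the function names `patchify` and `embed_patches` are all assumptions chosen for illustration.

```python
import numpy as np


def patchify(image, patch_size):
    """Split an (H, W, C) image into flattened, non-overlapping patches."""
    h, w, c = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")
    rows, cols = h // patch_size, w // patch_size
    # (rows, patch, cols, patch, C) -> (rows, cols, patch, patch, C)
    patches = image.reshape(rows, patch_size, cols, patch_size, c)
    patches = patches.transpose(0, 2, 1, 3, 4)
    # One row per patch, ordered left to right, top to bottom.
    return patches.reshape(rows * cols, patch_size * patch_size * c)


def embed_patches(image, patch_size, weight, pos_embed):
    """Linearly project each flattened patch and add a positional encoding."""
    tokens = patchify(image, patch_size) @ weight
    return tokens + pos_embed


# Hypothetical sizes and random parameters, for illustration only.
rng = np.random.default_rng(0)
image = rng.random((224, 224, 1))            # stand-in for one MRI slice
patch_size, embed_dim = 16, 64               # 256 pixel values -> 64 dims
num_patches = (224 // patch_size) ** 2       # 14 * 14 = 196 tokens
weight = rng.normal(scale=0.02, size=(patch_size * patch_size, embed_dim))
pos_embed = rng.normal(scale=0.02, size=(num_patches, embed_dim))

tokens = embed_patches(image, patch_size, weight, pos_embed)
print(tokens.shape)  # (196, 64)
```

In a trained model the projection weights and positional encodings would be learned rather than sampled, and the resulting token sequence would be passed to the transformer encoder.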

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
