IEEE Access (Jan 2025)
PBVit: A Patch-Based Vision Transformer for Enhanced Brain Tumor Detection
Abstract
Brain Tumor holds a significant holds in human health, classified into three primary types: glioma, meningioma, and pituitary tumors. Early detection and accurate classification are vital for effective diagnosis and lowering healthcare costs. In PBvit we presents a novel brain tumor detection framework, the Patch Base Vision Transformer (PBVit). PBVit adopts a patch-based approach where input tumor images are divided into fixed-size patches, with each patch treated as a token. These image patches are linearly projected into lower-dimensional token embeddings, and positional encodings are added to help the model understand spatial relationships within the image. PBVit enhances the detection of intricate patterns and anomalies in brain scans, improving diagnostic accuracy. We trained PBVit using the Figshare brain tumor dataset and observed notable performance improvements compared to traditional CNN-based models. The PBVit reached an accuracy of 95.8%, a precision of 95.3%, a recall of 93.2%, and an F1-score of 92%, indicating its robustness in identifying brain tumors. The promising results demonstrate that PBVit can play a important role in facilitating early-stage diagnosis, reducing unnecessary biopsies, and ultimately enhancing patient care, while also showcasing the potential of transformer-based architectures in medical imaging.
Keywords