Attention Score Enhancement Model Through Pairwise Image Comparison

Yeong Seok Ju; Zong Woo Geem; Joon Shik Lim

doi:10.3390/app14219928

Applied Sciences (Oct 2024)

Affiliations

Yeong Seok Ju: Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea
Zong Woo Geem: Department of Smart City, Gachon University, Seongnam 13120, Republic of Korea
Joon Shik Lim: Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea

DOI: https://doi.org/10.3390/app14219928
Journal volume & issue: Vol. 14, no. 21, p. 9928

Abstract

This study proposes the Pairwise Attention Enhancement (PAE) model to address the limitations of the Vision Transformer (ViT). The ViT effectively models global relationships between image patches, but it faces challenges in medical image analysis, where fine-grained local features are crucial: because it represents local features such as color, texture, and edges inadequately, it may underperform. The proposed PAE model enhances local features by calculating the cosine similarity between the attention maps of training and reference images and integrating the attention maps in regions with high similarity. This approach complements the ViT’s ability to capture global context, allowing subtle visual differences to be reflected more accurately. Experiments on Clock Drawing Test data showed that the PAE model achieved a precision of 0.9383, a recall of 0.8916, an F1-score of 0.9133, and an accuracy of 92.69%, a 12% improvement over API-Net and a 1% improvement over the ViT. These results suggest that the PAE model can overcome the limitations of the ViT and improve performance in computer vision fields where local features are crucial.
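
The similarity-gated integration described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how regions are defined, how "high similarity" is decided, or how the maps are combined. The sketch assumes each attention map is an (N, N) array whose rows are per-patch attention distributions, treats each row as a region, and blends rows whose cosine similarity exceeds a threshold. The names `rowwise_cosine_similarity` and `enhance_attention` and the parameters `threshold` and `alpha` are hypothetical.

```python
import numpy as np


def rowwise_cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between matching rows of two (N, N) attention maps."""
    dot = np.sum(a * b, axis=1)
    norms = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1)
    return dot / (norms + eps)  # eps guards against all-zero rows


def enhance_attention(train_attn, ref_attn, threshold=0.8, alpha=0.5):
    """Blend reference attention into training attention where rows agree.

    Assumption: a linear blend with weight `alpha` stands in for the
    "integration" step; the paper may combine the maps differently.
    """
    sim = rowwise_cosine_similarity(train_attn, ref_attn)
    mask = sim >= threshold  # rows (regions) considered highly similar
    enhanced = train_attn.copy()
    enhanced[mask] = (1 - alpha) * train_attn[mask] + alpha * ref_attn[mask]
    return enhanced, sim


if __name__ == "__main__":
    # Toy maps: row 0 points the same way in both, row 1 is orthogonal.
    train = np.array([[1.0, 0.0], [0.0, 1.0]])
    ref = np.array([[2.0, 0.0], [1.0, 0.0]])
    enhanced, sim = enhance_attention(train, ref)
    print("similarity per row:", sim)
    print("enhanced map:\n", enhanced)
```

In this toy case only the first row is blended with the reference map, while the dissimilar second row is left unchanged. A thresholded blend is one simple way to restrict the integration to high-similarity regions; other gating schemes would fit the abstract's description equally well.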

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
