A Systematic Literature Review of Deep Learning Approaches for Sketch-Based Image Retrieval: Datasets, Metrics, and Future Directions

Fan Yang; Nor Azman Ismail; Yee Yong Pang; Victor R. Kebande; Arafat Al-Dhaqm; Tieng Wei Koh

doi:10.1109/ACCESS.2024.3357939

IEEE Access (Jan 2024)

Affiliations

Fan Yang: Faculty of Computing, Universiti Teknologi Malaysia (UTM), Skudai, Johor, Malaysia
Nor Azman Ismail: Faculty of Computing, Universiti Teknologi Malaysia (UTM), Skudai, Johor, Malaysia
Yee Yong Pang: Faculty of Computing, Universiti Teknologi Malaysia (UTM), Skudai, Johor, Malaysia
Victor R. Kebande: Department of Computer Science (DIDA), Blekinge Institute of Technology, Karlskrona, Sweden
Arafat Al-Dhaqm: Computer and Information Sciences Department, Universiti Teknologi PETRONAS, Bandar Seri Iskandar, Perak, Malaysia
Tieng Wei Koh: Computer and Information Sciences Department, Universiti Teknologi PETRONAS, Bandar Seri Iskandar, Perak, Malaysia

DOI: https://doi.org/10.1109/ACCESS.2024.3357939
Journal volume & issue: Vol. 12
pp. 14847 – 14869

Abstract

Sketch-based image retrieval (SBIR) uses sketches to search for images containing similar objects or scenes. With the proliferation of touch-screen devices, sketching has become more accessible and has therefore received increasing attention. Deep learning has emerged as a potential tool for SBIR, allowing models to automatically extract image features and learn from large amounts of data. To the best of our knowledge, there is currently no systematic literature review (SLR) of SBIR with deep learning. The aim of this review is therefore to bring related works together in a systematic study, highlighting the main contributions of individual researchers over the years, with a focus on past, present, and future trends. To this end, 90 studies from 2016 to June 2023 were collected from four databases and analyzed using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) framework. The specific models, datasets, evaluation metrics, and applications of deep learning in SBIR are discussed in detail. This study found that Convolutional Neural Networks (CNN) and Generative Adversarial Networks (GAN) are the most widely used deep learning methods for SBIR. A commonly used dataset is Sketchy, especially in the latest zero-shot sketch-based image retrieval (ZS-SBIR) task. The results show that Mean Average Precision (mAP) is the most commonly used metric for quantitative evaluation of SBIR. Finally, we provide some future directions and guidance for researchers based on the results of this review.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
