Universal Image Embedding: Retaining and Expanding Knowledge With Multi-Domain Fine-Tuning

Socratis Gkelios; Anestis Kastellos; Yiannis S. Boutalis; Savvas A. Chatzichristofis

doi:10.1109/ACCESS.2023.3267804

IEEE Access (Jan 2023)

Universal Image Embedding: Retaining and Expanding Knowledge With Multi-Domain Fine-Tuning

Socratis Gkelios,
Anestis Kastellos,
Yiannis S. Boutalis,
Savvas A. Chatzichristofis

Affiliations

Socratis Gkelios: Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Kimmeria, Greece
Anestis Kastellos: Department of Computer Science, Intelligent Systems Laboratory, Neapolis University Pafos, Paphos, Cyprus
Yiannis S. Boutalis: ORCiD; Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Kimmeria, Greece
Savvas A. Chatzichristofis: ORCiD; Department of Computer Science, Intelligent Systems Laboratory, Neapolis University Pafos, Paphos, Cyprus

DOI: https://doi.org/10.1109/ACCESS.2023.3267804
Journal volume & issue: Vol. 11
pp. 38208 – 38217

Abstract

Read online

The overall purpose of this study is to propose a novel fine-tuning method for the CLIP architecture that enables the retention of pre-existing knowledge from large datasets and the creation of a domain-agnostic image encoder for universal image embedding, addressing the challenge of transferring knowledge from source to target tasks using deep learning models. The basic design of the study involves applying the proposed method directly (without fine-tuning) to a wide range of instance retrieval and recognition tasks to evaluate its effectiveness. The study’s major findings indicate that the proposed method significantly enhances performance on unseen domains without requiring separate fine-tuning for each domain. The authors’ success in the Google Universal Image Embedding competition, where they were awarded a Gold medal out of 1200 teams, inspired their proposed method. These results have significant implications for real-life applications where multiple domains are common. In conclusion, the study offers a practical solution for transfer learning that addresses the challenges of dealing with multiple domains and advances deep learning, potentially inspiring further research in this area and driving progress in the field.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords