IEEE Access (Jan 2024)

SFT: Few-Shot Learning via Self-Supervised Feature Fusion With Transformer

  • Jit Yan Lim,
  • Kian Ming Lim,
  • Chin Poo Lee,
  • Yong Xuan Tan

DOI
https://doi.org/10.1109/ACCESS.2024.3416327
Journal volume & issue
Vol. 12
pp. 86690–86703

Abstract


The few-shot learning paradigm aims to generalize to unseen tasks with limited samples. However, a focus solely on class-level discrimination may fall short of robust generalization, especially when instance diversity and discriminability are neglected. This study introduces a metric-based few-shot approach, named Self-supervised Feature Fusion with Transformer (SFT), which integrates self-supervised learning with a transformer. SFT addresses the limitations of previous approaches by employing two distinct self-supervised tasks in separate models during pre-training, thus enhancing both instance diversity and discriminability in the feature space. The training process unfolds in two stages: pre-training and transfer learning. In pre-training, each model is trained with its own self-supervised task to harness the benefits of an enhanced feature space. In the subsequent transfer learning stage, the model weights are frozen, acting as feature extractors. The features from both models are amalgamated using a feature fusion technique and transformed into task-specific features by a transformer, boosting discrimination on unseen tasks. The combined features enable the model to learn a well-generalized representation, effectively tackling the challenges posed by few-shot tasks. The proposed SFT method achieves state-of-the-art results on three benchmark datasets in few-shot image classification.
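To make the pipeline described in the abstract concrete, the following PyTorch sketch illustrates the transfer-learning stage: two frozen backbones pre-trained with different self-supervised tasks, a feature-fusion step, a transformer that adapts the fused features to the current episode, and a metric-based classifier over class prototypes. All specifics here (concatenation as the fusion operation, the prototypical-distance head, layer sizes, and class/argument names such as SFTSketch) are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of an SFT-style few-shot episode, assuming backbones that
# return flat feature vectors of size feat_dim. Not the authors' implementation.
import torch
import torch.nn as nn


class SFTSketch(nn.Module):
    def __init__(self, encoder_a: nn.Module, encoder_b: nn.Module,
                 feat_dim: int = 512, n_heads: int = 8, n_layers: int = 1):
        super().__init__()
        # Two backbones pre-trained with different self-supervised tasks;
        # their weights are frozen so they act purely as feature extractors.
        self.encoder_a, self.encoder_b = encoder_a.eval(), encoder_b.eval()
        for p in list(self.encoder_a.parameters()) + list(self.encoder_b.parameters()):
            p.requires_grad = False
        # Feature fusion (assumed here: concatenation followed by a projection).
        self.fuse = nn.Linear(2 * feat_dim, feat_dim)
        # Transformer that turns fused features into task-specific features.
        layer = nn.TransformerEncoderLayer(d_model=feat_dim, nhead=n_heads,
                                           batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=n_layers)

    def embed(self, x: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():
            fa = self.encoder_a(x)                      # (N, feat_dim)
            fb = self.encoder_b(x)                      # (N, feat_dim)
        fused = self.fuse(torch.cat([fa, fb], dim=-1))  # (N, feat_dim)
        # Treat the whole episode as one sequence so every embedding can be
        # conditioned on the rest of the task.
        return self.transformer(fused.unsqueeze(0)).squeeze(0)

    def forward(self, support: torch.Tensor, support_y: torch.Tensor,
                query: torch.Tensor, n_way: int) -> torch.Tensor:
        """Metric-based classification: nearest class prototype in feature space."""
        z = self.embed(torch.cat([support, query], dim=0))
        z_s, z_q = z[: len(support)], z[len(support):]
        # Class prototypes = mean support embedding per class.
        protos = torch.stack([z_s[support_y == c].mean(0) for c in range(n_way)])
        # Negative Euclidean distances serve as logits over the n_way classes.
        return -torch.cdist(z_q, protos)
```

In use, each N-way K-shot episode would pass its support images, support labels, and query images through `forward`, and the query logits would be trained or evaluated with a standard cross-entropy objective; only the fusion layer and the transformer carry trainable parameters in this sketch, mirroring the frozen-extractor design described above.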

Keywords