CG-CNN: Self-Supervised Feature Extraction Through Contextual Guidance and Transfer Learning

Olcay Kursun; Ahmad Patooghy; Peyman Poursani; Oleg V. Favorov

doi:10.1109/ACCESS.2024.3484663

IEEE Access (Jan 2024)

CG-CNN: Self-Supervised Feature Extraction Through Contextual Guidance and Transfer Learning

Olcay Kursun,
Ahmad Patooghy,
Peyman Poursani,
Oleg V. Favorov

Affiliations

Olcay Kursun: ORCiD; Department of Computer Science, Auburn University at Montgomery, Montgomery, AL, USA
Ahmad Patooghy: ORCiD; Department of Computer Systems Technology, North Carolina A&T State University, Greensboro, NC, USA
Peyman Poursani: Department of Computer Systems Technology, North Carolina A&T State University, Greensboro, NC, USA
Oleg V. Favorov: ORCiD; Joint Department of Biomedical Engineering, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA

DOI: https://doi.org/10.1109/ACCESS.2024.3484663
Journal volume & issue: Vol. 12
pp. 155851 – 155866

Abstract

Read online

Contextually Guided Convolutional Neural Networks (CG-CNNs) employ self-supervision and contextual information to develop transferable features across diverse domains, including visual, tactile, temporal, and textual data. This work showcases the adaptability of CG-CNNs through applications to various datasets such as Caltech and Brodatz textures, the VibTac-12 tactile dataset, hyperspectral images, and challenges like the XOR problem and text analysis. In text analysis, CG-CNN employs an innovative embedding strategy that utilizes the context of neighboring words for classification, while in visual and signal data, it enhances feature extraction by exploiting spatial information. CG-CNN mimics the context-guided unsupervised learning mechanisms of biological neural networks and it can be trained to learn its features on limited-size datasets. Our experimental results on natural images reveal that CG-CNN outperforms comparable first-layer features of well-known deep networks such as AlexNet, ResNet, and GoogLeNet in terms of transferability and classification accuracy. In text analysis, CG-CNN learns word embeddings that outperform traditional models like Word2Vec in tasks such as the 20 Newsgroups text classification. Furthermore, ongoing development involves training CG-CNN on outputs from another CG-CNN to explore multi-layered architectures, aiming to construct more complex and descriptive features. This scalability and adaptability to various data types underscore the potential of CG-CNN to handle a wide range of applications, making it a promising architecture for tackling diverse data representation challenges.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords