Continual Learning for Object Classification: Integrating AutoML for Binary Classification Tasks Within a Modular Dynamic Architecture

Daniel Turner; Pedro J. S. Cardoso; Joao M. F. Rodrigues

doi:10.1109/ACCESS.2024.3510536

IEEE Access (Jan 2024)

Continual Learning for Object Classification: Integrating AutoML for Binary Classification Tasks Within a Modular Dynamic Architecture

Daniel Turner,
Pedro J. S. Cardoso,
Joao M. F. Rodrigues

Affiliations

Daniel Turner: ORCiD; NOVA LINCS, FCT, ISE, Universidade do Algarve, Faro, Portugal
Pedro J. S. Cardoso: ORCiD; NOVA LINCS, ISE, Universidade do Algarve, Faro, Portugal
Joao M. F. Rodrigues: ORCiD; NOVA LINCS, ISE, Universidade do Algarve, Faro, Portugal

DOI: https://doi.org/10.1109/ACCESS.2024.3510536
Journal volume & issue: Vol. 12
pp. 183725 – 183742

Abstract

Read online

For humans it is quite easy to identify a new object after learning to identify existing ones, but not for a machine. Deep neural networks (DNN) are the foundation of the current state-of-the-art methods for training machines to recognize sets of objects. The issue is that any modification to the DNN weights that were trained to classify an initial set of objects has the potential to seriously impair the network’s ability to make those initial classifications; this behaviour is referred to as catastrophic forgetting (CF). This paper presents a continual learning (CL) architecture that can deal with CF. The architecture is composed of two primary parts: (i) The feature extraction component, which is based on the ResNet50 backbone and (ii) the modular dynamic classification (MDC) component. The latter is made up of multiple sub-networks that gradually assemble themselves into a tree-like structure that reorganizes itself as it learns over time, so that each sub-network can operate independently. The MDC relies heavily on binary classification, and here the application of automated machine learning (AutoML) was introduced, where each binary classifier is tailored on-the-fly, and is/can be different from object to object. The strategy involves a calculated selection from a predefined list of model types and parameters, optimizing them for their respective tasks. Results demonstrate that we advanced the adaptability and performance of the network, emphasizing the transformative potential of AutoML in modular CL approaches. Tests on the CORe50 dataset showed accuracy results of 81.1%, which are above the state of the art for CL architectures.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords