Loading [MathJax]/extensions/MathZoom.js
Continual Learning for Object Classification: Integrating AutoML for Binary Classification Tasks Within a Modular Dynamic Architecture | IEEE Journals & Magazine | IEEE Xplore

Continual Learning for Object Classification: Integrating AutoML for Binary Classification Tasks Within a Modular Dynamic Architecture


Graphical representation of the MDNN2.0 tree structure showcasing a hybrid approach using SVMs, LRs, and NNs selected via AutoML. The architecture optimizes classificatio...

Abstract:

For humans it is quite easy to identify a new object after learning to identify existing ones, but not for a machine. Deep neural networks (DNN) are the foundation of the...Show More

Abstract:

For humans it is quite easy to identify a new object after learning to identify existing ones, but not for a machine. Deep neural networks (DNN) are the foundation of the current state-of-the-art methods for training machines to recognize sets of objects. The issue is that any modification to the DNN weights that were trained to classify an initial set of objects has the potential to seriously impair the network’s ability to make those initial classifications; this behaviour is referred to as catastrophic forgetting (CF). This paper presents a continual learning (CL) architecture that can deal with CF. The architecture is composed of two primary parts: (i) The feature extraction component, which is based on the ResNet50 backbone and (ii) the modular dynamic classification (MDC) component. The latter is made up of multiple sub-networks that gradually assemble themselves into a tree-like structure that reorganizes itself as it learns over time, so that each sub-network can operate independently. The MDC relies heavily on binary classification, and here the application of automated machine learning (AutoML) was introduced, where each binary classifier is tailored on-the-fly, and is/can be different from object to object. The strategy involves a calculated selection from a predefined list of model types and parameters, optimizing them for their respective tasks. Results demonstrate that we advanced the adaptability and performance of the network, emphasizing the transformative potential of AutoML in modular CL approaches. Tests on the CORe50 dataset showed accuracy results of 81.1%, which are above the state of the art for CL architectures.
Graphical representation of the MDNN2.0 tree structure showcasing a hybrid approach using SVMs, LRs, and NNs selected via AutoML. The architecture optimizes classificatio...
Published in: IEEE Access ( Volume: 12)
Page(s): 183725 - 183742
Date of Publication: 02 December 2024
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.