I. Introduction
The growth of information technology and advancements in artificial intelligence (AI) have dramatically increased data creation and usage. In AI systems, algorithms and data are two crucial elements. As a result, AI research can be divided into two categories: model-centric AI and data-centric AI [1]. Model-centric AI focuses on improving the performance of individual models, while data-centric AI focuses on enhancing the quality of data for machine learning (ML) tasks [2]. As a traditional approach, model-centric AI focuses on using the same data and making changes to model hyper-parameters, architectures, and other configurations. The data-centric approach, on the other hand, prioritizes improving existing data or incorporating new data, then training and evaluating ML algorithms. Overall, data-centric AI can generate new synthetic data, augment existing data, and balance class distributions, helping to overcome the limitations of limited data and improve model performance [3].