Abstract:
Concept drift is an important characteristic and inevitable difficult problem in streaming data mining. Ensemble learning is commonly used to deal with concept drift. How...Show MoreMetadata
Abstract:
Concept drift is an important characteristic and inevitable difficult problem in streaming data mining. Ensemble learning is commonly used to deal with concept drift. However, most ensemble methods cannot balance the accuracy and diversity of base learners after drift occurs, and cannot adjust adaptively according to the drift type. To solve these problems, this paper proposes a targeted ensemble learning (Targeted EL) method to improve the accuracy and diversity of ensemble learning for streaming data with abrupt and gradual concept drift. First, to improve the accuracy of the base learners, the method adopts different sample weighting strategies for different types of drift to realize bidirectional transfer of new and old distributed samples. Second, the difference matrix is constructed by the prediction results of the base learners on the current samples. According to the drift type, the submatrix with appropriate size and maximum difference sum is extracted adaptively to select appropriate, accuracy and diverse base learners for ensemble. The experimental results show that the proposed method can achieve good generalization performance when dealing with the streaming data with abrupt and gradual concept drift.
Published in: IEEE Transactions on Knowledge and Data Engineering ( Volume: 36, Issue: 12, December 2024)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Dynamical ,
- Data Streams ,
- Concept Drift ,
- Sampling Weights ,
- Data Mining ,
- Current Sample ,
- Generalization Performance ,
- Ensemble Method ,
- Submatrix ,
- Base Learners ,
- Ensemble Learning Method ,
- Learning Accuracy ,
- Bidirectional Transfer ,
- Model Performance ,
- Accuracy Of Model ,
- Sample Distribution ,
- Effective Learning ,
- Transfer Learning ,
- Real-time Performance ,
- Ensemble Model ,
- Source Domain ,
- Target Domain ,
- Ensemble Diversity ,
- Source Domain Samples ,
- Adaptive Learning ,
- Positive Transfer ,
- Data Block ,
- Ensemble Size ,
- Historical Samples ,
- Number Of Learners
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Dynamical ,
- Data Streams ,
- Concept Drift ,
- Sampling Weights ,
- Data Mining ,
- Current Sample ,
- Generalization Performance ,
- Ensemble Method ,
- Submatrix ,
- Base Learners ,
- Ensemble Learning Method ,
- Learning Accuracy ,
- Bidirectional Transfer ,
- Model Performance ,
- Accuracy Of Model ,
- Sample Distribution ,
- Effective Learning ,
- Transfer Learning ,
- Real-time Performance ,
- Ensemble Model ,
- Source Domain ,
- Target Domain ,
- Ensemble Diversity ,
- Source Domain Samples ,
- Adaptive Learning ,
- Positive Transfer ,
- Data Block ,
- Ensemble Size ,
- Historical Samples ,
- Number Of Learners
- Author Keywords