Abstract:
Water is a primary need and the quality of drinking water greatly affects human life. The purpose of this research is to classify the quality or feasibility of drinking w...Show MoreMetadata
Abstract:
Water is a primary need and the quality of drinking water greatly affects human life. The purpose of this research is to classify the quality or feasibility of drinking water using machine learning. The use of a single k-Nearest Neighbors (k-NN) algorithm often results in low accuracy due to many factors, so in this study SMOTE (Synthetic Minority Oversampling Technique) is used to handle imbalanced data and k-NN is used as an estimator in the Bagging technique for classify. This research will compare the performance of the model in classifying when only using the k-NN algorithm, k-NN with SMOTE and SMOTE integrated with k-NN as a Bagging estimator (SMOTE + k-NN as Bagging estimator) and to evaluate the performance results, researcher use accuracy, precision, recall, f1-score and ROC-AVC. In this research of classifying the quality or feasibility of drinking water, the single k-NN algorithm only produces an average AVC of 0.75 while the SMOTE +\mathrm{k}-\text{NN} as Bagging estimator produces an average AVC of 0.958. In this case study, the our proposed model is able to increase the AVC by 20.8% so it can be said that the out proposed model is able to improve the performance of the model in distinguishing classes on the target.
Date of Conference: 10-12 October 2023
Date Added to IEEE Xplore: 28 December 2023
ISBN Information: