Loading [MathJax]/extensions/MathZoom.js
Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm | IEEE Conference Publication | IEEE Xplore

Text Mining in Healthcare for Disease Classification using Machine Learning Algorithm


Abstract:

The development of information technology and smartphones has caused production of many data around us. In every second million of new data is created in the form of text...Show More

Abstract:

The development of information technology and smartphones has caused production of many data around us. In every second million of new data is created in the form of text, audio, image and even videos. This environment then has triggered big data analytics demand. One of big data that is produced daily is data on the history of healthcare services in hospitals. Important new information can be retrieved through this huge dataset, especially concerning the patient symptoms, drug usage and new diseases report. In this study, text processing technique is applied on text data of patient medical record data from public hospital during 2017 till 2019 regarding the patient symptoms and the disease classification. Naïve Bayes Classifier and Random Forest algorithms are used to classify diseases in medical record data with 19 diseases in preprocessing data. A list of modified Indonesian stop words was used to filter the symptom sentences. The result indicates that the Random Forest classification algorithm can achieve the highest accuracy of around 99.9%, better and more accurate than the Naïve Bayes classification algorithm. This experiment shows that our proposed method provides a robust system and good accuracy for classifying medical record data with many diseases.
Date of Conference: 29-30 September 2021
Date Added to IEEE Xplore: 08 November 2021
ISBN Information:
Conference Location: Surabaya, Indonesia

Contact IEEE to Subscribe

References

References is not available for this document.