Logistic Regression and Random Forest Comparison in Predicting Students’ Qualification Based on Students’ Half-Semester Performance | IEEE Conference Publication | IEEE Xplore

Logistic Regression and Random Forest Comparison in Predicting Students’ Qualification Based on Students’ Half-Semester Performance


Abstract:

The main aim of this paper is to utilize machine learning algorithms for predicting and identifying students who are more likely to fail in university examinations at an ...Show More

Abstract:

The main aim of this paper is to utilize machine learning algorithms for predicting and identifying students who are more likely to fail in university examinations at an early stage, based on their performance during the learning process. The objective is to prevent the risk of failing. This paper compares and reports the use of two machine learning algorithms: Logistic Regression (LR) and Random Forest (RF) for classifying students' success and failure, with the major goal being to achieve better prediction accuracy of the classifier. After conducting experiments using the Laboratory Operations Dataset, which includes valuable features like student attendance, subject difficulty level, and mid-score, the results show that Logistic Regression outperforms Random Forest with an accuracy of 70.8%, precision of 40.4%, and recall of 97.0%, whereas Random Forest achieves an accuracy of 74.7%, precision of 59.8%, and recall of 82.7%. Furthermore, the evaluation metric using the "Not Passed" class recall is applied as the primary focus to evaluate the certainty in predicting students who will fail in their learning process.
Date of Conference: 23-24 August 2023
Date Added to IEEE Xplore: 29 September 2023
ISBN Information:
Conference Location: Melaka, Malaysia

I. Introduction

The success of students in every learning process not only comes from the efforts of the students themselves, but also from all stakeholders involved in the education sector, including teachers, who bear the responsibility and make significant contributions [1]. One of the main obstacles that teachers encounter is the task of identifying students who are more likely to fail the final exam and provide personalized support to improve their chances of passing successfully. Early identification of at-risk students becomes vital for taking preventive actions to avoid the risk of failure [2]. Encouraging students and providing supportive feedback in the early stages can help reduce this risk. Universities should leverage student data and information to predict academic success.

Contact IEEE to Subscribe

References

References is not available for this document.