Loading [MathJax]/extensions/MathMenu.js
A Survey on Alleviating the Naive Bayes Conditional Independence Assumption | IEEE Conference Publication | IEEE Xplore

A Survey on Alleviating the Naive Bayes Conditional Independence Assumption


Abstract:

Naive Bayes (NB) is a simple and effective probabilistic classification method based on the Bayes theorem. NB method is used in a wide variety of real-world applications ...Show More

Abstract:

Naive Bayes (NB) is a simple and effective probabilistic classification method based on the Bayes theorem. NB method is used in a wide variety of real-world applications due to its effectiveness and ease of implementation. These applications include product recommendations, medical domain, credit scoring, sentiment analysis, spam filtering, etc. One of the keys to NB's efficacy is a stringent stipulation that the dataset's attributes are all treated as equal and conditionally independent. But in real-time, the NB assumption about the dataset cannot be satisfied, since the dataset captured has a high possibility of holding correlated, irrelevant, and uncertain variables. The use of these datasets does not satisfy the NB assumption, and results in poor prediction of the model. Many researchers have come up with different ways to solve the problem of the conditional independence presumption. The proposed method can be broken down into four main parts: Local Learning, Data Expansion, Feature Selection and Weighting, and Structure Extension. The goal of this study is to carry out an in-depth survey on the various methodologies that have been suggested as a means of mitigating the conditional independence presumption and determining how the performance of NB can be improved.
Date of Conference: 24-26 November 2022
Date Added to IEEE Xplore: 16 January 2023
ISBN Information:
Conference Location: Trichy, India

I. Introduction

The advancement of technology, sensors, and the internet paved the way for the generation of massive amounts of data in real-time. The generated data are in high volume and high dimensionality, which means that traditional computation methodology is not feasible because the data generation is in high volume. To process such a large volume of data, an efficient application that can provide insight into the data in an efficient manner is required. To conduct an efficient analysis of the data, Machine Learning (ML) algorithms are effective, since this algorithm is efficient in processing high volume data in efficient time. ML approaches are broadly classified into 4 types: 1. Supervised, 2. Unsupervised, 3. Semi-Supervised and 4. Reinforcement Learning.

Contact IEEE to Subscribe

References

References is not available for this document.