Abstract:
Bengali News Headline Categorization Using Machine Learning aims to categorize Bengali online news headlines into six distinct categories using Natural Language Processing. The success of machine learning models in Natural Language Processing has recently drawn considerable attention from researchers across application fields. We categorize Bengali news headlines with several machine learning algorithms: Logistic Regression, Random Forest Classifier, Multinomial Naive Bayes, and RBF Support Vector Machine. We also present deep learning models (LSTM, Bi-LSTM, GRU, Bi-GRU, and CNN) and the transformer models Bangla-BERT and XLM-RoBERTa. The primary purpose of this paper is to compare machine learning, deep learning, and transformer methods for Bengali news headline classification. We used 136,811 Bengali news headlines for evaluation, and XLM-RoBERTa reached an accuracy of 86.50% on this dataset.
Date of Conference: 17-19 December 2022
Date Added to IEEE Xplore: 03 March 2023
- IEEE Keywords
- Index Terms
- Machine Learning
- Headlines
- News Headlines
- Logistic Regression
- Deep Learning
- Learning Models
- Learning Algorithms
- Random Forest
- Machine Learning Models
- Deep Learning Models
- Random Forest Classifier
- Accuracy Of Model
- Types Of Models
- Classification Performance
- F1 Score
- Accuracy Scores
- Classification Score
- Amusement
- Language Model
- Class Struggle
- DNN Model
- ML Models
- Maximum Accuracy
- Tokenized
- Sport Classes
- Flat Layer
- Balanced Dataset
- Political News
- Imbalanced Datasets
- Lowest Accuracy