Assessing Accuracy: A Study of Lexicon and Rule-Based Packages in R and Python for Sentiment Analysis | IEEE Journals & Magazine | IEEE Xplore

Assessing Accuracy: A Study of Lexicon and Rule-Based Packages in R and Python for Sentiment Analysis


The accuracy of sentiment analysis packages in R and Python

Abstract:

Sentiment analysis has become a focal point of interdisciplinary research, prompting the use of diverse methodologies and the continual emergence of programming language ...Show More

Abstract:

Sentiment analysis has become a focal point of interdisciplinary research, prompting the use of diverse methodologies and the continual emergence of programming language packages. Notably, Python and R have introduced comprehensive packages in this realm. In this study, we analyze established packages in these languages, focusing on accuracy while also considering time complexity. Across experiments conducted on seven distinct datasets, a crucial revelation surfaces: the accuracy of these packages significantly varies depending on the dataset used. Among these, the ‘sentimentr’ package consistently performs well across diverse datasets. Generally, Python libraries showcase superior processing speed. However, it’s essential to note that while these packages adeptly classify sentences as positive or negative, capturing sentiment intensity proves challenging. Our findings highlight a prevalent trend of overfitting, where these packages excel on familiar datasets but struggle when faced with unfamiliar ones.
The accuracy of sentiment analysis packages in R and Python
Published in: IEEE Access ( Volume: 12)
Page(s): 20169 - 20180
Date of Publication: 12 January 2024
Electronic ISSN: 2169-3536

Funding Agency:


References

References is not available for this document.