Abstract:
Many websites such as Yelp provide platform for users to write reviews about places they have visited. But not all reviews are equally useful. However, it generally takes...Show MoreMetadata
Abstract:
Many websites such as Yelp provide platform for users to write reviews about places they have visited. But not all reviews are equally useful. However, it generally takes from several weeks to months to receive feedback about “usefulness” of review from online community. So there is a need to automatically predict the “usefulness” of review. In this paper, we are trying to solve the specific question “How many ‘useful’ votes a Yelp review will receive?” by using bag-of-words, linguistic, geographical, statistical, popularity and other qualitative features extracted from user, business and review information provided by Yelp. We use state-of-the-art machine learning algorithms for regression to predict required numeric value of ‘usefulness’ of review. We further gained performance improvement by introducing a batch mode localized weighted regression model. This localized regression approach resulted into RMSLE of 0.47769, which is better than traditional methods.
Published in: 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)
Date of Conference: 26-28 August 2016
Date Added to IEEE Xplore: 23 March 2017
ISBN Information:
Electronic ISSN: 2327-0594