sentiment analysis of amazon product reviews using text mining
Fájlok
Dátum
Szerzők
Folyóirat címe
Folyóirat ISSN
Kötet címe (évfolyam száma)
Kiadó
Absztrakt
This study has applied five different machine learning algorithms of Bernoulli Naive Bayes, Complement Naive Bayes, Logistic Regression, Linear Support Vector Machine and Random Forest on the Amazon fine food products reviews to predict sentiment (positive or negative). As mentioned above, the dataset is imbalanced. Therefore, the data has been prepared to be balanced and qualified using random resampling and random undersampling. After balancing the data, the size of data has been decreased due to undersampling. This study has conducted three experiments using unigrams, bigrams, and trigrams in TF-IDF. The results from the study showed that in terms of accuracy, the Linear Support Vector Classifier model got the highest accuracy (91.21) and outperformed among the proposed models while the Naive Bayes models had the lowest accuracies. In terms of running time, Naive Bayes models were the fastest ones than the other models while the Random Forest Classifier consumed the longest time among other models.