Performance Evaluation of Classification Algorithm for Movie Review Sentiment Analysis

Authors

  • Sutriawan Sutriawan
  • Pulung Nurtantio Andono
  • Muljono Muljono
  • Ricardus Anggi Pramunendar

DOI:

https://doi.org/10.47839/ijc.22.1.2873

Keywords:

Decision Tree, K-Nearest Neighbor, Support Vector Machine, Naive Bayes, Sentiment Analysis, Performance Evaluation

Abstract

The majority of the current research on sentiment analysis, which covers topics like political reviews, movie reviews, and product reviews, has developed quickly. The classification and clustering stage of sentiment analysis research involves a number of subjects. Some of them cover text classification comparison research and algorithm performance optimization. An intricate issue in sentiment analysis research is dealing with unstructured or semi-structured data. The sentiment analysis procedure and improving the efficacy of the classifier’s algorithm are both hampered by unstructured data. In order to manage unstructured data successfully and provide accurate and relevant information, unique strategies are required. The proposed classification model performance evaluation using Support Vector Machine, Naive Bayes, K-Nearest Neighbor, and Decision Tree is specifically covered in this paper. According to the study’s findings, SVM has an accuracy rate of 96% and Naive Bayes is 86%. While the decision tree’s gain accuracy is 78 percent and the kNN classification model’s gain accuracy is 78 percent, respectively. The test results demonstrate that SVM is superior to other classification models in terms of accuracy performance.

References

S. Widya Sihwi, I. Prasetya Jati, and R. Anggrainingsih, “Twitter sentiment analysis of movie reviews using information gain and Naïve Bayes classifier,” Proceedings of the 2018 Int. Semin. Appl. Technol. Inf. Commun. Creat. Technol. Hum. Life, iSemantic 2018, pp. 190–195, 2018. https://doi.org/10.1109/ISEMANTIC.2018.8549757.

B. Das and S. Chakraborty, “An improved text sentiment classification model using TF-IDF and next word negation,” 2018, [Online]. Available at: http://arxiv.org/abs/1806.06407

R. I. Mukhamediev et al., “Review of artificial intelligence and machine learning technologies: Classification, restrictions, opportunities and challenges,” Mathematics, vol. 10, no. 15, pp. 1–25, 2022. https://doi.org/10.3390/math10152552.

P. Kalaivani and K. L. Shunmuganathan, “An improved K-nearest-neighbor algorithm using genetic algorithm for sentiment classification,” Proceedings of the 2014 Int. Conf. Circuits, Power Comput. Technol. ICCPCT 2014, pp. 1647–1651, 2014. https://doi.org/10.1109/ICCPCT.2014.7054826.

T. T. Thet, J.-C. Na, and C. S. G. Khoo, “Aspect-based sentiment analysis of movie reviews on discussion boards,” J. Inf. Sci., vol. 36, no. 6, pp. 823–848, 2010. https://doi.org/10.1177/0165551510388123.

P. Chaovalit and L. Thou, “Movie review mining: A comparison between supervised and unsupervised classification approaches,” Proc. Annu. Hawaii Int. Conf. Syst. Sci., vol. 00, no. C, p. 112, 2005. https://doi.org/10.1109/HICSS.2005.445.

R. Maulana, P. A. Rahayuningsih, W. Irmayani, D. Saputra, and W. E. Jayanti, “Improved accuracy of sentiment analysis movie review using support vector machine based information gain,” J. Phys. Conf. Ser., vol. 1641, no. 1, 2020. https://doi.org/10.1088/1742-6596/1641/1/012060.

K. Lee, J. Park, I. Kim, and Y. Choi, “Predicting movie success with machine learning techniques: ways to improve accuracy,” Inf. Syst. Front., vol. 20, no. 3, pp. 577–588, 2018. https://doi.org/10.1007/s10796-016-9689-z.

A. H. Aliwy and E. H. A. Ameer, “Comparative study of five text classification algorithms with their improvements,” Int. J. Appl. Eng. Res., vol. 12, no. 14, pp. 4309–4319, 2017. https://files2.shewaya.com/files/62055.pdf.

A. I. Kadhim, “Survey on supervised machine learning techniques for automatic text classification,” Artif. Intell. Rev., vol. 52, no. 1, pp. 273–292, 2019. https://doi.org/10.1007/s10462-018-09677-1.

V. B. Vaghela, B. M. Jadav, “Analysis of various sentiment classification techniques,” Int. J. Comput. Appl., vol. 140, no. 3, pp. 22–27, 2016. https://doi.org/10.5120/ijca2016909259.

M. Karim and S. Das, “Sentiment analysis on textual reviews,” IOP Conf. Ser. Mater. Sci. Eng., vol. 396, no. 1, pp. 122–127, 2018. https://doi.org/10.1088/1757-899X/396/1/012020.

J. Miharja, J. L. Putra, and N. Hadianto, “Comparison of machine learning classification algorithm on hotel review sentiment analysis (Case study: Luminor hotel Pecenongan),” J. Pilar Nusa Mandiri, vol. 16, no. 1, pp. 59–64, 2020. https://doi.org/10.33480/pilar.v16i1.1131.

R. S. Jagdale, V. S. Shirsat, and S. N. Deshmukh, “Sentiment analysis on product reviews using machine learning techniques,” Adv. Intell. Syst. Comput., vol. 768, pp. 639–647, 2019. https://doi.org/10.1007/978-981-13-0617-4_61.

B. Saberi and S. Saad, “Sentiment analysis or opinion mining: A review,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 7, no. 5, pp. 1660–1666, 2017. https://doi.org/10.18517/ijaseit.7.5.2137.

Y. Nurdiansyah, S. Bukhori, and R. Hidayat, “Sentiment analysis system for movie review in Bahasa Indonesia using Naive Bayes classifier method,” J. Phys. Conf. Ser., vol. 1008, no. 1, 2018. https://doi.org/10.1088/1742-6596/1008/1/012011.

R. I. Pristiyanti, M. A. Fauzi, and L. Muflikhah, “Sentiment analysis summarizing film reviews using information gain and K-nearest neighbor methods,” J. Pengemb. Teknol. Inf. dan Ilmu Komput. Univ. Brawijaya, vol. 2, no. 3, pp. 1179–1186, 2018. (in Indonesian), [Online]. Available at: http://j-ptiik.ub.ac.id/index.php/j-ptiik/article/view/1140.

R. Hidayat and S. Minati, “Comparative Analysis of Text Mining Classification Algorithms for English and Indonesian Qur’an Translation,” IJID (International J. Informatics Dev., vol. 8, no. 1, p. 47, 2019. https://doi.org/10.14421/ijid.2019.08108.

B. V. Indriyono, E. Utami, and A. Sunyoto, “Utilization of the Porter Stemmer Algorithm for Indonesian in the process of classifying book types,” J. Buana Inform., vol. 6, no. 4, pp. 301–310, 2015. (in Indonesian) https://doi.org/10.24002/jbi.v6i4.462.

A. Tripathy, A. Agrawal, and S. K. Rath, “Classification of sentimental reviews using machine learning techniques,” Procedia Comput. Sci., vol. 57, no. March, pp. 821–829, 2015. https://doi.org/10.1016/j.procs.2015.07.523.

D. Kim, D. Seo, S. Cho, and P. Kang, “Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec,” Inf. Sci. (Ny)., vol. 477, pp. 15–29, 2019. https://doi.org/10.1016/j.ins.2018.10.006.

P. Dellia and A. Tjahyanto, “Tax complaints classification on twitter using text mining,” IPTEK J. Sci., vol. 2, no. 1, p. 11, 2017. https://doi.org/10.12962/j23378530.v2i1.a2254.

C. J. Huang, Y. J. Yang, D. X. Yang, and Y. J. Chen, “Frog classification using machine learning techniques,” Expert Syst. Appl., vol. 36, no. 2 PART 2, pp. 3737–3743, 2009. https://doi.org/10.1016/j.eswa.2008.02.059.

K. Kowsari, K. J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Brown, “Text classification algorithms: A survey,” Inf., vol. 10, no. 4, pp. 1–68, 2019. https://doi.org/10.3390/info10040150.

P. Y. Pawar and S. H. Gawande, “A comparative study on different types of approaches to text categorization,” Int. J. Mach. Learn. Comput., vol. 2, no. 4, pp. 423–426, 2012. https://doi.org/10.7763/IJMLC.2012.V2.158.

M. Mamtesh and S. Mehla, “Sentiment analysis of movie reviews using machine learning classifiers,” Int. J. Comput. Appl., vol. 182, no. 50, pp. 25–28, 2019. https://doi.org/10.5120/ijca2019918756.

V. C. Pande and A. S. Khandelwal, “Comparative analysis of various text classification algorithms,” Int. J. Comput. Sci. Inf. Technol., vol. 8, no. 5, pp. 574–577, 2017.

V. Korde, “Text classification and classifiers: A survey,” Int. J. Artif. Intell. Appl., vol. 3, no. 2, pp. 85–99, 2012. https://doi.org/10.5121/ijaia.2012.3208.

R. B. Purnomoputra and U. N. Wisesty, “Sentiment analysis of movie reviews using Naïve Bayes method with Gini index feature selection,” no. July, pp. 85–94, 2019. https://doi.org/10.34818/jdsa.2019.2.36.

Downloads

Published

2023-03-29

How to Cite

Sutriawan, S., Andono, P. N., Muljono, M., & Pramunendar, R. A. (2023). Performance Evaluation of Classification Algorithm for Movie Review Sentiment Analysis. International Journal of Computing, 22(1), 7-14. https://doi.org/10.47839/ijc.22.1.2873

Issue

Section

Articles