Monitoring the Quality of PeduliLindungi Application based on Customer Reviews on Google Play Using Hybrid Naïve Bayes -Laney p' Attribute Control Chart

Muhammad Ahsan, Nia Triamalia Apsari, Muhammad Hisyam Lee

Abstract


Indonesia is battling the COVID-19 pandemic. One of the government's strategies to break the virus's transmission chain is to track digital contacts in Indonesia using the PeduliLindungi application. The Google Play comment section is where users can express their opinions about the app. User opinions discovered on Google Play can be used to perform sentiment analysis and quality evaluation. The Naïve Bayes classification can be used to identify how user opinions contain positive, neutral, or negative sentiments in user reviews of the PeduliLindungi app on Google Play. The p and Laney p' charts can be used for quality evaluation. Laney p' control chart is an attribute chart used to monitor the proportion of defects with large and varied sample sizes. The data used in this study is from April 1, 2020, to March 31, 2022. According to the sentiment analysis results of user reviews of the PeduliLindungi app on Google Play, there are more negative reviews than positive classes. The classification accuracy has an Area Under Curve (AUC) value of 89.05%. This result shows that the test data has good classification. The monitoring results using p and Laney p' charts based on ratings and user reviews of the PeduliLindungi app show that the processes are still not statistically controlled. These findings indicate that the app developer still needs to make improvements.

Keywords


Sentiment analysis; Laney p' control chart; p control chart; naïve bayes classifier; PeduliLindungi

Full Text:

PDF

References


W. H. Organization, “WHO Director-General’s opening remarks at the media briefing on COVID-19-11 March 2020.†Geneva, Switzerland, 2020.

G. D’Aniello, M. Gaeta, and I. La Rocca, “KnowMIS-ABSA: an overview and a reference model for applications of sentiment analysis and aspect-based sentiment analysis,†Artif. Intell. Rev., vol. 55, no. 7, pp. 5543–5574, 2022.

C. P. D. Cyril, J. R. Beulah, N. Subramani, P. Mohan, A. Harshavardhan, and D. Sivabalaselvamani, “An automated learning model for sentiment analysis and data classification of Twitter data using balanced CA-SVM,†Concurr. Eng., vol. 29, no. 4, pp. 386–395, 2021.

S. Styawati, A. Nurkholis, A. A. Aldino, S. Samsugi, E. Suryati, and R. P. Cahyono, “Sentiment Analysis on Online Transportation Reviews Using Word2Vec Text Embedding Model Feature Extraction and Support Vector Machine (SVM) Algorithm,†in 2021 International Seminar on Machine Learning, Optimization, and Data Science (ISMODE), 2022, pp. 163–167, doi: 10.1109/ISMODE53584.2022.9742906.

K.-X. Han, W. Chien, C.-C. Chiu, and Y.-T. Cheng, “Application of support vector machine (SVM) in the sentiment analysis of twitter dataset,†Appl. Sci., vol. 10, no. 3, p. 1125, 2020.

B. AlBadani, R. Shi, and J. Dong, “A novel machine learning approach for sentiment analysis on twitter incorporating the universal language model fine-tuning and SVM,†Appl. Syst. Innov., vol. 5, no. 1, p. 13, 2022.

C. Villavicencio, J. J. Macrohon, X. A. Inbaraj, J.-H. Jeng, and J.-G. Hsieh, “Twitter sentiment analysis towards covid-19 vaccines in the Philippines using naïve bayes,†Information, vol. 12, no. 5, p. 204, 2021.

A. R. Lubis, S. Prayudani, M. Lubis, and O. Nugroho, “Sentiment Analysis on Online Learning During the Covid-19 Pandemic Based on Opinions on Twitter using KNN Method,†in 2022 1st International Conference on Information System & Information Technology (ICISIT), 2022, pp. 106–111.

F. H. Rachman and B. S. Rintyarna, “Sentiment analysis of Madura tourism in new normal era using text blob and KNN with hyperparameter tuning,†in 2021 International Seminar on Machine Learning, Optimization, and Data Science (ISMODE), 2022, pp. 23–27.

Q. Zhao, “Social emotion classification of Japanese text information based on SVM and KNN,†J. Ambient Intell. Humaniz. Comput., pp. 1–12, 2021.

M. K. M. K. Anwar, M. Yusoff, and M. Kassim, “Decision Tree and Naïve Bayes for Sentiment Analysis in Smoking Perception,†in 2022 IEEE 12th Symposium on Computer Applications & Industrial Electronics (ISCAIE), 2022, pp. 294–299.

R. A. Annisa and E. B. Setiawan, “Aspect Based Sentiment Analysis on Twitter Using Word2Vec Feature Expansion Method and Gradient Boosting Decision Tree Classification Method,†in 2022 1st International Conference on Software Engineering and Information Technology (ICoSEIT), 2022, pp. 273–278.

M. Aufar, R. Andreswari, and D. Pramesti, “Sentiment analysis on youtube social media using decision tree and random forest algorithm: A case study,†in 2020 International Conference on Data Science and Its Applications (ICoDSA), 2020, pp. 1–7.

A. Onan, “Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks,†Concurr. Comput. Pract. Exp., vol. 33, no. 23, p. e5909, 2021.

S. O. Alhumoud and A. A. Al Wazrah, “Arabic sentiment analysis using recurrent neural networks: a review,†Artif. Intell. Rev., vol. 55, no. 1, pp. 707–748, 2022.

H. T. Phan, N. T. Nguyen, and D. Hwang, “Convolutional attention neural network over graph structures for improving the performance of aspect-level sentiment analysis,†Inf. Sci. (Ny)., vol. 589, pp. 416–439, 2022.

M. Usama, B. Ahmad, E. Song, M. S. Hossain, M. Alrashoud, and G. Muhammad, “Attention-based sentiment analysis using convolutional and recurrent neural network,†Futur. Gener. Comput. Syst., vol. 113, pp. 571–578, 2020.

K. Rrmoku, B. Selimi, and L. Ahmedi, “Application of trust in recommender systems—Utilizing naive Bayes classifier,†Computation, vol. 10, no. 1, p. 6, 2022.

J. Han, J. Pei, and H. Tong, Data mining: concepts and techniques. Morgan kaufmann, 2022.

D. C. Montgomery, Introduction to statistical quality control. John Wiley & Sons, 2020.

R. Sanchez-Marquez and J. M. J. Vivas, “Multivariate SPC methods for controlling manufacturing processes using predictive models–A case study in the automotive sector,†Comput. Ind., vol. 123, p. 103307, 2020.

M. Ahsan, M. Mashuri, H. Kuswanto, D. D. Prastyo, and H. Khusna, “T2 Control Chart based on Successive Difference Covariance Matrix for Intrusion Detection System,†in Journal of Physics: Conference Series, 2018, vol. 1028, no. 1, p. 12220.

M. Mashuri, M. Ahsan, M. H. Lee, and D. D. Prastyo, “PCA-based Hotelling’s T2 chart with Fast Minimum Covariance Determinant (FMCD) Estimator and Kernel Density Estimation (KDE) for Network Intrusion Detection,†Comput. Ind. Eng., p. 107447, 2021.

M. Ahsan, M. Mashuri, H. Kuswanto, D. D. Prastyo, and H. Khusna, “Outlier detection using PCA mix based T 2 control chart for continuous and categorical data,†Commun. Stat. Comput., vol. 50, no. 5, pp. 1496–1523, 2021.

M. Ahsan, M. Mashuri, H. Khusna, and M. H. Lee, “Multivariate Control Chart Based on Kernel PCA for Monitoring Mixed Variable and Attribute Quality Characteristics,†Symmetry (Basel)., vol. 12, no. 11, p. 1838, 2020.

M. Ahsan, M. Mashuri, and H. Khusna, “Kernel principal component analysis (PCA) control chart for monitoring mixed non-linear variable and attribute quality characteristics,†Heliyon, vol. 8, no. 6, p. e09590, 2022.

M. Imran, J. Sun, X. Hu, F. S. Zaidi, and A. Tang, “Investigating zero-state and steady-state performance of MEWMA-CoDa control chart using variable sampling interval,†J. Appl. Stat., pp. 1–22, 2023.

A. Zaka, M. Naveed, and R. Jabeen, “Performance of attribute control charts for monitoring the shape parameter of modified power function distribution in the presence of measurement error,†Qual. Reliab. Eng. Int., vol. 38, no. 2, pp. 1060–1073, 2022.

A. Johannssen, N. Chukhrova, and P. Castagliola, “The performance of the hypergeometric np chart with estimated parameter,†Eur. J. Oper. Res., vol. 296, no. 3, pp. 873–899, 2022.

A. A. Aly, N. A. Saleh, and M. A. Mahmoud, “An adaptive EWMA control chart for monitoring zero-inflated Poisson processes,†Commun. Stat. Comput., vol. 51, no. 4, pp. 1564–1577, 2022.

T. Mahmood, A. Iqbal, S. A. Abbasi, and M. Amin, “Efficient GLMâ€based control charts for Poisson processes,†Qual. Reliab. Eng. Int., vol. 38, no. 1, pp. 389–404, 2022.

M. Ahsan, M. Mashuri, and H. Khusna, “Evaluation of Laney p’ Chart Performance,†Int. J. Appl. Eng. Res., vol. 12, no. 24, pp. 14208–14217, 2017.

H. Hassani, C. Beneki, S. Unger, M. T. Mazinani, and M. R. Yeganegi, “Text mining in big data analytics,†Big Data Cogn. Comput., vol. 4, no. 1, p. 1, 2020.

L. Hickman, S. Thapa, L. Tay, M. Cao, and P. Srinivasan, “Text preprocessing for text mining in organizational research: Review and recommendations,†Organ. Res. Methods, vol. 25, no. 1, pp. 114–146, 2022.

S. Vijayarani, M. J. Ilamathi, and M. Nithya, “Preprocessing techniques for text mining-an overview,†Int. J. Comput. Sci. Commun. Networks, vol. 5, no. 1, pp. 7–16, 2015.

A. A. Freitas, Data mining and knowledge discovery with evolutionary algorithms. Springer Science & Business Media, 2002.

G. Singh, B. Kumar, L. Gaur, and A. Tyagi, “Comparison between multinomial and Bernoulli naïve Bayes for text classification,†in 2019 International Conference on Automation, Computational and Technology Management (ICACTM), 2019, pp. 593–596.

S. Kharya, S. Agrawal, and S. Soni, “Naive Bayes classifiers: a probabilistic detection model for breast cancer,†Int. J. Comput. Appl, vol. 92, no. 10, pp. 975–8887, 2014.

M. J. Zaki, W. Meira Jr, and W. Meira, Data mining and analysis: fundamental concepts and algorithms. Cambridge University Press, 2014.

A. Mustopa, E. B. Pratama, A. Hendini, and D. Risdiansyah, “Analysis of user reviews for the pedulilindungi application on google play using the support vector machine and naive bayes algorithm based on particle swarm optimization,†in 2020 Fifth International Conference on Informatics and Computing (ICIC), 2020, pp. 1–7.




DOI: http://dx.doi.org/10.18517/ijaseit.13.5.18247

Refbacks

  • There are currently no refbacks.



Published by INSIGHT - Indonesian Society for Knowledge and Human Development