Comparison between the Stemmer Porter Effect and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism

Alam Rahmatulloh, Neng Ika Kurniati, Irfan Darmawan, Adi Zaenal Asyikin, Deden Witarsyah J

Abstract


Technological development is shifting documents from physical paper to digital form, with significant consequences. On the positive side, paper waste is reduced; on the negative side, the ease of copying digital data has increased plagiarism. Experts have made many efforts to address plagiarism, one of which is to use the winnowing algorithm to detect plagiarized text. Many optimizations of the winnowing algorithm apply stemming techniques, and the most widely used stemmers include the Porter and Nazief-Adriani stemmers. However, the effect of these stemmers on the performance of the winnowing algorithm in measuring plagiarism has not yet been compared. This study therefore investigates how each stemmer affects the winnowing algorithm, so that plagiarism detection results can be improved. The results indicate that the Nazief-Adriani stemmer has the better effect on detection, decreasing the similarity value by only 0.28%, while the Porter stemmer has the better effect on speed, making processing 69% faster.

Keywords


Nazief-Adriani; plagiarism; Porter; stemmer; winnowing.

DOI: http://dx.doi.org/10.18517/ijaseit.9.4.8844

Published by INSIGHT - Indonesian Society for Knowledge and Human Development