THE EFFECT OF THE NAZIEF-ADRIANI STEMMING ALGORITHM ON THE PERFORMANCE OF THE WINNOWING ALGORITHM FOR DETECTING PLAGIARISM IN INDONESIAN TEXT
Abstract: The Winnowing algorithm is one of many algorithms for detecting
document similarity and plagiarism. Some studies show that the Winnowing
algorithm performs quite well. One form of
plagiarism is paraphrase plagiarism. Paraphrase plagiarism can be done by
changing sentence structure, changing vocabulary, and adding or changing
affixes. Our previous experiments suggest that the detection of document
similarity can be improved by reducing affixed words to their base words.
In computer science, this technique is known as stemming: extracting the
base word from an affixed word. Stemming is commonly applied during
filtering to reduce storage requirements. For
Indonesian, the Nazief-Adriani stemming algorithm is by far the most
appropriate. This study examines the effect of the Nazief-Adriani stemming
algorithm on the Winnowing algorithm's performance on Indonesian texts. The
results show that the stemming process using a Bloom filter in the Winnowing
algorithm tends to decrease the similarity level achieved, but it accelerates
processing by approximately 30%.
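
To make the pipeline described in the abstract concrete, the following is a minimal sketch of Winnowing fingerprinting with an optional stemming step. It is not the study's implementation: the k-gram length (5), the window size (4), the MD5-based hash, and the Jaccard similarity measure are all assumptions chosen for illustration, and `stem` is a hypothetical hook where a Nazief-Adriani stemmer could be plugged in.

```python
import hashlib
import re


def kgram_hashes(text, k=5):
    """Hash every k-gram of the text after removing non-alphanumerics."""
    cleaned = re.sub(r"[^a-z0-9]", "", text.lower())
    grams = [cleaned[i:i + k] for i in range(len(cleaned) - k + 1)]
    # MD5 is used here only to get a stable hash value (an assumption).
    return [int(hashlib.md5(g.encode()).hexdigest()[:8], 16) for g in grams]


def winnow(hashes, window=4):
    """Keep the minimum hash of each window, rightmost one on ties."""
    if not hashes:
        return set()
    fingerprints = set()
    for i in range(max(1, len(hashes) - window + 1)):
        win = hashes[i:i + window]
        smallest = min(win)
        # Position of the rightmost occurrence of the minimum in the window.
        pos = i + len(win) - 1 - win[::-1].index(smallest)
        fingerprints.add((smallest, pos))
    return fingerprints


def similarity(a, b, k=5, window=4, stem=None):
    """Jaccard similarity of the two texts' fingerprint hash values."""
    def prepare(text):
        if stem is not None:
            # Hypothetical hook: reduce each word to its base form first.
            text = " ".join(stem(word) for word in text.split())
        return {h for h, _ in winnow(kgram_hashes(text, k), window)}

    fa, fb = prepare(a), prepare(b)
    if not fa and not fb:
        return 0.0
    return len(fa & fb) / len(fa | fb)
```

Calling `similarity(doc_a, doc_b, stem=my_stemmer)` compares the stemmed versions of both documents, while leaving `stem` unset compares the raw texts. Stemming shortens the text before fingerprinting, so fewer k-grams are hashed, which is one plausible source of a speed-up like the one the abstract reports.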
Author: Hargyo Tri Nugroho
Journal Code: jptkomputerdd170299