Abstract
Plagiarism detection systems are now widely used in universities and colleges to detect academic dishonesty among students. The core of such a system is a similarity measure between two students' assignments: when the similarity is high enough, plagiarism is suspected, which gives the teacher evidence for a fair judgment. This paper studies RKRGST, an algorithm widely used in plagiarism detection systems that combines the Karp-Rabin (KR) algorithm with the Greedy String Tiling (GST) algorithm. The analysis finds that RKRGST is highly efficient at computing the similarity of strings.
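As a rough illustration of how Karp-Rabin hashing and Greedy String Tiling can be combined, the following is a minimal sketch and not the paper's implementation. It makes several assumptions that do not come from the abstract: the inputs are plain Python strings compared character by character, the minimum match length `min_match` is fixed (the full RKR-GST algorithm adapts its search length between passes), and the similarity score is taken as twice the tiled length divided by the combined length of both strings. The function names `rolling_hashes`, `greedy_string_tiling`, and `similarity` are hypothetical.

```python
from collections import defaultdict


def rolling_hashes(text, width, base=257, mod=(1 << 61) - 1):
    """Karp-Rabin hashes of every window of `width` characters in `text`."""
    if width <= 0 or len(text) < width:
        return []
    codes = [ord(ch) for ch in text]
    high = pow(base, width - 1, mod)  # weight of the character leaving the window
    h = 0
    for c in codes[:width]:
        h = (h * base + c) % mod
    hashes = [h]
    for i in range(width, len(codes)):
        # Drop the oldest character, shift, and append the newest one.
        h = ((h - codes[i - width] * high) * base + codes[i]) % mod
        hashes.append(h)
    return hashes


def greedy_string_tiling(a, b, min_match=3):
    """Return non-overlapping tiles (start_a, start_b, length), longest first."""
    hashes_a = rolling_hashes(a, min_match)
    hashes_b = rolling_hashes(b, min_match)
    marked_a = [False] * len(a)
    marked_b = [False] * len(b)
    tiles = []
    while True:
        # Index the still-unmarked windows of b by their hash value.
        index = defaultdict(list)
        for j, h in enumerate(hashes_b):
            if not any(marked_b[j:j + min_match]):
                index[h].append(j)
        best_len, candidates = min_match, []
        for i, h in enumerate(hashes_a):
            if any(marked_a[i:i + min_match]):
                continue
            for j in index.get(h, ()):
                # Verify character by character (guards against hash
                # collisions) and extend the match as far as possible.
                k = 0
                while (i + k < len(a) and j + k < len(b)
                       and a[i + k] == b[j + k]
                       and not marked_a[i + k] and not marked_b[j + k]):
                    k += 1
                if k > best_len:
                    best_len, candidates = k, [(i, j, k)]
                elif k == best_len:
                    candidates.append((i, j, k))
        if not candidates:
            break
        # Turn the maximal matches of this pass into tiles, skipping overlaps.
        for i, j, k in candidates:
            if not any(marked_a[i:i + k]) and not any(marked_b[j:j + k]):
                marked_a[i:i + k] = [True] * k
                marked_b[j:j + k] = [True] * k
                tiles.append((i, j, k))
    return tiles


def similarity(a, b, min_match=3):
    """Assumed score: 2 * tiled length / (len(a) + len(b)), in [0, 1]."""
    if not a and not b:
        return 0.0
    covered = sum(k for _, _, k in greedy_string_tiling(a, b, min_match))
    return 2 * covered / (len(a) + len(b))
```

For example, `similarity("abcdefgh", "xxabcdyyefgh")` tiles the two runs `abcd` and `efgh` and yields 0.8 under the assumed formula. A practical detector would normally compare token streams rather than raw characters, and it would flag a pair of assignments once the score passes a chosen threshold.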
Source
Journal of Southwest Minzu University (Natural Science Edition) [《西南民族大学学报(自然科学版)》]
CAS
2010, No. 5, pp. 836-840 (5 pages)
Keywords
similarity
RKRGST
Karp-Rabin (KR)
greedy string tiling (GST)