摘要
跨语言剽窃一直是学术不端现象发生的重灾区,也是极难发现的一种剽窃行为。跨语言剽窃的检测和识别技术是目前最亟待发展的技术,也是反剽窃抄袭领域的最大技术难点。在总结和分析了单语剽窃检测和跨语言剽窃检测国内外研究现状的基础上,针对跨语言剽窃检测存在的问题,提出了一种基于指纹融合的跨语言剽窃检测技术,并将所提出的技术在人工构建的剽窃集上进行实验验证,对实验结果进行详细分析和对比分析,验证了该技术的有效性。
Cross-language plagiarism has always been the hardest hit for academic misbehavior. It is also a behavior that is extremely difficult to spot. Cross-language plagiarism detection and identification technology is the most urgent technology that needed to be developed. It is also the biggest technical difficulty in the field of plagiarism. Based on the summary and analysis of current researches on the monolingual plagiarism detection and cross-language plagiarism detection,aiming at the existing problem of cross-language plagiarism detection,this paper proposed a cross-language plagiarism detection technology based on fingerprint fusion. This paper also carried out experimental verification on the plagiarism set of artificial building. Through analyzing and comparing the result of experiments,it can be concluded that the method is indeed effective.
作者
刘刚
左权
杨倩茹
Liu Gang;Zuo Quan;Yang Qianru(College of Computer Science & Technology,Harbin Engineering University,Harbin 150001,China)
出处
《计算机应用研究》
CSCD
北大核心
2019年第1期168-174,共7页
Application Research of Computers
基金
中央高校基本科研业务费专项资金资助项目(HEUCF180604)
黑龙江省博士后科研启动金资助项目(LBH-Q15031)
黑龙江省教育科学规划课题(GJC1215107)
关键词
中间指纹
指纹融合
语义消歧
跨语言剽窃检测
intermediate fingerprint
fingerprint fusion
semantic disambiguation
cross-language plagiarism detection