期刊文献+

基于译文特征的中英文跨语种抄袭识别 被引量:3

Research of Ch-En Cross-Lingual Plagiarism Detection Based on Translation Features
下载PDF
导出
摘要 针对科技类学术论文的跨语种反抄袭识别问题,以中英跨语种抄袭的识别为目标展开了研究,用于探讨进行跨语种抄袭识别的方法.通过挖掘中文译文的内在规律找到了一组可以表明译文风格的译文特征,并通过这些译文特征和决策树算法识别出存在抄袭嫌疑的科技论文.试验系统开放测试的准确率和召回率分别到达了88.68%和79.17%. Research on anti plagiarism detection of scientific papers in single language has acquired rele- vance and a number of practical systems have been developed. However, the relevant study and achieve- ment are relatively few in cross-lingual anti-plagiarism. Targeting at scientific papers, this paper discussed the implementation of Chinese-English cross-lingual plagiarism detection. The paper locates a set of trans- lation features by digging internal laws of Chinese translation. Through these features, papers which are suspected of plagiarism can be identified by the decision tree algorithm. In open test, its recalling rate achieves 88.68% and the precision rate 79.17%.
出处 《上海交通大学学报》 EI CAS CSCD 北大核心 2012年第6期989-993,998,共6页 Journal of Shanghai Jiaotong University
基金 教育部科技论文快速共享专项研究课题(2010121) 国家高技术研究发展计划(863)项目(2010AA012505)
关键词 论文抄袭 译文特征 跨语种 决策树 paper plagiarism translation feature cross-language decision tree
  • 相关文献

参考文献11

  • 1鲍军鹏,沈钧毅,刘晓东,宋擒豹.自然语言文档复制检测研究综述[J].软件学报,2003,14(10):1753-1760. 被引量:69
  • 2金博,史彦军,滕弘飞.中文文档复制检测系统研究[J].计算机工程,2005,31(19):79-81. 被引量:9
  • 3聂规划,付志超,陈冬林,刘平峰.基于本体的论文复制检测系统[J].计算机工程,2009,35(6):79-81. 被引量:9
  • 4朱一凡.翻译误区与汉语的畸形欧化[J].民族论坛,2008(2):56-58. 被引量:10
  • 5贺阳.现代汉语欧化语法现象研究[J].世界汉语教学,2008,22(4):16-31. 被引量:50
  • 6Parker A, Hamblen J O. Computer algorithms for plagiarism detection[J]. IEEE Transactions on Eduea- lion, 1989,32(2):94 99.
  • 7Barr6n-Cedeno A, Rosso P, Pinto D, et al. On cross-lingual plagiarism analysis using a statistical model[C]//ECAI'08 PAN Workshop Uncovering Pla- giarism. Patras, Greece: Authorship, and Social Soflware Misuse, 2008:9 13.
  • 8Brin S, Davis J, Garcia-Molina H. Copy detection mechanisms for digital documents [C] // SIGMOD '95, Proceedings of the ACM SIGMOD Annual Con- ference. New York, USA: ACM Press, 1995: 398- 409.
  • 9Brown P F, Della Pietra S A, Della Pietra V J, etal.The mathematics of statistical machine translation: Parameter estimation [J]. Computational Linguistics, 1993, 19(2):263-311.
  • 10Ceska Z, Toman M, Jezek K. Multilingual plagia rism detection [C]//AIMSA' 08 : Proceedings of the 13th International Conference on Artificial Intelli- gence. Berlin, Heidelberg: Springer-Verlag, 2008: 83-92.

二级参考文献35

共引文献133

同被引文献37

引证文献3

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部