Abstract
Detecting highly obfuscated plagiarism in big data environments, such as paraphrasing and cross-language plagiarism, is often difficult for anti-plagiarism systems because plagiarism techniques are becoming increasingly complex. This paper presents HawkEyes, a plagiarism detection system built on the source retrieval and text alignment algorithms that were developed for the international competition on plagiarism detection organized by CLEF. The text alignment algorithm in HawkEyes won first place at PAN@CLEF2012. In the demonstration, we present our system as implemented on the PAN@CLEF2014 training corpus.
Source
Proceedings of the International Conference of Pioneering Computer Scientists, Engineers and Educators (ICPCSEE)
2015, No. 1, pp. 134-135 (2 pages)