
A measuring method and algorithm of text similarity (cited by: 3)
Abstract: This paper proposes a measure for judging the similarity of two texts. It defines two quantities, the degree of similarity and the degree of quotation, and briefly analyzes what each one means. Using the idea of dynamic programming, it then gives a method for computing them with time complexity O(mn).
Authors: 何明, 胡彩霞
Source: Journal of Huangshan University (《黄山学院学报》), 2005, No. 6, pp. 71-72 (2 pages)
Keywords: text; similarity; measure; algorithm
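
The abstract names an O(mn) dynamic-programming computation but this record does not reproduce the paper's definitions. The sketch below is therefore only an illustration of the general idea, not the authors' method: it assumes the two measures are built on the longest common subsequence (LCS) of the two texts, and the function names `lcs_length`, `similarity`, and `quotation_degree`, together with both formulas, are hypothetical choices made for this example.

```python
def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b.

    Classic dynamic programming: O(m*n) time, O(n) extra space,
    where m = len(a) and n = len(b).
    """
    prev = [0] * (len(b) + 1)  # DP row for the previous character of a
    for ch_a in a:
        curr = [0]  # column 0: empty prefix of b
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """ASSUMED definition (not from the paper): 2 * LCS / (m + n), symmetric, in [0, 1]."""
    if not a and not b:
        return 1.0  # two empty texts are treated as identical
    return 2 * lcs_length(a, b) / (len(a) + len(b))


def quotation_degree(a: str, b: str) -> float:
    """ASSUMED definition (not from the paper): share of a that reappears, in order, in b."""
    if not a:
        return 0.0
    return lcs_length(a, b) / len(a)
```

Under these assumed definitions, `similarity` is symmetric in its arguments while `quotation_degree` is not, which is one plausible reason to keep two separate measures: a short text copied wholesale into a long one scores high on quotation but low on similarity.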
