期刊文献+

基于Levenshtein算法的题库相似度检测算法的设计与改进 被引量:1

Design and Improvement of Detection Algorithm for Similarity of Questions Bank Based on Levenshtein Algorithm
下载PDF
导出
摘要 为快速找到题库中题干重复题或相似度很高的试题,利用java Excel API类配合Levenshtein Distance算法实现直接访问excel题库,设计了题库重复题检测算法。在实际使用过程中发现Levenshtein算法存在内存超限,检测结果输出越界等问题,采用字符串分割法及增加控制语句的方式进行改进,获得了良好的实际使用效果。 To find High similarity of question Bank quickly,Detection algorithm is designed with java Excel API. But there is possible phenomenon,such as memory limit,output bounds of test results,etc. in the actual use of the process. In order to solve these problems,we use String segmentation method and increase Control statement to get good effect.
作者 胡玉琦
出处 《东莞理工学院学报》 2014年第5期57-60,共4页 Journal of Dongguan University of Technology
关键词 Levenshtein算法 重复题 字符串分割 Levenshtein Algorithm Repetitive question String segmentation method
  • 相关文献

参考文献8

二级参考文献18

  • 1章成志.基于多层特征的字符串相似度计算模型[J].情报学报,2005,24(6):696-701. 被引量:39
  • 2汤世平,樊孝忠.基于多示例学习的题库重复性检测研究[J].北京理工大学学报,2005,25(12):1071-1074. 被引量:5
  • 3Salton G, Lesk ME. Computer evaluation of indexing and text processing. Journal of the ACM,1968,15(1):8 - 36.
  • 4Baeza-Yates R, Ribeiro-Neto B. Modem Information Retrieval: Addison Wesley, 1999.38 - 42.
  • 5Salton G, Buckley C. Term-weighting approaches in automatic retrieval. Information Processing & Management, 1988,24(5):513 - 523.
  • 6董振东 董强.知网[EB/OL].http:∥www.keenage.com.,.
  • 7Nirenburg S.Two approaches of matching in example-based machine translation.In:Proc TMI-93.Kyoto,Japan,1993
  • 8Li S,Zhang J,et al.Journal of Computer Science and Technology,2002,17(6):933
  • 9Ristad E S,Yianilos P N.IEEE PAMI,1998,20(5):522
  • 10Chatterjee N.A Statistical approach for similarity measurement between sentences for EBMT.1999

共引文献125

同被引文献3

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部