期刊文献+

一种基于Aho-Corasick算法改进的多模式匹配算法 被引量:14

An improved multi-pattern matching algorithm based on Aho-Corasick algorithm
下载PDF
导出
摘要 目前互联网中以文本存在的数据非常庞大,针对在如此庞大的文本中如何准确、快速地找到多个不同的目标字符串的问题,在介绍常见的模式匹配算法的优点和缺点基础上,结合Trie速多模式匹配算法。根据对比性实验的结果分析得出,改进AC且匹配速度大约是AC算法的5倍。 There exists a large amount of text data on the Internet currently.In allusion to the problem that how to search out multiple different target character strings accurately and quickly in such large text,an improved fast multi-pattern matching algorithm is proposed on the basis of introducing the advantages and disadvantages of common pattern matching algorithms,and combining with the idea of converting the Trie tree into the double array form.A comparison experiment was carried out.The analysis results show that the improved AC algorithm can successfully match all the to-be queried pattern strings in the text,and its matching speed is about 5 times of that of the AC algorithm,which shows that the improved AC algorithm has good effects in aspects of matching speed,recall ratio and space utilization rate.
作者 陈永杰 吾守尔.斯拉木 于清 CHEN Yongjie;Wushour Silamu;YU Qing(School of Information Science and Engineering,Xinjiang University,Urumqi 830046,China)
出处 《现代电子技术》 北大核心 2019年第4期89-93,共5页 Modern Electronics Technique
基金 国家"973"重点基础研究计划(2014CB340506)~~
关键词 字符串匹配 多模式匹配 TRIE树 双数组 AC算法 匹配速度 character string matching multi-pattern matching Trie tree double array AC algorithm matching speed
  • 相关文献

参考文献4

二级参考文献38

  • 1蒋文沛.对字符串模式匹配KMP算法的探讨[J].南宁师范高等专科学校学报,2001,18(2):72-74. 被引量:5
  • 2王凌云,李琦,江洲.国内地理编码数据库系统开发与研究[J].计算机工程与应用,2004,40(21):167-168. 被引量:33
  • 3闵联营,赵婷婷.BM算法的研究与改进[J].武汉理工大学学报(交通科学与工程版),2006,30(3):528-530. 被引量:19
  • 4李璐 王宏志 李建中 等.Ed-Sjoin;一种优化的字符串相似连接算法.计算机研究与发展,2009,:319-325.
  • 5LI G L, DENG D, WANG J N, et al. Pass-Join: a partition-based method for similarity joins [ J]. Proceedings of the VLDB Endow- ment, 2011,5(3) : 253 - 264.
  • 6JESTES J., LI F F, YAN Z P, et al. Probabilistic string similarity joins[ C] // Proceedings of 29th ACM SIGMOD International Confer- ence on Management of Data. New York: ACM, 2010:327 -338.
  • 7BRYAN B, EBERHARDT F, FALOUTSOS C. Compact similarity joins [ C]//ICDE 2008: Proceeding of the 24th International Con- ference on Data Engineering. Piseataway: IEEE, 2008:346 -355.
  • 8XIAO C, WANG W, LIN X M, et al. Efficient similarity joins for near duplicate detection [ C]// WWW'08: Proceedings of the 17th International Conference on World Wide Web. New York: ACM, 2011:695-704.
  • 9FENG J H, WANG J N, LI G L. Tile-Join: a Tile-based method for efficient string similarity joins [ J]. The VLDB Journal, 2012, 21 (4) : 437 -461.
  • 10FENG J H, LI G L. Efficient fuzzy type-ahead search in XML data [ J]. IEEE Transactions on knowledge and Data Engineering, 2012, 24(5) : 882 - 895.

共引文献24

同被引文献132

引证文献14

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部