期刊文献+

一种存储优化的多模式匹配算法

A storage-optimized multi-pattern matching algorithm
下载PDF
导出
摘要 AC(Aho-Corasick)自动机是经典的多模式匹配算法,但在模式串字符集较大的情况下,AC自动机的存储开销较大。为降低存储开销提出了存储优化的多模式匹配算法SMMA,该算法在Trie树建立阶段利用正向表来存储每个状态的后续状态指针以及失配指针,而无需存储字符集所有字符的后继指针,从而压缩了每个状态的储存空间。实验表明,所提出的算法与AC自动机算法在时间效率上相近,但极大地降低了存储开销。 Aho-Corasick automaton is a classic multi-pattern matching algorithm; however, when the character set of patterns becomes rather large, its storage overhead correspondingly will be much higher. To solve this problem, this paper proposes a storage-optimized multi-pattern matching algorithm(SMMA). In build-up phase of Trie, the subsequent state pointer and mismatching pointer of each state will be kept in storage based on the indexed table. The algorithm won't store the subsequent state pointer for entire character set of each state, which contributes to the reduction of storage space. The experiments show that the SMMA can reduce the storage overhead greatly while maintaining its efficiency similar to that of AC automation algorithm.
出处 《微型机与应用》 2015年第2期14-17,共4页 Microcomputer & Its Applications
基金 国家自然科学基金(61170108) 国家级大学生创新创业训练计划项目(201310345005)
关键词 模式匹配 AC自动机 TRIE树 pattern matching AC automata Trie tree
  • 相关文献

参考文献13

二级参考文献44

  • 1殷丽华,方滨兴.一种改进的多模式匹配算法[J].华中科技大学学报(自然科学版),2005,33(z1):300-303. 被引量:4
  • 2万国根,秦志光.改进的AC-BM字符串匹配算法[J].电子科技大学学报,2006,35(4):531-533. 被引量:20
  • 3李伟男,鄂跃鹏,葛敬国,钱华林.多模式匹配算法及硬件实现[J].软件学报,2006,17(12):2403-2415. 被引量:42
  • 4蔡晓妍,戴冠中,杨黎斌.改进的多模式字符串匹配算法[J].计算机应用,2007,27(6):1415-1417. 被引量:11
  • 5NAVARRO G, RAFFINOT M. Flexible Pattern Matching in strings [ M ]. Cambridge: Cambridge University Press, 2002.
  • 6AHO A. V, CORASICK M J. Efficient string matching: an aid to bibliographic search[ C]//Commtmications of the ACM. New York: ACM, 1975:333 - 340.
  • 7ALLAUZEN C, RAFFINOT M. Simple optimal string matehing[J]. Journal of Algorithms, 2000, 36(1) :102 -116.
  • 8WU S, MANBER U. A Fast Algorithm for Multi-pattern Searching[ R]. Report TR-94-17, Tucson, AZ: Department of Computer Science, University of Arizona, 1994.
  • 9YAO A C. The complexity of pattern matching for a random string [ J ]. SIAM Journal on Computing, 1979, 8(3) :368 -387.
  • 10TAN L, SHERWOOD T. A high throughput string matching architecture for intrusion detection and prevention[ C]//Proceeding of the 32nd Annual International Symposium on Computer Architecture. Washington : IEEE Computer Society, 2005 : 112 - 122.

共引文献110

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部