期刊文献+

基于异构隐式存储的多模式匹配算法 被引量:6

Multi-pattern matching algorithm based on heterogeneous implicit storage
下载PDF
导出
摘要 提出了紧缩存储型Aho-Corasick算法变体,以异构的按需隐式存储取代同构的例行显式存储,从横向扇出压缩与纵向路径压缩2个方向入手,围绕着压缩稀疏事件表展开,当字符集大小σ=256时可将存储量缩减为原来的0.69%左右,而σ=64K时则达0.004%,即空间复杂度降为原来的(lbσ)/σ左右。依据扇出疏密程度的不同,分类采用了4种有针对性的快速事件定位方法,加之优化的失败迁移,使得存储量的大幅缩减不以速度的明显损失为代价,实验也证实了这一点。适用于需承载大型模式集和较长模式串而对时延和抖动都比较敏感的场合(如在线数据流过滤),在宽字符(如UNICODE型亚洲字符)匹配方面拥有显著优势。 A variation of Aho-Corasick algorithm via compact storage was presented, which replaced homogeneous routine explicit storage with heterogeneous requirement oriented implicit one, first started from two aspects of widthwise fan-out compression and lengthwise path compression, then expanded around the compression of sparse event table, thus reduced memory usage to about 0.69% of original one when alphabet size a = 256, 0.004% when σ = 64K, and space complexity approximate (1bσ)/σ of origin. According to different degrees of fan-out, four kinds of specific fast event location methods were adopted, plus optimized failure transitions, led to the fact that the dramatic reduction of memory usage isn't at the cost of obvious loss of speed, which was also proved by experiments. It's suitable to be applied in the cases that need holding mass set of longer patterns and are sensitive to delay and jitter(for example, online data stream filtering) and presents distinguished superiority in wide-character(such as Asian character of UNICODE type) matching.
出处 《通信学报》 EI CSCD 北大核心 2009年第3期119-124,共6页 Journal on Communications
基金 国家重点基础研究发展计划("973"计划)基金资助项目(2007CB311100) 国家高技术研究发展计划("863"计划)基金资助项目(2007AA01Z473)~~
关键词 多模式匹配 紧缩存储 扇出压缩 路径压缩 事件定位 multiple patterns matching compact storage fan-out compression path compression event location
  • 相关文献

参考文献1

二级参考文献10

  • 1Cowan C,Arnold S,Beattie S,Wright C,Viega J.DEF- CON capture the flag: Defending vulnerable code form in- tense attack[].Proceedings of the DARPA DISCEX III Conference.2003
  • 2E2xB algorithm patch for Snort version 2.4.2. http://dcs.ics.forth.gr/Activities/Projects/snort.html . 2005
  • 3Wu S,Manber U.A fast algorithm for multi-pattern searching[].Technical Report TR-- University of Arizona.1994
  • 4Tuck N,Sherwood T,Calder B,Varghese G.Determinis- tic memory-efficient string matching algorithms for intru- sion detection[].Proceedings of IEEE INFOCOM.2004
  • 5Baker Z K,Prasanna V K.Time and area efficient pattern matching on FPGAs[].Proceedings of the th ACM In- ternational Symposium on Field-Programmable Gate Ar- rays.2004
  • 6Dharmapurikar S,Krishnamurthy P,Sproull T,Lockwood J.Deep packet inspection using parallel Bloom filters[].Proceedings of the th Symposium on High Performance Interconnects.2003
  • 7Nilsen G,Torresen J,Sorasen O.A variable word-width content addressable memory for fast string matching[].Proceedings of the nd Norchip Conference.2004
  • 8Janardhan S,Bu L,Chandy J A.A signature match proces- sor architecture for network intrusion detection[].Pro- ceedings of the th Annual IEEE Symposium on Field- Programmable Custom Computing Machine.2005
  • 9Attig M,Lockwood J.A framework for rule processing in reconfigurable network systems[].Proceedings of the th Annual IEEE Symposium on Field-Programmable Custom Computing Machines.2005
  • 10Yusuf S,Luk W.Bitwise optimized CAM for network intrusion detection systems[].Proceedings of the International Conference on Field Programmable Logic and Applications.2005

共引文献8

同被引文献55

  • 1杨东红,徐恪,崔勇.改进的Wu-Manber多模式串匹配算法[J].清华大学学报(自然科学版),2006,46(4):555-558. 被引量:13
  • 2Aho A V, Corasick M J. Efficient string matching: An aid to bibliographic search [J]. Communication of the ACM, 1975, 18(6), 333-340.
  • 3Fredriksson K, Grabowski S. Average optimal string matching [J]. Journal of Discrete Algorithms, 2009, 7 (4) : 579-594.
  • 4Nieminen J, Kilpelainen P. Efficient implementation of Aho- Corasick pattern matching automata using unicode [J]. Software: Practice and Experience, 2007, 37(6): 669-690.
  • 5Knuth D E, Morris J H, Pratt V R. Fast pattern matching in strings[J]. SIAM Journal of Computing, 1997, 6(1): 323- 350.
  • 6Liu Yanbing. Research of string matching algorithm optimization ED]. Beijing: Institute of Computing Technology, Chinese Academy of Science, 2006.
  • 7Cormen T H, Leiserson C E, Rives R L, et al. Introduction to Algorithms [M]. 2nd ed. Cambridge: MIT Press, 2001.
  • 8Powell M. The large canterbury corpus [EB/OL]. [ 2012-06- 02]. http://corpus, canterbury, ac. nz/descriptions/g:large.
  • 9Navarro G, Raffinot M. Flexible Pattern Matching in Strings: Practical On-Line Search Algorithms for Texts and Biological Sequences. Cambridge.. Cambridge University Press, 2002.
  • 10Aho A V, Corasick M J. Efficient string matching: An aid to bibliographic search. Communications of the ACM, 1975, 18(6) : 333-340.

引证文献6

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部