DHSWM:一种改进的WM多模式匹配算法被引量：8

DHSWM: An improved multi-pattern matching algorithm based on WM algorithm

下载PDF

导出

摘要针对WM算法的查找效率随着模式集规模的增大而降低的问题,提出一种改进算法。在预处理阶段,改变原有Hash表中的链表结构,采用双哈希法将模式串存放在Hash1表中指定的区间,Hash表中存放该存储区间的起始位置与区间长度;Prefix表用于判断模式集中是否存在与当前匹配窗口中文本前缀相同的模式;当Shift表中出现移动值为0时,根据后缀出现在模式串其他位置的信息计算匹配窗口可滑动的最大距离并存于Shift1表中。在查找阶段,采用双哈希法在Hash1表的某一区间中查找模式串,避免在大规模模式集情况下查找过长的模式链表,扩大匹配操作后匹配窗口滑动的距离,减少冗余的匹配操作,缩短查找时间。研究结果表明:在模式集规模较大时,改进后的算法显著地提高了匹配速度;当模式串数目超过5 000条时,改进算法的查找时间要比WM算法缩短40%～47%。 To resolve the problem that with the constant increase of the number of rules,the performance of Wu-Manber algorithm will become less efficient,an improved Wu-Manber algorithm named double Hash searching Wu-Manber algorithm（DHSWM） was proposed.In the pre-processing stage,the patterns were stored in specified intervals in Hash1 table by double Hash method while Hash table was used to store the parameters which indicate the start address of the interval and its length.Prefix table was used to determine whether the patterns in set and the text of current matching window had the same prefix.When the shifting distance was 0 in Shift table,Shift1 table was used to store the maximum sliding distance of matching window according to the suffixes appearing in other locations of pattern string.In the searching stage,double Hash method was used to look up patterns in the interval of Hash1 table to avoid searching for overlong linked list in the case of large scale pattern set.The sliding distance of matching window was enlarged after the matching procedure,so redundant matching operations was reduced and the search time was shortened.The results indicate that the algorithm can improve the speed of pattern matching when the scale of the pattern set is large.Compared with the WM algorithm,the DHSWM algorithm can reduce the search time by 40%？47% when the number of patterns is more than 5 000.

作者刘卫国胡勇刚

机构地区中南大学信息科学与工程学院

出处《中南大学学报（自然科学版）》 EI CAS CSCD 北大核心 2011年第12期3765-3771,共7页 Journal of Central South University:Science and Technology

基金国家自然科学基金资助项目(61073187)

关键词入侵检测模式匹配 WU-MANBER算法双哈希查找 intrusion detection pattern matching Wu-Manber algorithm double Hash searching

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1Mott DM, Kida Y, Nyomba BL. Human skeletal muscle, type-1 protein phosphatase and insulin resistance. Adv Prot Phosph,1993,7:413-427.
2Hansen L, Hansen T, Vestergaard H, et al. Widespread amino acid polymorphism at codon 905 of the glycogen-associated regulatory subunit of protein phosphatase-1 is associated with insulin resistance and hypersecretion of insulin. Hum Mol Genet, 1995,4:131
3Xia J, Scherer SW, Cohen PT, et al. A common variant in PPP1R3 associated with insulin resistance and type 2 diabetes.Diabetes , 1998,47:1519-1524.
4Hansen L, Reneland R, Berglund L, et al. Polymorphism in the glycogen-associated regulatory subunit of type 1 protein phosphatase(PPP1R3) gene and insulin sensitivity. Diabetes, 2000,49:298-301.
5Shen GQ, Ikegami H, Fujisawa T et al. Asp905Tyr polymorphism of the gene for the skeletal muscle-specific glycogen-targeting subunit of protein phosphatase 1 in NIDDM. Diabetes Care,1998,21 : 1086-1089.
6YANG Dong-hong,XU Ke.An improved Wu-Manber multiplepatterns matching algorithm[C]//The 25th IEEE InternationalPerformance,Computing,and Communications Conference.Phoenix,USA,2006:675-680.
7Sunday D M.A very fast substring search algorithm[J].Communications of the ACM,1990,33(8):132-142.
8Choi Y H,Jung M Y,Seo S W.L+1-MWM:A fast patternmatching algorithm for high-speed packet filtering[C]//2008Proceedings IEEE INFOCOM.Phoenix,USA,2008:261-265.
9吴冰,云晓春,高琪.基于网络的恶意代码检测技术[J].通信学报,2007,28(11):87-91. 被引量：8
10ZHANG Bao-jun,CHEN Xiao-ping,PING Ling-di.Addressfiltering based Wu-Manber multiple patterns matchingalgorithm[C]//Proceedings of the 2009 Second InternationalWorkshop on Computer Science and Engineering(WCSE 2009).Qingdao,China,2009:408-412.

二级参考文献18

1颜松远.2500年研究探寻相亲数(英文)[J].数学进展,2004,33(4):385-400. 被引量：24
2代六玲,黄河燕,陈肇雄.一种改进的多模式串匹配算法[J].模式识别与人工智能,2006,19(1):47-51. 被引量：4
3GILDER G. Telecosm: How Infinite Bandwidth Will Revolutionize Our World[M]. The Free Press, New York, 2000
4WEI S G, MIRKOVIC J. A realistic simulation of Internet-scale events[A]. Proceedings of the 1st Tnternafional Conference on Performance Evaluation Methodolgies and Tools Valuetools[C]. Italy, 2006.
5AHO A V, CORASICK M J. Efficient string matching: an aid to bibliographic search[A]. Communications of the ACM 18[C]. 1975. 333- 340.
6BOYER R S, MOORE J S. A fast string searching algorithm[A]. Communications of the ACM 20[C]. 1977. 762-772.
7WU S, MANBER U. A Fast Algorithm For Multi-Pattern Searching[R]. Technical Report TR 94-17, University of Arizona at Tuscon, 1994.
8YANG D H, XU K, CUI Y. An improved wu-manber multiple patterns matching algorithm[A]. Performance, Computing, and Communications Conference[C]. 2006.
9Fisk M,Varghese G.An analysis of fast string matching applied to content-based forwarding and intrusion detection,CS2001-0670[R]. California,San Diego,2002.
10Wu S,Mankr U.A fast algorithm for multi-pattern searching,TR- 94-17[R].Department of Computer Science,University of Arizona, 1994.

共引文献24

1陈明卫,杨明功,王长江,王佑民,徐希平,刘树琴,章秋,孙海燕.骨骼肌特异糖原靶向调节亚单位基因Asp905Tyr多态性与2型糖尿病相关性研究[J].中华糖尿病杂志（1006-6187）,2004,12(5):362-363. 被引量：2
2乐茂华.奇素数方幂中的孤立数[J].湖北民族学院学报（自然科学版）,2008,26(4):361-363. 被引量：4
3高宇,莫有权,李庆荣,李祥和.基于分布式结构的网络恶意代码智能分析系统[J].计算机应用与软件,2010,27(5):121-124. 被引量：2
4叶清,吴晓平,程晋.基于规则优化与排序的恶意代码匹配检测[J].海军工程大学学报,2010,22(4):102-106. 被引量：2
5古媛.关于孤立数的一个公开问题[J].数学杂志,2010,30(5):948-950. 被引量：3
6管训贵.关于奇素数方幂中的孤立数[J].四川理工学院学报（自然科学版）,2010,23(5):537-538.
7乐茂华.广义Mersenne数中的奇完全数[J].吉首大学学报（自然科学版）,2010,31(5):5-7. 被引量：1
8苗甫,王振兴,张连成.基于流量统计指纹的恶意代码检测模型[J].计算机工程,2011,37(18):131-133. 被引量：3
9马小成,张四保.梅森数的素因子个数的估计[J].西南民族大学学报（自然科学版）,2012,38(1):34-36.
10管训贵.形如1/3（2p＋1）的孤立数[J].数学的实践与认识,2012,24(13):214-217. 被引量：9

同被引文献64

1陈军,李志林,蒋捷,赵仁亮.基础地理数据库的持续更新问题[J].地理信息世界,2004,2(5):1-5. 被引量：159
2秦浩伟,步丰林.一个中文新词识别特征的研究[J].计算机工程,2004,30(B12):369-370. 被引量：13
3王若梅,张绮雯,周凡.一种新的多模式快速匹配算法[J].中山大学学报（自然科学版）,2005,44(A02):107-110. 被引量：3
4孙晓山,王强,关毅,王晓龙.一种改进的Wu-Manber多模式匹配算法及应用[J].中文信息学报,2006,20(2):47-52. 被引量：10
5杨东红,徐恪,崔勇.改进的Wu-Manber多模式串匹配算法[J].清华大学学报（自然科学版）,2006,46(4):555-558. 被引量：13
6李伟男,鄂跃鹏,葛敬国,钱华林.多模式匹配算法及硬件实现[J].软件学报,2006,17(12):2403-2415. 被引量：42
7巫喜红,凌捷.BM模式匹配算法剖析[J].计算机工程与设计,2007,28(1):29-31. 被引量：19
8袁世忠,曹旻,王燕燕.基于WM算法的多模式匹配改进算法WMN[J].计算机工程与应用,2007,43(15):128-130. 被引量：6
9欧嵬,吴纯青.几种字符串匹配算法的分析和比较[J].微处理机,2007,28(4):59-61. 被引量：7
10Kanniya Raja N, Arulanandam K, Raja Rajeswari B, et al. Centralized Parallel form of Pattern Matching Algorithm in Packet Inspection by Efficient Utilization of Secondary Memory in Network Processor[ J]. International Journal of Computer Applications , 2012,40 ( 5 ).

引证文献8

1陆琳琳,田野.基于确定有限状态自动机的改进多模式匹配算法研究[J].计算机应用与软件,2013,30(7):321-323. 被引量：9
2褚衍杰,李云照,魏强.一种改进的多模式匹配算法[J].西安电子科技大学学报,2014,41(6):174-180. 被引量：6
3王一霈,石春,戴上静,吴刚.一种改进的针对中文编码的Wu-Manber多模式匹配算法[J].小型微型计算机系统,2015,36(4):778-781. 被引量：4
4朱永强,秦志光.一种基于编码关联的快速多模式匹配算法[J].计算机科学,2016,43(2):26-30.
5钟远军,李自,雷丽珍,朱晓强.基于字符匹配算法组合的地理空间敏感属性检测系统[J].测绘与空间地理信息,2016,39(5):116-118.
6陶曌,杨建波,张波,张丽云.面向比特流的分组快速搜索匹配算法[J].计算机工程,2017,34(6):125-128. 被引量：1
7赵国锋,叶飞,姚永安,赵岩.一种面向云中心网络入侵检测的多模式匹配算法[J].信息网络安全,2018,0(1):52-57. 被引量：6
8周延森,张维刚.一种WM多模匹配算法的研究与改进[J].计算机应用与软件,2021,38(7):251-257. 被引量：2

二级引证文献28

1唐湘滟,程杰仁,殷建平,龚德良.基于NP模式的报文检测方法[J].计算机工程与科学,2014,36(11):2128-2131. 被引量：1
2赵旭,王伟,陈亮.网络入侵检测系统规则链表的优化研究[J].计算机工程与应用,2015,51(20):91-96. 被引量：3
3刘春晖,黄宇,宋琦.一种改进的AC多模式匹配算法[J].计算机工程,2015,41(10):280-285. 被引量：8
4牛欢,卢选民.面向比特流的未知短波协议识别技术[J].计算机系统应用,2016,25(3):142-146.
5胡朝举,石倩.基于Snort入侵检测的后缀搜索算法的研究与改进[J].网络安全技术与应用,2016(8):38-39.
6薛朋强,努尔布力,吾守尔.斯拉木.基于网络文本信息的敏感信息过滤算法[J].计算机工程与设计,2016,37(9):2447-2452. 被引量：32
7史礼婷,张骞,钟永恒,胡思思,李贞贞.双向模式匹配在年鉴数据预处理平台中的应用[J].现代图书情报技术,2016(9):88-94. 被引量：2
8朱贺军,祝烈煌.大数据环境下网络服务客户定位仿真研究[J].计算机仿真,2016,33(11):328-332. 被引量：1
9麦涛涛,潘晓中,王亚奇,苏阳.基于预定义类的紧凑型正则表达式匹配算法[J].计算机应用,2017,37(2):397-401. 被引量：7
10赵刚,姚兴仁.基于用户画像的异常行为检测模型[J].信息网络安全,2017(7):18-24. 被引量：27

1於文刚,于春玲.分布式哈希查找模型的研究[J].电脑编程技巧与维护,2010(4):3-4.
2潘楠,王勇,陶晓玲.一种基于SNMP的链路层拓扑发现算法[J].计算机工程,2012,38(2):103-105. 被引量：6
3廉佐政,王海珍.基于对等网络的分布式哈希查找机制的研究[J].齐齐哈尔大学学报（自然科学版）,2006,22(1):53-55.
4赵友桥,王坚,路松峰,胥永康.一种后缀数组与滑动窗口结合的压缩算法[J].计算机工程与应用,2012,48(15):59-62. 被引量：2
5罗慧,吴国新.P2P技术及其资源发现与定位[J].计算机与信息技术,2005(12):58-60. 被引量：4
6肖丽.哈希查找中散列函数的运用[J].技术与市场,2009,16(8):18-19. 被引量：3
7闫子骥,安计勇.一种高效的IP定位方法[J].安徽电子信息职业技术学院学报,2014,13(3):10-14. 被引量：1
8刘芳芳.近邻匹配算法实现中文分词[J].决策与信息（下旬）,2013(4):242-243.
9姚兴山.基于哈希算法的中文分词算法的改进[J].图书情报工作,2008,52(6):60-62. 被引量：6
10吴静,倪宏,邓浩江,刘建,孙鹏.嵌入式Flash播放器的优化策略[J].微计算机信息,2010,26(29):52-54.

中南大学学报（自然科学版）

2011年第12期

浏览历史

内容加载中请稍等...

DHSWM:一种改进的WM多模式匹配算法被引量：8

参考文献15

二级参考文献18

共引文献24

同被引文献64

引证文献8

二级引证文献28

相关作者

相关机构

相关主题

浏览历史

DHSWM:一种改进的WM多模式匹配算法 被引量：8

参考文献15

二级参考文献18

共引文献24

同被引文献64

引证文献8

二级引证文献28

相关作者

相关机构

相关主题

浏览历史

DHSWM:一种改进的WM多模式匹配算法被引量：8