摘要
通过观察不确定有限自动机NFA到确定性有限自动机DFA的转化过程,分析内存增长的原因,提出了一种基于状态间约束关系的正则表达式匹配算法Group2-DFA。Group2-DFA通过两级分组,利用状态间的约束关系,将原始NFA转化为NFA和DFA的混合结构。实验表明,在保持一定处理速率的前提下,Group2-DFA能够有效地减少内存占用。在300条规则下,Group2-DFA吞吐率能够达到1Gbps,并且减少约75%的状态数。
By analysis of state explosion in deterministic finite automata DFA, a novel algorithm Group2-DFA based on state constrains was proposed to reduce the memory usage. With the state constrains, states in NFA were classified into several groups. Group2-DFA introduces two-level classification and merges NFA and DFA together to a hybrid FA construction. The experiments show that G-roup2-DFA can reduce memory usage efficiently and keep high throughput with a small increase of memory reading time. With 300 regex rules, Group2-DFA can cut 75% states and achieve 1Gbps throughput.
出处
《通信学报》
EI
CSCD
北大核心
2013年第10期183-190,共8页
Journal on Communications
基金
国家科技支撑计划基金资助项目(2012BAH02B03
2012BAH02B01)
国家高技术研究发展计划("863"计划)基金资助项目(2011AA01A103
2011AA01A101
2011BAH19B04)~~
关键词
正则表达式
状态爆炸
自动机转化
状态约束
regular expression
state explosion
automata convert
state constrains