摘要
针对审计系统中搜索大量审计数据的需要,设计了一种基于哈希表机制的多关键字匹配算法.该算法把关键字集合储存到哈希表中,并为关键字集合建立了两个过滤表和一个关键字长度类型表.在查找过程中,对未经过滤表验证的字符串不再进行匹配查找,同时,关键字长度类型表的使用减少了循环的次数.测试结果表明,该算法在速度和精度上都优于BM和mgrep算法.
For the need of searching a large amount of audit data in audit system, a multi-keyword matching algorithm based on Hash table mechanism is designed. This algorithm saves the keyword set to a Hash table, and creates two filtration tables and a keyword-length-type table for the keyword set. In the course of search, the strings that fail to pass the filtration tables needn't be used for matching search, and meanwhile the use of the keyword-length-type table reduces the times of loop. The result of test shows that the speed and precision of this algorithm are both better than BM and mgrep.
出处
《郑州大学学报(理学版)》
CAS
2007年第2期145-148,共4页
Journal of Zhengzhou University:Natural Science Edition
基金
国家自然科学基金资助项目
编号90104035
关键词
信息审计
哈希表
关键字
information audit
Hash table
keyword