期刊文献+

一种适合中文的多模式匹配算法 被引量:4

Multiple Pattern Algorithm for Chinese
下载PDF
导出
摘要 中文字符的相互独立性导致AC算法的时空性能急剧下降。针对此问题,对AC算法的存储结构进行了改进,提出了一种适合中文的多模式匹配算法———AC_SC算法。该算法以邻接链表存储有限状态自动机,尝试解决存储空间快速膨胀问题,并将状态"0"的长链表转化为散列链表,以提高算法的匹配效率。实验结果表明,AC_SC算法具有良好的时空性能。 Because of the independent of the Chinese characters, the space and time performances of AC algorithm de- cline sharply. For this problem,the storage structure of AC algorithm was improved and a multi-pattern algorithm for Chinese named AC_SC was proposed. The algorithm uses adjacency-list to store the finite state automata to solve the problem of the storage space rapid expansion. Besides, the long linked list of the state '0' is changed into a Hash linked table to improve the matching efficient. Experimental results show that AC_SC has better time and space performances.
出处 《计算机科学》 CSCD 北大核心 2013年第11期117-121,共5页 Computer Science
基金 安徽省自然科学基金(090412051) 广东省教育部产学研结合项目(2008B0905002400)资助
关键词 多模式匹配 AC算法 邻接链表 有限状态自动机 Multi-pattern matching, AC algorithm, Adjacency-list, Finite state automata
  • 相关文献

参考文献15

  • 1杜大军,费敏锐,宋杨,李雪.网络控制系统的简要回顾及展望[J].仪器仪表学报,2011,32(3):713-720. 被引量:45
  • 2Wheeler D L, Barrett T, Benson D A, et al. Database [J]. Nu- resources of the National Center for Biotechnology Information cleic Acids Research, Database issue, 2007,35 : 5-12.
  • 3Knuth D E, Morris J H, Pratt V R. Fast Patter Matching in Strings[J]. SIAM Journal of Computer, 1977,6(2) .-323-350.
  • 4Boyer R S, Moore J S. A Fast String Searching Algorithm[J]. Communications of the ACM, 1977,20 (10) : 762-772.
  • 5Horspool R N. Practical fast searching in strings [J]. Software- Practice Experience, 1980,10(6) : 501-506.
  • 6Sunday D M. A very fast substring searching algorithm [J]. Communications of the ACM, 1990,33(8) : 132-142.
  • 7Aho A V, Margaret J. Corasick Efficient String Matching[J]. An Aid to Biographic Search Communications of the ACM, 1975,18(6) : 333-340.
  • 8Commentz-Walter B. A String Matching Algorithm Fast on the Average [J]. Proceedings of 6th ICALP, 1979,71:118-132.
  • 9Wu Sun, Manber U. Fast algorithm for multi-pattern searching [R]. Tucson: Department of computer science university of Ari- zona, 1994.
  • 10王永成,沈州,许一震.改进的多模式匹配算法[J].计算机研究与发展,2002,39(1):55-60. 被引量:52

二级参考文献45

共引文献119

同被引文献28

  • 1王婷,彭勇,戴忠华,伊胜伟,韩兰胜.基于SVM-RFE的钓鱼网页检测方法研究[J].华中科技大学学报(自然科学版),2013,41(S2):143-146. 被引量:3
  • 2Commentz-Walter B.A string matching algorithm fast on the average[M]//Automata,Languages and Programming.Berlin:Springer,1979:118-132.
  • 3Wu Sun,Manber U.A fast algorithm for multi-pattern searching,TR94-17[R].Tuscon:University of Arizona,1994.
  • 4Aho A V,Corasick M J.Efficient string matching:an aid to bibliographic search[J].Communications of the ACM ,1975,18(6):333-340.
  • 5Horspool R N.Practical fast searching in strings[J].Software-Practice & Experience,1980,10(6):501-506.
  • 6Boyer R S,Moore J S.A fast string searching algorithm[J].Communications of the ACM,1977,20(10):762-772.
  • 7Norton M.Optimizing pattern matching for instrusion detection[EB/OL].(2006-05-11)[2006-11-09].http://does.idsresearch.org/OptimizingPatternMatching-ForIDS.pdf.
  • 8Navarro G,Raffinot M.Flexible pattern matching in strings[M].[S.l.] :The Press Syndicate of the University of Cambridge,2002.
  • 9Navarro G.Improved approximate pattern matching on hypertext[J].Theoretical Computer Science,2000,237(1):455-463.
  • 10Karp K M,Rabin M O.Efficient randomized patternmatching algorithms[J].IBM Journal of Research and Development,1987,31(2):249-260.

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部