期刊文献+

改进的带可变长度通配符的近似模式串匹配算法

Improved approximate pattern matching algorithm with variable length wildcards
下载PDF
导出
摘要 针对处理可变长度通配符的近似模式串匹配传统算法结果质量不高、易丢解等问题,提出1种启发式的文本-模式倒置算法。基于动态规划思想采用文本-模式倒置策略,搜索得到符合匹配条件子串的开始位置并划分候选集。通过获取初始解、集合划分及优化组合2个过程,筛选出匹配子串的最优解。与同类动态规划(DP)和Sail-Approx算法进行实验对比,结果表明该文算法解的平均增长率为21.9%。 A heuristic text-pattern reversion algorithm is proposed for the low result quality and losing solution problems of traditional approximate pattern matching algorithms for variable length wildcards. The starting position of the substrings meeting the matching condition is searched and the candidate sets are partitioned based on dynamic programming and text-pattern reversion. The optimal solution of matching substrings is screened by obtaining the initial solution, dividing and assembling the sets optimally. Compared with the similar dynamic programming ( DP) and Sail-Approx algorithms, the experimental results show that the average growth rate of the solution of this algorithm is improved by 21.9%.
作者 汪浩 王驰
出处 《南京理工大学学报》 EI CAS CSCD 北大核心 2016年第6期687-693,共7页 Journal of Nanjing University of Science and Technology
基金 国家自然科学基金(61229031)
关键词 可变长度通配符 近似模式串匹配 动态规划 文本-模式倒置 variable length wildcards approximate pattern matching dynamic programming text - pattern reversion
  • 相关文献

参考文献2

二级参考文献21

  • 1范立新.改进的中文近似字符串匹配算法[J].计算机工程与应用,2006,42(34):172-174. 被引量:8
  • 2[1]Lavenier,J LPacherie.Parallel processing for scanning genomic databases.PARCO'97,Bonn,Germany,1997
  • 3[2]T K Yap,O Frieder,R L Martino.Parallel computation in biological sequence analysis.IEEE Trans on Parallel and Distributed Systems,1998,9(3):283-293
  • 4[3]la Correa Tavares dos,Rubem Mondaini.Using parallel algorithms for searching molecular sequence databases.In:Proc of the Computational Systems Bioinformatics Conf,Piscataway,NJ:IEEE Press,2005
  • 5[4]L Cheng,D Cheung,S -M Yiu.Approximate string matching in DNA sequences.The 8th Int'l Conf on Database Systems for Advanced Applications,Kyoto,Japan,2003
  • 6[5]P D Michailidis,K G Margaritis.Performance evaluation of load balancing strategies for approximate string matching application on an MPI cluster of heterogeneous workstations.Future Generation Computer Systems,2003,19(7):1075-1104
  • 7[7]O Beaumont,H Casanova,A Legrand,et al.Scheduling divisible loads on star and tree networks:Results and open problems.IEEE Trans on Parallel and Distributed Systems,2005,16(3):207-218
  • 8[8]V Bharadwaj,D Ghose,V Mani.Multi-installment load distribution in tree networks with delays.IEEE Trans on Aerospace and Electronic Systems,1995,31(2):555-567
  • 9[9]P Wolniewicz.Divisible job scheduling with limited memory:[Ph D dissertation].Pozna:Pozna University of Technology,2003
  • 10CHANG Y I,CHEN J R,HSU M T. A Hash Trie filter method for approximate string matching in genomic databases[J].Applied Intelligence,2010,(01):21-38.

共引文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部