期刊文献+

图算法求解带有限长空位和one-off约束的模式匹配问题

Graph-Based Algorithm for Pattern Matching with Bounded Length Gaps and One-off Constraint
下载PDF
导出
摘要 讨论带有限长空位和one-off约束条件的模式匹配问题,其中限长空位改变单个匹配解结构,one-off条件约束匹配解之间的关系,从而形成规模较大且稀疏的解空间.借鉴约束可满足性问题框架,将PMGO问题转化为图结构下的路径搜索问题,并证明转化的等价性.然后提出图结构下的剪枝和匹配算法(GPM),根据one-off约束得到节点之间的约束关系,再迭代交互地进行剪枝与搜索.实验中使用匹配解丢失率度量已有启发式算法和GPM的完备性,证明GPM可与已有启发式算法形成互补,有效降低匹配解丢失率. The problem of pattern matching with bounded length gaps and one-off constraint (PMGO) is discussed. The structure of individual occurrences is changed by the bounded gaps, and the relation between occurrences is restricted by the one-off constraint. Thus, a large-scale sparse space of all candidate occurrences is generated. Based on the framework of the constraint satisfaction, the PMGO problem is transformed into path search in a directed acyclic graph (DAG) structure. Meanwhile, the equivalence of transformation is proved. Then, a graph-based pruning and matching (GPM) algorithm is presented. In GPM algorithm, a constraint relationship between vertexes is built under the one-off constraint, and then the path search is combined with a pruning procedure in an alternating and iterative manner. The loss rate of occurrences is used to measure existing heuristic algorithms and the completeness of the proposed GPM algorithm. The experimental results demonstrate that the GPM algorithm provides a complementary method for heuristic algorithms and it efficiently reduces the loss rate of occurrences.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2016年第5期400-409,共10页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.61305062 61273292) 教育部博士点博导基金项目(No.20130111110011) 安徽省自然科学基金项目(No.1308085QF102) 中国博士后科学基金项目(No.2012M511403)资助~~
关键词 模式匹配 one—off约束 通配符跨度 有向无环图 Pattern Matching, One-off Constraint, Wildcard Gap, Directed Acyclic Graph
  • 相关文献

参考文献3

二级参考文献28

  • 1Lunteren J V. High-performance pattern-matching for intrusion detection//Proceedings of the 25th IEEE International Conference on Computer Communications ( INFOCOM 2006). Barcelona, Spain, 2006:1-13.
  • 2Califf M E, Mooney R J. Bottom up relational learning of pattern matching rules for information extraction. Journal of Machine Learning Research, 2003, 4(6): 177-210.
  • 3Cole R, Gottlieb L A, Lewenstein M. Dictionary matching and indexing with errors and don't cares//Proceedings of the 36th ACM Symposium on the Theory of Computing. New York, USA, 2004:91-100.
  • 4Cole J R, Chai B, Farris R J, Wang Q, Kulam S A, McGartell D M, Garrity G M, Tiedje J M. The ribosomal database project (RDP-II) : Sequences and tools for high-throughput rRNA analysis. Nucleic Acids Research, 2005, 33(Sup. 1): 294-296.
  • 5Zhang M, Kao B, Cheung D, Yip K. Mining periodic patterns with gap requirement from sequences//Proceedings of the ACM SIGMOD International Conference on Management of Data. Maryland, USA, 2005:623-633.
  • 6Han J, Cheng H, Xin D, Yan X. Frequent pattern mining.. Current status and future directions. Data Mining and Knowledge Discovery, 2007, 15(1) : 55-86.
  • 7Ji X, Bailey J, Dong G. Mining minimal distinguishing subsequence patterns with gap constraints. Knowledge and Information Systems, 2007, 11(3): 259- 286.
  • 8He Y, Wu X, Zhu X, Arslan A N. Mining frequent patterns with wildcards from biological sequences//Proceedings of the 2007 IEEE International Conference on Information Reuse and Integration(IRI-07). Las Vegas, USA, 2007: 329-334.
  • 9Fischer M J, Paterson M S. String matching and other products//Proeeedings of the 7th SIAM AMS Complexity of Computation. Cambridge, USA, 1974:113-125.
  • 10Manber U, Baeza-Yates R. An algorithm for string matching with a sequence of don't cares. Information Processing Letters, 1991, 37(3): 133 -136.

共引文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部