期刊文献+

带有通配符和长度约束的模式匹配问题求解模型 被引量:1

Models for Pattern Matching with Wildcards and Length Constraints
下载PDF
导出
摘要 讨论了带有通配符和长度约束的模式匹配(PMWL)问题,其中模式由子模式序列集组成,两个相邻子模式的间隔在一定长度范围内。针对PMWL问题,已有工作包括设计启发式求解算法和对特殊情况进行完备性分析,然而还需要构建问题的基础求解模型。借鉴约束可满足问题框架,构建了由变量、值域和约束组成的三元组求解模型,对PMWL问题的基本概念和基本性质给出了形式化描述。最后,给出了算法求解PMWL问题的特定条件下的完备解。 For the problem of pattern matching with wildcards and length constraints (PMWL), the patterns are com- posed of a sequence of sub-patterns,where any two adjacent sub-patterns with flexible gaps are in a specified range of the text. Existing work includes a heuristic strategy and its completeness analysis with constraints, but the PMWL problem still needs systematic studies. We drew on the experience of constraint satisfaction problems(CSPs) and set up a 3-tuple model consisting of variables, domains and constraints. We then derived formal descriptions for the basic con- cepts and properties. Also, a tree-based matching algorithm was presented to solve the PMWL problem under certain conditions.
出处 《计算机科学》 CSCD 北大核心 2016年第4期279-283,F0003,共6页 Computer Science
基金 国家自然科学基金项目(31100956 61173117)资助
关键词 长度约束 通配符 求解模型 模式匹配 Length constraints, Wildcards, Matcing model, Pattern matching
  • 相关文献

参考文献19

  • 1Fischer M J,Paterson M S.String matching and other products[R].Cambridge,MA:Massachusetts Institute of Technology ,1974.
  • 2Manber U,Baeza-Yates R.An Algorithm for String Matching with a Sequence of Don’t cares[J].Information Processing Letters,1991,37(3):133-136.
  • 3Califf M E,Mooney R J.Bottom-up Relational Learning of Pattern Matching Rules for Information Extraction[J].Journal of Machine Learning Research,2003,4(6):177-210.
  • 4Guo Dan,Hu Xue-gang,Xie Fei,et al.Pattern Matching withWildcards and Gap-length Constraints Based on a Centrality-degree Graph[J].Applied Intelligence,2013,39:57-74.
  • 5Dahiya A,Garg D.Maximal pattern matching with flexible wildcard gaps and one-off constraint[C]∥International Conference on Advances in Computing,Communications and Informatics.Arras,France,2014:1107-1112.
  • 6吴信东,强继朋,谢飞.Pattern Matching with Flexible Wildcards[J].Journal of Computer Science & Technology,2014,29(5):740-750. 被引量:2
  • 7项泰宁,郭丹,王海平,胡学钢.带通配符的模式匹配问题及其解空间特征分析[J].计算机科学,2014,41(9):269-273. 被引量:1
  • 8Chen Gong,Wu Xin-dong,Zhu Xing-quan,et al.Efficient string matching with wildcards and length constraints[J].Knowledge and Information Systems,2006,10(4):399-419.
  • 9强继朋,谢飞,高隽,胡学钢,吴信东.带任意长度通配符的模式匹配[J].自动化学报,2014,40(11):2499-2511. 被引量:5
  • 10Min Fan,Wu Xin-dong,Lu Zhen-yu.Pattern matching with independent wildcard gaps[C]∥Proceedings of the 8th IEEE International Conference on Dependable,Autonomic and Secure Computing.2009:194-199.

二级参考文献112

  • 1潘云鹤,王金龙,徐从富.数据流频繁模式挖掘研究进展[J].自动化学报,2006,32(4):594-602. 被引量:34
  • 2Fischer M J, Paterson M S. String Matching and Other Products// Karp R M, ed. Complexity of Computation. Cambridge, USA. MIT Press, 1974:113-125.
  • 3Zhang Minghua, Kao B, Cheung D W, et al. Mining Periodic Pat- terns with Gap Requirement from Sequences // Proc of the ACM SIGMOD International Conference on Special Interest Group on Man- agement of Data. Baltimore, USA, 2005 : 623-633.
  • 4Cole J R, Chai B, Marsh T L, et al. The Ribosomal Database Pro- ject(RDP-II) : Sequences and Tools for High-Throughput rRNA Analysis. Nucleic Acids Research, 2005, 33 ( zl ) : 294-296.
  • 5He Dan, Wu Xindong, Zhu Xingquan. SAIL-APPROX: An Effi- cient On-line Algorithm for Approximate Pattern Matching with Wildcards and Length Constraints// Proc of the IEEE International Conference on Bioinformatics and Biomedicine. Silicon Valley, USA, 2007:151-158.
  • 6Xie Fei, Wu Xindong, Hu Xuegang, et al. Sequential Pattern Min- ing with Wildcards// Proc of the 22nd IEEE International Confer-ence on Tools with Artificial Intelligence. Arras, France, 2010: 241-247.
  • 7Ji Xiaonan, Bailey J, Dong Guozhu. Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints. Knowledge and Infor- mation Systems, 2007, 11 (3) : 259-286.
  • 8He Yu, Wu Xindong, Zhu Xingquan, et al. Mining Frequent Pat- terns with Wildcards from Biological Sequences//Proc of the IEEE International Conference on Information Reuse and Integration. Las Vegas, USA, 2007:329-334.
  • 9Califf M E, Mooney R J. Bottom-up Relational Leaming of Pattern Matching Rules for Information Extraction. Journal of Machine Learning Research, 2003, 4(6) : 177-210.
  • 10Manber U, Baeza-Yates R. An Algorithm for String Matching with a Sequence of Don' t Cares. Information Processing Letters, 1991, 37 (3) : 133-136.

共引文献20

同被引文献3

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部