期刊文献+

模式特征对带有通配符和长度约束的模式匹配问题的影响 被引量:8

Impact of Pattern Feature on Pattern Matching Problem with Wildcards and Length Constraints
原文传递
导出
摘要 带有通配符的模式匹配问题(PMWL)模式定义的灵活性给用户提供方便,却也造成求解上的困难.目前没有任何多项式算法能得到该问题的完备解,同时也缺少足够的完备性分析.文中认为模式特征是影响PMWL完备性的关键因素,并提出模式重复度的概念,记为rep.证明在rep=0的限定条件下PMWL的完备性,同时分析rep>0时PMWL不完备的原因.实验以近似比为指标,说明rep对PMWL完备性的影响. Pattern matching with wildcards and length constraints (PMWL) provides more convenience to users since its flexibility in definition which also leads to difficulties in solving problem. Currently, to our knowledge, no polynomial algorithms obtain the complete solution of this problem, and the analysis for completeness is far from sufficient. In this paper, the pattern feature is provedto be the key factor for the completeness of PMWL and a concept, denoted as rep, is provided which measures the repetitions in the pattern. The completeness of PMWL is proved under a certain condition when rep = 0. And the reason of incompleteness under the condition of rep〉0 is also explained clearly. In the experiments, approximation ratio is utilized as a measurement to demonstrate the impact of rep on the PMWL problem.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2012年第6期1013-1021,共9页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.60828005 60975034 61273292) 中央高校基本科研业务费专项资金项目(No.2011HGZY0003)资助
关键词 模式特征 完备性 通配符 模式匹配 Pattern Feature, Completeness, Wildcard, Pattern Matching
  • 相关文献

参考文献14

  • 1Fischer M J, Paterson M S. String Matching and Other Products// Karp R M, ed. Complexity of Computation. Cambridge, USA. MIT Press, 1974:113-125. 被引量:1
  • 2Zhang Minghua, Kao B, Cheung D W, et al. Mining Periodic Pat- terns with Gap Requirement from Sequences // Proc of the ACM SIGMOD International Conference on Special Interest Group on Man- agement of Data. Baltimore, USA, 2005 : 623-633. 被引量:1
  • 3Cole J R, Chai B, Marsh T L, et al. The Ribosomal Database Pro- ject(RDP-II) : Sequences and Tools for High-Throughput rRNA Analysis. Nucleic Acids Research, 2005, 33 ( zl ) : 294-296. 被引量:1
  • 4He Dan, Wu Xindong, Zhu Xingquan. SAIL-APPROX: An Effi- cient On-line Algorithm for Approximate Pattern Matching with Wildcards and Length Constraints// Proc of the IEEE International Conference on Bioinformatics and Biomedicine. Silicon Valley, USA, 2007:151-158. 被引量:1
  • 5Xie Fei, Wu Xindong, Hu Xuegang, et al. Sequential Pattern Min- ing with Wildcards// Proc of the 22nd IEEE International Confer-ence on Tools with Artificial Intelligence. Arras, France, 2010: 241-247. 被引量:1
  • 6Ji Xiaonan, Bailey J, Dong Guozhu. Mining Minimal Distinguishing Subsequence Patterns with Gap Constraints. Knowledge and Infor- mation Systems, 2007, 11 (3) : 259-286. 被引量:1
  • 7He Yu, Wu Xindong, Zhu Xingquan, et al. Mining Frequent Pat- terns with Wildcards from Biological Sequences//Proc of the IEEE International Conference on Information Reuse and Integration. Las Vegas, USA, 2007:329-334. 被引量:1
  • 8Califf M E, Mooney R J. Bottom-up Relational Leaming of Pattern Matching Rules for Information Extraction. Journal of Machine Learning Research, 2003, 4(6) : 177-210. 被引量:1
  • 9Manber U, Baeza-Yates R. An Algorithm for String Matching with a Sequence of Don' t Cares. Information Processing Letters, 1991, 37 (3) : 133-136. 被引量:1
  • 10Min Fan, Wu Xindong, Lu Zhenyu. Pattern Matching with Inde- pendent Wildcard Gaps//Proc of the 8th IEEE International Con- ference on Dependable, Autonomic and Secure Computing. Cheng- du, China, 2009:194-199. 被引量:1

二级参考文献14

  • 1Lunteren J V. High-performance pattern-matching for intrusion detection//Proceedings of the 25th IEEE International Conference on Computer Communications ( INFOCOM 2006). Barcelona, Spain, 2006:1-13. 被引量:1
  • 2Califf M E, Mooney R J. Bottom up relational learning of pattern matching rules for information extraction. Journal of Machine Learning Research, 2003, 4(6): 177-210. 被引量:1
  • 3Cole R, Gottlieb L A, Lewenstein M. Dictionary matching and indexing with errors and don't cares//Proceedings of the 36th ACM Symposium on the Theory of Computing. New York, USA, 2004:91-100. 被引量:1
  • 4Cole J R, Chai B, Farris R J, Wang Q, Kulam S A, McGartell D M, Garrity G M, Tiedje J M. The ribosomal database project (RDP-II) : Sequences and tools for high-throughput rRNA analysis. Nucleic Acids Research, 2005, 33(Sup. 1): 294-296. 被引量:1
  • 5Zhang M, Kao B, Cheung D, Yip K. Mining periodic patterns with gap requirement from sequences//Proceedings of the ACM SIGMOD International Conference on Management of Data. Maryland, USA, 2005:623-633. 被引量:1
  • 6Han J, Cheng H, Xin D, Yan X. Frequent pattern mining.. Current status and future directions. Data Mining and Knowledge Discovery, 2007, 15(1) : 55-86. 被引量:1
  • 7Ji X, Bailey J, Dong G. Mining minimal distinguishing subsequence patterns with gap constraints. Knowledge and Information Systems, 2007, 11(3): 259- 286. 被引量:1
  • 8He Y, Wu X, Zhu X, Arslan A N. Mining frequent patterns with wildcards from biological sequences//Proceedings of the 2007 IEEE International Conference on Information Reuse and Integration(IRI-07). Las Vegas, USA, 2007: 329-334. 被引量:1
  • 9Fischer M J, Paterson M S. String matching and other products//Proeeedings of the 7th SIAM AMS Complexity of Computation. Cambridge, USA, 1974:113-125. 被引量:1
  • 10Manber U, Baeza-Yates R. An algorithm for string matching with a sequence of don't cares. Information Processing Letters, 1991, 37(3): 133 -136. 被引量:1

共引文献20

同被引文献85

  • 1张治,车皓阳,施鹏飞.模式匹配问题的描述框架与算法模型[J].模式识别与人工智能,2006,19(6):715-721. 被引量:7
  • 2HAAPASALO T, SILVAS'FI P, SIPPU S, et al. Online dictionary matching with variable-length gaps[C]. Proceedings of the 10th SEA, 2011 : 76 - 87. 被引量:1
  • 3BILLE P, G~RTZ I L, VILDH~J H W, et al. String matching with variable length gaps[J]. Theoretical Computer Science, 2012,443:25 - 34. 被引量:1
  • 4TANBEER S K, AHMED C F, JEONG B S, et al. Efficient frequent pattern mining over data streams[C]. Proceedings of the 17th ACM conference on information and knowledge management. ACM, 2008:1447 - 1448. 被引量:1
  • 5FISCHER M J, PATERSON M S. String matching and other products[J]. In complexity of computation, vol. 7, edited by R. M. Karp. Cambridge, MA: Massachusetts Institute of Technology, 1974. 被引量:1
  • 6CHEN G, WU X D, ZHU X Q, et al. Efficient string matching with wildcards and length constraints[J]. Knowledge and Information Systems, 2006,10(4) : 399 - 419. 被引量:1
  • 7GUO D, HU X G, XIE F, et al. Pattern matching with wildcards and gap-length constraints based on a centrality-degree graph. Applied Intelligence, 39 ( 1 ) : 57 - 74,2013. 被引量:1
  • 8NCBI National Center for Bioteehnology Information. < http://www, ncbi. rdm. nih. gov>. 被引量:1
  • 9Navarro G,Raffinot M.Flexible pattern matching in strings:Practical on-line search algorithms for texts and biological sequences[M].Cambridge,UK:Cambridge University Press,2002. 被引量:1
  • 10Han J,Cheng H,Xin D,et al.Frequent pattern mining:CurrentStatus and Future Directions[J].Data Mining and Knowledge Discovery,2007,15(1):55-86. 被引量:1

引证文献8

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部