基于与或树的正则表达式有害二义性检查算法被引量：2

An AND/OR Tree-Based Algorithm for Checking Pestilent Ambiguity in Regular Expressions

导出

摘要在构造面向应用的正则表达式(RE)过程中,引入有益二义性可简化 RE 构造,而将有害二义性遗留在 RE中会危害匹配结果的正确性.为区别对待这两种二义性,基于与或树提出一种检查和定位 RE 中有害二义性的算法.该算法可减轻 RE 调试的工作量.实验表明,该算法在时间性能、空间性能和实用性等方面优于现有基于自动机的二义性检查算法.基于此算法的可视化 RE 编辑调试环境已用于构建国内第一个整合的生物数据仓库. During the construction of regular expressions （REs） for applications, introducing some beneficial ambiguities may simplify RE construction, while leaving some pestilent ambiguities in the RE will harm the correctness of matching. In order to treat these two categories of ambiguities in different ways, an algorithm based on AND/OR tree that checks and locates the pestilent ambiguities in REs is proposed. The algorithm is helpful to reducing the workload of debugging REs. Experiments show that the algorithm outperforms the present ambiguity-checking algorithm based on automaton not only in time and space behaviors, but also in practicality. A visualized RE editing and debugging environment based on the algorithm has been applied to build the first online integrated biological data warehouse of China.

作者邓绪斌朱扬勇

机构地区浙江财经学院信息学院复旦大学计算机与信息技术系

出处《模式识别与人工智能》 EI CSCD 北大核心 2006年第2期173-178,共6页 Pattern Recognition and Artificial Intelligence

基金国家863高科技研究发展计划项目(No.2002AA231011) 上海市重大科技项目(No.02DJ14013)资助

关键词正则表达式(RE) 匹配二义性与或树 Regular Expression （RE）, Matching, Ambiguity, AND/OR Tree

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献8

1Chan C Y, Garofalakis M, Rastogi R. RE-Tree: An Efficient Index Structure for Regular Expressions. International Journal on Very Large Data Bases, 2003, 12(2): 102-119 被引量：1
2Embley D W, Campbell D M, Smith R D, et al. Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents. In: Proc of the 7th International Conference on Information and Knowledge Management. Bethesda,USA, 1998. 52-59 被引量：1
3Laender A H F, Ribeiro-Neto B, da Silva A S. DEByE: Data Extraction by Example. Data and Knowledge Engineering,2002, 40(2): 121-154 被引量：1
4Wu T H, Pottenger W M. A Semi-Supervised Algorithm for Pattern Discovery in Information Extraction from Textual Data.In: Proc of the 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Seoul, Korea. 2003, 117-123 被引量：1
5Heddad A, Brameier M, MacCallum R M. Evolving Regular Expression-Based Sequence Classifiers for Protein Nuclear Localisation. In: Proc of the 2nd European Workshop on Evolutionary Bioinformatics. Coimbra, Portugal, 2004, 31-40 被引量：1
6Vansummeren S. Unique Pattern Matching in Stringa. 2003.http://arXiv.org/ahs/cs/0302004 被引量：1
7Frisch A, Cardelli L. Greedy Regular Expression Matching. In:Proc of the 31st International Colloquium on Automata, Languages and Programming. Turku, Finland, 2004, 618-629 被引量：1
8Hosoya H. Regular Expression Pattern Matchingt A Simpler Design. 2003. http://arbre.is.s.u-tokyo.ae.jp/-hahosoya/papers/ambig.ps 被引量：1

同被引文献17

1韩江洪,魏振春,张本宏,郑淑丽,胡庆新.总线式车身控制系统的规则化建模方法[J].汽车工程,2006,28(12):1121-1124. 被引量：15
2李升波,王建强,李克强.硬件在环仿真试验台监控系统的设计与开发[J].系统仿真学报,2007,19(16):3684-3687. 被引量：9
3Bille P.A survey on tree edit distance and related problems[J].Theoretical Computer Science,2005,337(1-3):217-239. 被引量：1
4Deng X B.Automatic transformation of HTML pages into relational database[J].Journal of Information and Computational Science,2010,7(2):349-355. 被引量：1
5Gold E M.Language identification in the limit[J].Inform contr,1967,10:447-474. 被引量：1
6Bex G J,Neven F,Schwentick T,et al.Inference of concise DTDs from XML data[C]//VLDB06 Proceedings of the32nd international conference on Very large data bases.USA:VLDB Endowment,2006:115-126. 被引量：1
7Bex G J,Neven F,Vansummeren S.Inferring XML schema definitions from XML data[C]//VLDB07 Proceedings of the32nd international conference on Very large data bases.USA:VLDB Endowment,2007:998-1009. 被引量：1
8Bex G J,Gelade W,Neven F,et al.Learning deterministic regular expressions for the inference of schemas from xml data[C]//WWW08Proceeding of the17th international conference on World Wide Web.New York:ACM Press,2008:825-834. 被引量：1
9Fernau H.Algorithms for learning regular expressions from positive data[J].Information and Computation,2009,207:521-541. 被引量：1
10Brazma A.Learning of regular expressions by pattern matching[C]//EuroCOLT95Proceedings of the Second European Conference on Computational Learning Theory.UK:Springer-Verlag London Ltd,1995:392-403. 被引量：1

引证文献2

1邓绪斌.基于最优树联配的正则表达式学习算法[J].复旦学报（自然科学版）,2011,50(6):797-802. 被引量：3
2翟琰,刘磊,张建军,卫星.车身总线控制系统硬件在环仿真设计[J].合肥工业大学学报（自然科学版）,2012,35(5):621-624. 被引量：1

二级引证文献4

1刘德利,谷原野,节忠海,高洪伟.基于HIL的汽车电气功能测试系统设计[J].汽车电器,2017(1):58-59. 被引量：5
2贾雨,覃杨森,齐禅颖,王爽.组件式自主实时监控平台设计技术研究[J].电子设计工程,2017,25(17):41-45.
3贾雨,王爽,覃杨森,齐禅颖.基于组态的试飞实时监控技术[J].计算机技术与发展,2017,27(9):145-149. 被引量：1
4成勤,肖稳安,王清龙,项建国,于乃莲,陈华.正则表达式在闪电定位资料处理中的应用[J].南京信息工程大学学报（自然科学版）,2019,11(1):121-126. 被引量：1

1韩鸿文.汉字内部码二义性分析[J].中文信息,1989(1):26-29.
2赵永华.将Hyper-V安装到Windows 10[J].网络安全和信息化,2016,0(6):69-69.
3史立.C^(++)多态性设计的二义性问题的探讨[J].微机发展,1997,7(5):12-14.
4于长江.如何避免字段值的二义性[J].电脑编程技巧与维护,2004(4):88-90.
5林颖贤.对C++中继承的二义性之探讨[J].电脑学习,2000(2):31-32. 被引量：2
6张志红,朱晨光.编译原理中的文法及二义性研究[J].河南科技,2009,28(11):34-35.
7范秋生.C++中函数重载的剖析[J].福建电脑,2007,23(1):84-85. 被引量：1
8张志强.C++中自增、自减运算二义性分析[J].山西广播电视大学学报,2008,13(1):43-44.
9蓝雯飞.C++语言对C语言的扩充[J].微计算机应用,2001,22(5):300-304.
10王元航.实现FORTRAN程序调试环境的一个方法[J].安徽师大学报,1993,16(3):70-73.

模式识别与人工智能

2006年第2期

浏览历史

内容加载中请稍等...

基于与或树的正则表达式有害二义性检查算法被引量：2

参考文献8

同被引文献17

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

基于与或树的正则表达式有害二义性检查算法 被引量：2

参考文献8

同被引文献17

引证文献2

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

基于与或树的正则表达式有害二义性检查算法被引量：2