期刊文献+

满足度量性质的归一化树编辑距离 被引量:2

Metric Normalized Tree Edit Distance
下载PDF
导出
摘要 利用树大小和树编辑距离的简单函数提出了一种归一化树编辑距离,在权重函数具有度量性质且所有插入和删除操作的权重都相等时,不仅能完全满足三角不等式,而且是一种取值在[0,1]的度量.这种距离可以由树编辑距离直接计算得到,其计算时间复杂度与树编辑距离相同.通过手写数字识别实验说明,AESA算法利用该距离获得的识别率为91.6%,比其他2种归一化树编辑距离分别高0.2%和0.8%. Using a simple function of the sizes of two trees and the edit distance between them,this paper presents a normalized tree edit distance,which not only satisfies the triangle inequality,but also is a genuine metric valued in under the condition that the weight function is a metric over the set of elementary edit operations with all costs of insertions/deletions of the same weight.Moreover,the distance can be directly computed through tree edit distances and it has the same computing time complexity with the tree edit distance.Handwritten digits recognition experiments show that the metric normalization can reach 91.6%,which is 0.2% and 0.8% better than that of the other two existing methods respectively when the AESA is used.
出处 《北京工业大学学报》 EI CAS CSCD 北大核心 2011年第4期576-582,共7页 Journal of Beijing University of Technology
基金 国家自然科学基金资助项目(60775010) 北京市自然科学基金资助项目(4112009) 北京工业大学高层次人才培养资助项目 北京市属市管高等学校'中青年骨干教师培养计划'资助项目PHR(IHLB)
关键词 度量 树编辑距离 三角不等式 逼近排除算法 metric tree edit distance triangle inequality approximating and eliminating search algorithm(AESA)
  • 相关文献

参考文献17

  • 1ZHANG K, SHASHA D, WANG J T L. Approximate tree matching in the presence of variable length don't cares [ J]. J Algorithms, 1994, 16( 1 ) : 33-66. 被引量:1
  • 2TAI Kuo-chung. The tree-to-tree correction problem[J]. J ACM, 1979, 26: 422-433. 被引量:1
  • 3ZHANG K, SHASHA D. Simple fast algorithms for the editing distance between trees and related problems [ J]. SIAM J Comput, 1989, 18: 1245-1262. 被引量:1
  • 4KILPEL~INEN P, MANNILA H. Ordered and unordered tree inclusion[ J]. SIAM J Comput, 1995, 24: 340-356. 被引量:1
  • 5KLEIN P, TIRTHAPURA S, SHARVIT D, et al. A tree-edit-distance algorithm for comparing simple, closed shapes [ J]. Proceedings of 1 lth Annuat ACM-SIAM Symp on Discrete Algorithms (SODA). New York: ACM & SIAM, 2000: 696-704. 被引量:1
  • 6HOFFMANNC M, O'DONELL M J. Pattern matching in trees[ J]. J ACM, 1982, 29(1) : 68-95. 被引量:1
  • 7RAMESH R, RAMAKRISHNAN I V. Nonlinear pattern matching in trees[J]. J ACM, 1992, 39(2) : 295-316. 被引量:1
  • 8BILLE P. A survey on tree edit distance and related problems[ J]. Theoretical Computer Science, 2005, 337: 217-239. 被引量:1
  • 9LEVENSHTEIN A. Binary codes capable of correcting deletions, insertions and reversals[ J ]. Soviet Physics Doklady, 1966, 10(8) : 707-710. 被引量:1
  • 10SELLERS P H. On the theory and computation of evolutionary distances[J]. SIAM J Applied Math, 1974, 26(4) : 787- 793. 被引量:1

同被引文献21

  • 1章成志.基于多层特征的字符串相似度计算模型[J].情报学报,2005,24(6):696-701. 被引量:39
  • 2乔少杰 唐常杰 陈瑜等.基于树编辑距离的层次聚类算法.计算机科学与探索,2007,1(3):282-292. 被引量:5
  • 3CRESCENZI V, MECCA G, MERIALDO P. RoadRunner: Towards automatic data extraction from large Web sites[ C]// Proceedings of the 27th Very Large Data Base Endowment Conference. San Fran- cisco: Morgan Kaufmann Publishers Inc, 2001 : 109 - 118. 被引量:1
  • 4CHANG CHIA-HUI, LUI SHAO-CHEN. IEPAD: information ex- traction based on pattern discovery[ C]// Proceedings of the 10th International Conference on World Wide Web. New York: ACM, 2001:681 -688. 被引量:1
  • 5LIU BING, GROSSMAN R L, ZHAI YANHONG. Mining data re- cords in Web pages[ C]//Proceedings of the 9th ACM SIGKDD In- ternational Conference on Knowledge Discovery and Data Mining. New York: ACM, 2003:601 -606. 被引量:1
  • 6RAID H X.窜和序列处理2--字符串编辑距离算法[EB/OL].[2011-11-20].http://hxraid.iteye.com/blog/615469. 被引量:1
  • 7Levenshtein V L. Binary codes capable of correcting deletions, insertions and reversals [J] . Doklady Akademi Nauk SSSR, 1966, 163 (4).. 707-710. 被引量:1
  • 8Monge A E, Elkan C P. The field matching problem: Algo- rithm and applications [EB/OL]. (2008-06-16) [2014]. http: //www. eecs. csulb, edu/ monger/Papers/ kdd96, ps. 被引量:1
  • 9Lipsky O, Porat E. Approximate matching in the I. metric [J]. Information Processing Letters, 2008, 105 (4): 135-140. 被引量:1
  • 10Michailidis P D, Margaritis K G. Processor array architectures for flexible approximate string matching [J]. Journal of Systems Architecture, 2008, 54 (1-2): 35-55. 被引量:1

引证文献2

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部