期刊文献+

基于广义后缀树的最长重复子模式算法

Algorithm of Finding Maximal Repetitive Pattern Based on General Suffix Tree
下载PDF
导出
摘要 最长重复子串问题是字符串处理中的一个经典问题,是许多应用的基础。但有些时候人们不只关心相等的子串对,还要查找具有某种其他关系的子串对。例如在DNA序列中通常关心字符串和它的补串。这种联系可以看成是一个字符串经过某种置换后与另一个字符串相等。因此本文定义了单一置换下的最长重复子模式和最长重复子模式两个问题,提出了基于广义后缀树的算法来解决这两个问题,并在理论上分析了它们的时间复杂性和空间复杂性。 Finding the maximal repeat is a classic problem in the field of string processing. In some application, we are concerned not only with the substrings which are equal, but also with the substring which is equal to another substring under some permutation. For example, in DNA sequence, we are usually concerned with the subsequence and its complement. In this paper, we present the problem of finding maximal repetitive pattern. We make a precise definition of the problem of finding maximal repetitive pattern, also with the problem of finding maximal repetitive pattern under a permutation. We propose the algorithms based on general suffix tree to solve them and make the analysis of their time complexity and space complexity.
作者 柳渤 李建中
机构地区 哈尔滨工业大学
出处 《航天控制》 CSCD 北大核心 2008年第2期74-78,共5页 Aerospace Control
关键词 最长重复子模式 后缀树 置换 Maximal repetitive pattern Suffix tree Permutation
  • 相关文献

参考文献7

  • 1McConkey E. Human Genetics: the Molecular Revolution [ M]. Jones and Bartleett, Boston, MA, 1993. 被引量:1
  • 2Doolittle R F. Redundancies in Protein Sequences[ M]//Prediction of Protein Structure and the Principles of Protein Conformation, ed. G. Fasman. Plenum, New York, 1989:599 - 624. 被引量:1
  • 3Agarwal P,States D. The Repeat Patterm Toolkit (RPT) : Analyzing the Structure and Evolution of the C. Elegeans Genome [ C ]// Proceedings of the Second International Conference of Intelligent Systems for Molecular Biology, 1994:1 -9. 被引量:1
  • 4Ukkonen E. On-line Construction of Suffix-trees[ J]. Algorithmica, 1995, 14 : 249 - 260. 被引量:1
  • 5Weiner P. Linear Pattern Matching Algorithms [ C ]//Proc. of the 14th IEEE Symp on Switching and Automata Theory, 1973, 1-11. 被引量:1
  • 6Hui L. Color Set Size Problem with Applications to String Matching[ C ]//Conf. 3rd Syrup on Combinatorial Pattern Matching, Springer LNCS, 1992, 644:227 - 240. 被引量:1
  • 7Dan G. Algorithms on Strings,Trees and Sequences: Computer Science and Computational Biology[ M]. Cambridge University Press, 1997. 被引量:1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部