期刊文献+

LZ Complexity Distance of DNA Sequences and Its Application in Phylogenetic Tree Reconstruction 被引量:4

LZ Complexity Distance of DNA Sequences and Its Application in Phylogenetic Tree Reconstruction
原文传递
导出
摘要 DNA sequences can be treated as finite-length symbol strings over a four-letter alphabet (A, C, T, G). As a universal and computable complexity measure, LZ complexity is valid to describe the complexity of DNA sequences. In this study, a concept of conditional LZ complexity between two sequences is proposed according to the principle of LZ complexity measure. An LZ complexity distance metric between two nonnull sequences is defined by utilizing conditional LZ complexity. Based on LZ complexity distance, a phylogenetic tree of 26 species of placental mammals (Eutheria) with three outgroup species was reconstructed from their complete mitochondrial genomes. On the debate that which two of the three main groups of placental mammals, namely Primates, Ferungulates, and Rodents, are more closely related, the phylogenetic tree reconstructed based on LZ complexity distance supports the suggestion that Primates and Ferungulates are more closely related. DNA sequences can be treated as finite-length symbol strings over a four-letter alphabet (A, C, T, G). As a universal and computable complexity measure, LZ complexity is valid to describe the complexity of DNA sequences. In this study, a concept of conditional LZ complexity between two sequences is proposed according to the principle of LZ complexity measure. An LZ complexity distance metric between two nonnull sequences is defined by utilizing conditional LZ complexity. Based on LZ complexity distance, a phylogenetic tree of 26 species of placental mammals (Eutheria) with three outgroup species was reconstructed from their complete mitochondrial genomes. On the debate that which two of the three main groups of placental mammals, namely Primates, Ferungulates, and Rodents, are more closely related, the phylogenetic tree reconstructed based on LZ complexity distance supports the suggestion that Primates and Ferungulates are more closely related.
出处 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2005年第4期206-212,共7页 基因组蛋白质组与生物信息学报(英文版)
基金 supported by the National Natural Science Foundation of China(No.60371046).
关键词 BIOINFORMATICS sequence complexity conditional LZ complexity LZ complexity distance phylogenetic tree reconstruction bioinformatics, sequence complexity, conditional LZ complexity, LZ complexity distance, phylogenetic tree reconstruction
  • 相关文献

参考文献20

  • 1[1]Hao,B.L.and Zhang,S.Y.2002.Handbook of Bioinformatics (second edition).Shanghai Scientific and Technical Publishers,Shanghai,China. 被引量:1
  • 2[2]Nei,M.and Kumar,S.2000.Molecular Evolution and Phylogenetics.Oxford University Press,New York,USA. 被引量:1
  • 3[3]Misener,S.and Krawetz,S.A.(eds.) 2000.Bioinformatics:Methods and Protocols.Humana Press,Totowa,USA. 被引量:1
  • 4[4]Vinga,S.and Almeida,J.2003.Alignment-free sequence comparison-a review.Bioinformatics 19:513-523. 被引量:1
  • 5[5]Li,M.and Vitanyi,P.1997.An Introduction to Kolmogorov Complexity and Its Applications (second edition).Springer-Verlag,New York,USA. 被引量:1
  • 6[6]Li,M.,et al.2001.An information-based sequence distance and its application to whole mitochondrial genome phylogeny.Bioinformatics 17:149-154. 被引量:1
  • 7[7]Chen,X.,et al.1999.A compression algorithm for DNA sequences and its applications in genome comparison.Genome Inform.Ser.Workshop Genome Inform.10:51-61. 被引量:1
  • 8[8]Sato,H.,et al.2001.DNA data compression in the post genome era.Genome Informatics 12:512-514. 被引量:1
  • 9[9]Lempel,A.and Ziv,J.1976.On the complexity of finite sequences.IEEE Trans.Inf.Theory 22:75-81. 被引量:1
  • 10[10]Kaspar,F.and Schuster,H.G.1987.Easily calculable measure for the complexity of spatiotemporal patterns.Phys.Rev.A 36:842-848. 被引量:1

同被引文献22

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部