Abstract
This paper counts all repeated sequences of every length in complete HIV genome sequences and analyzes their AT/GC content. Two findings are reported. (1) The number of occurrences n of a repeated sequence and the total number N(n) of repeated sequences that occur n times follow a power-law relationship, which means that only a few repeated sequences in the complete HIV sequence occur many times, while most occur only a few times. (2) The AT content of the complete HIV sequence is higher than its GC content. For repeated sequences of each length K, sequences with 100% AT content always outnumber those with 100% GC content, and the longest 100%-AT repeated sequence is longer than the longest 100%-GC one, which suggests that the complete HIV sequence may have been subject to stronger AT pressure during its evolution. Based on this analysis of repeated sequences, the paper further discusses the evolutionary course and characteristics of HIV.
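The counting described in the abstract can be illustrated with a minimal sketch. It is not the authors' code, and it rests on assumptions the abstract does not state: a "repeated sequence" is taken to be any length-K substring that occurs at least twice, overlapping occurrences are counted, and the power-law exponent is estimated by a plain least-squares fit on log-log axes. The function names and the toy sequence are hypothetical.

```python
import math
from collections import Counter


def repeat_counts(seq, k):
    """Map each length-k substring that occurs at least twice to its count.

    Assumption: overlapping occurrences are counted.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return {s: n for s, n in counts.items() if n >= 2}


def occurrence_histogram(repeats):
    """N(n): number of distinct repeated sequences that occur exactly n times."""
    return Counter(repeats.values())


def pure_at_gc(repeats):
    """Count distinct repeats made only of A/T and only of G/C."""
    at = sum(1 for s in repeats if set(s) <= {"A", "T"})
    gc = sum(1 for s in repeats if set(s) <= {"G", "C"})
    return at, gc


def loglog_slope(hist):
    """Least-squares slope of log N(n) against log n (needs >= 2 distinct n)."""
    xs = [math.log(n) for n in hist]
    ys = [math.log(c) for c in hist.values()]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy sequence for illustration only; it is not HIV data.
toy = "ATATATGCGCATATTTAAGC"
repeats = repeat_counts(toy, 2)
print(repeats)
print(occurrence_histogram(repeats))
print(pure_at_gc(repeats))
```

On real data one would run `repeat_counts` for each K of interest, then pass the resulting `occurrence_histogram` to `loglog_slope`; a roughly straight line on log-log axes is consistent with a power law, though a least-squares fit alone does not establish one.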
Source
《大众科技》
2012, No. 5, pp. 131-133 (3 pages)
Popular Science & Technology
Funding
Research on the evolutionary history of the AIDS virus and its latency period (T32581)