摘要
利用代数学中同态思想和物理中的"粗粒化"思想,以及HP模型,根据a,t,c,g的化学结构分类,提出了DNA序列的特征序列概念(σ-,τ-,σ∩τ-)并推广到蛋白质序列中,从而给出一种数值刻划,将蛋白质序列简化成一个(0,1)序列[6],基于上述给出特征序列的方法,根据氨基酸分子量与简并度的关系,提出了另外一种DNA序列的特征序列概念(-)并推广到蛋白质序列中,进而给出了另外一种数值刻划,将蛋白质序列简化成一个(0,1,2)序列,通过比较RHD基因和RHCE基因的特征序列的数值刻划图,得出RHD基因和RHCE基因均偏爱使用低分子量且高简并度的氨基酸。
By introducing the homomorphism in algebra and "coarse" in physics and HP model, according to the chemical structure classification of a,t,c and g, these concepts of σ- , τ- and σ ∩ τ- have been presented and promoted into protein sequence, furthermore, the graph representation is introduced, and protein sequence is simplified to be (0,1 ) sequence . On the base of the method giving the characteristic sequences, according to the relation of the molecular weight? and degeneracy of the amino acids, another concept of DNA characteristic sequences( - )is presented, and promoted into protein sequence, furthermore, another graph representation is introduced, protein sequence is simplified to be (0,1,2) sequence, we can know that both RHD and RHCE genes all prefer to use the amino acids with small molecular weight and high degeneracy, by comparing the graph representation of the RHD and RHCE characteristic sequences.
出处
《生物信息学》
2009年第4期248-251,共4页
Chinese Journal of Bioinformatics
基金
江南大学预研基金资助(2007LYY007)