Analysis of similarity of DNA sequences based on information quantity (基于信息量的DNA序列相似性分析)

Abstract: To address the shortcomings of traditional methods for analyzing the similarity of DNA sequences, this paper proposes a new similarity analysis algorithm based on information quantity. The method treats each DNA sequence as a signal sequence over the symbol set {A, C, G, T} and combines all the sequences to be compared into a single information system whose attribute values are the characters A, C, G, and T. Within the resulting database system, the paper introduces the information quantity, joint information quantity, conditional information quantity, and mutual information quantity of DNA sequences, discusses their properties, and gives several relations among them. A similarity analysis model for DNA sequences is then built on this basis. Simulation results show that the method analyzes DNA sequence similarity quickly and effectively, and that it copes well with two difficulties: the very large number of bases in DNA, and the differing sequence lengths of different species.
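The abstract names four quantities but does not give their definitions. As a rough illustration only, the sketch below uses the standard Shannon forms over base frequencies as a stand-in. The function names, the position-by-position pairing of two sequences, and the truncation to the shorter length are all assumptions made for this example, not the paper's model.

```python
from collections import Counter
from math import log2


def _entropy(counts, total):
    """Shannon entropy (in bits) of a frequency table."""
    return -sum((c / total) * log2(c / total) for c in counts.values())


def information(seq):
    """Information quantity of one sequence, taken here as H(X) over its bases."""
    return _entropy(Counter(seq), len(seq)) if seq else 0.0


def joint_information(x, y):
    """Joint quantity H(X, Y) over aligned base pairs.

    Assumption: positions are paired one-to-one and the longer
    sequence is truncated to the length of the shorter one.
    """
    m = min(len(x), len(y))
    return _entropy(Counter(zip(x[:m], y[:m])), m) if m else 0.0


def conditional_information(x, y):
    """Conditional quantity H(X | Y) = H(X, Y) - H(Y) on the common prefix."""
    m = min(len(x), len(y))
    return joint_information(x, y) - information(y[:m])


def mutual_information(x, y):
    """Mutual quantity I(X; Y) = H(X) + H(Y) - H(X, Y) on the common prefix."""
    m = min(len(x), len(y))
    return information(x[:m]) + information(y[:m]) - joint_information(x, y)


if __name__ == "__main__":
    a = "ACGTACGTAACC"  # made-up sequences, for illustration only
    b = "ACGTTCGTAAGC"
    print(information(a), joint_information(a, b))
    print(conditional_information(a, b), mutual_information(a, b))
```

Under these assumptions, a larger mutual quantity between two sequences suggests greater similarity. The two identities in the docstrings are the standard Shannon relations and may differ from the relations derived in the paper.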

Authors: Chen Xuegang (陈雪刚), Zhang Jialu (张家录), Cheng Jieren (程杰仁)

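Affiliations: Department of Computer Science, Xiangnan University; Department of Mathematics, Xiangnan University; Hangzhou Dianzi University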

Source: Application Research of Computers (《计算机应用研究》), 2013, No. 5, pp. 1381-1384 (4 pages). Indexed in CSCD and the Peking University Core Journals list.

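Funding: National Natural Science Foundation of China (60603062, 61100194); Hunan Province Key Construction Discipline funding; Hunan Province Education Science Twelfth Five-Year Plan project (XJK011BXJ004); Scientific Research Fund of the Hunan Provincial Education Department (11C1184)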

Keywords: DNA sequence; comparison; database system; information quantity; similarity

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
