基于近邻传播的时间序列基因表达谱聚类算法

A cluster Algorithm for Time-course Gene Expression Profile Based on Affinity Propagation

下载PDF

导出

摘要聚类是识别基因表达数据蕴含的关键基因调控模块的一种有效方法,基因表达谱的相似性度量是聚类的关键问题.然而,一般的相似性度量方法不能刻画时间序列基因表达谱数据所蕴含的时间延迟、反向相关和局部相关等复杂的基因调控关系.针对时间序列基因表达谱数据,提出一种基于近邻传播和动态规划的相似性度量方法和聚类算法.在大鼠再生肝细胞基因表达谱数据集上的聚类结果与基因功能富集分析结果高度一致,证明算法在时间序列基因表达谱数据聚类上的有效性. Clustering is an effective method used to identify key gene regulation modules from time-course gene expression data.An important problem in clustering is similarity measure between gene expression profiles.However,general similarity measurement methods are not suitable for gene expression data with time delay,invert correlation and transient correlation feature.Since temporal feature reflect complex regulation relationships between genes,a similarity measurement method for timecourse gene expression data and a clustering algorithm based on affinity propagation and dynamic programming methods were proposed.Clustering results in hepatocyte gene expression dataset during rat liver regeneration showed that proposed algorithm is high degree of consensus with gene function enriched analysis.Results shows validity of proposed algorithm applied in timecourse gene expression data.

作者周运徐久成徐存拴

机构地区河南师范大学生命科学学院河南师范大学省部共建细胞分化调控国家重点实验室培育基地河南师范大学计算机与信息工程学院

出处《河南师范大学学报（自然科学版）》 CAS 北大核心 2015年第6期134-140,共7页 Journal of Henan Normal University(Natural Science Edition)

基金国家973前期研究专项基金(2012CB722304) 河南省基础与前沿计划研究项目(122300410355)

关键词近邻传播时间序列反向相关瞬时相关基因表达谱 affinity propagation time-course invert correlation transient correlation gene expression prolife

分类号 TP391.9 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献17

1徐存拴,章静波著..大鼠肝再生的功能基因组学研究中[M].北京:高等教育出版社,2009:2019.
2He F, Zeng A P. In search of functional association from time-series microarray data based on the change trend and level of gene expres- sion[J]. BMC Bioinformatics, 2006,7 (69) : 1-15. 被引量：1
3Weber C C, Hurst L D. Support for multiple classes of local expression clusters in Drosophila melanogaster, but no evidence for gene or- der conservation[J]. Genome biology, 2011,12: R23. 被引量：1
4Wang G, Zhao Y H, Zhao X G, et al. Efficiently mining local conserved clusters from gene expression data[J]. Neurocomputing,2009, 73(7/8/9): 1425-1437. 被引量：1
5Leone M, Sumedha, Weigt M. Clustering by soft-constraint affinity propagation: applications to gene-expression data[J]. Bioinformat- ics, 2007,23 (20) : 2708-2715. 被引量：1
6Kiddle S J, Windram O P F, McHattie S, et al. Temporal clustering by affinity propagation reveals transcriptional modules in Arabidop- sis thaliana[J]. Bioinformatics,2010,26(3) :355-362. 被引量：1
7Yu Z W, Chen H T, You J, et al. Hybrid Fuzzy Cluster Ensemble Framework for Tumor Clustering from Biomolecular Data[J]. IEEE- ACM transactions on computational biology and bioinformatics,2013,10(3):657-670. 被引量：1
8Feng Y H, Wang J T. Prediction of Drug Target Proteins with Affinity Propagation Algorithm[J]. Research journal of biotechnology, 2013,8(6) :38-42. 被引量：1
9Wang M, Zhang W, Ding W, et al. Parallel Clustering Algorithm for Large-Scale Biological Data Sets [ J]. PLOS ONE, 2014,9 (4): e91315. 被引量：1
10Chiu T Y, Hsu T C, Yen C C, et al. Interpolation based consensus clustering for gene expression time serles[J]. BMC Bioinformatics, 2015,17:e117. 被引量：1

1王大伟.基于局部相关的图像隐写方法研究[J].软件导刊,2009,8(4):178-180.
2蔡立军,蒋林波,易叶青.基于蚁群优化算法的基因选择[J].计算机应用研究,2008,25(9):2754-2757. 被引量：7
3闫雷鸣,孙志挥,张柏礼.一种时序数据局部相关对象聚类算法[J].东南大学学报（自然科学版）,2007,37(5):793-797.
4郑希源,张化祥.基于局部近邻相关性的多标记算法[J].计算机科学,2014,41(2):123-126. 被引量：4
5赵红,王向阳,杨红颖.基于感兴趣区域的图像水印嵌入算法研究[J].小型微型计算机系统,2007,28(5):920-925. 被引量：5
6刘维,陈汉武,陈崚.一种识别基因调控元件的新型优化算法[J].计算机应用与软件,2013,30(1):21-28. 被引量：3
7张彬,蒋涛,李国徽,朱虹.跟踪局部相关数据流的高效模型[J].华中科技大学学报（自然科学版）,2009,37(3):62-65.
8王洪峰,陈立勇.云计算环境下基于张量分解的缺失关联规则挖掘算法[J].重庆邮电大学学报（自然科学版）,2015,27(3):397-403. 被引量：5
9陈熔.B样条曲线拟合Z曲线算法的实现[J].甘肃联合大学学报（自然科学版）,2008,22(2):82-85. 被引量：1
10张志平,彭岩.监督型典型相关分析及其行为识别应用[J].计算机应用与软件,2010,27(12):15-17. 被引量：1

河南师范大学学报（自然科学版）

2015年第6期

浏览历史

内容加载中请稍等...

基于近邻传播的时间序列基因表达谱聚类算法

参考文献17

相关作者

相关机构

相关主题

浏览历史