
Parallelization of a Conserved DNA Sequence Recognition Algorithm and Construction of an MPI Cluster Environment (cited by: 1)

Parallelizing of a Conservative DNA sequences recognition algorithm and building of a small MPI cluster
Abstract: Identifying conserved sequences in DNA is computationally intensive. A parallel algorithm for recognizing transcription factor binding sites (TFBS) was developed that discovers a sequence pattern of a specified length from multiple DNA sequences. The algorithm uses a probabilistic model to measure how conserved a candidate pattern is and searches for conserved sequences through an iterative procedure. The corresponding program was written in C using the MPI message-passing model, and a three-node cluster was built on the Windows platform. The program was tested on a simulated dataset of 20 sequences, each 200 bases long, and successfully identified the motif. The results indicate that the parallel algorithm improves the efficiency of motif recognition.
Source: Chinese Journal of Bioinformatics (《生物信息学》), 2009, Issue 3, pp. 190-192, 201 (4 pages)
Funding: National Natural Science Foundation of China (60601017)
Keywords: parallel computing; motif (pattern) recognition; gene regulation; transcription factor binding sites (TFBS)
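The abstract mentions two ideas that a small example can make concrete: scoring how conserved a candidate pattern is under a probabilistic model, and dividing the work among cluster nodes. The C sketch below is only an illustration under stated assumptions, not the authors' program. The score is a simple likelihood ratio of the aligned sites against a uniform 0.25 background, and the work split is a plain block distribution. The names `site_likelihood_ratio` and `block_range` are hypothetical. No MPI calls are made, so it compiles without an MPI installation; in an MPI program the `rank` and `size` arguments would presumably come from `MPI_Comm_rank` and `MPI_Comm_size`.

```c
#include <assert.h>
#include <stddef.h>

#define ALPHABET 4
#define BACKGROUND 0.25 /* assumed uniform background base frequency */

/* Map a DNA base to 0..3; return -1 for any other character. */
static int base_index(char b)
{
    switch (b) {
    case 'A': case 'a': return 0;
    case 'C': case 'c': return 1;
    case 'G': case 'g': return 2;
    case 'T': case 't': return 3;
    default: return -1;
    }
}

/*
 * Likelihood ratio of the sites seqs[i][starts[i] .. starts[i]+width-1]
 * under their own column frequencies versus the uniform background.
 * Larger values mean a more conserved pattern. This is an assumed,
 * simplified score; longer inputs would normally be handled in log space.
 */
double site_likelihood_ratio(const char *const *seqs, const size_t *starts,
                             size_t nseq, size_t width)
{
    assert(nseq > 0 && width > 0);
    double ratio = 1.0;
    for (size_t col = 0; col < width; col++) {
        double counts[ALPHABET] = {0.0, 0.0, 0.0, 0.0};
        double n = 0.0;
        /* First pass: count each base in this column. */
        for (size_t i = 0; i < nseq; i++) {
            int k = base_index(seqs[i][starts[i] + col]);
            if (k >= 0) {
                counts[k] += 1.0;
                n += 1.0;
            }
        }
        /* Second pass: multiply in p(base) / background for each site. */
        for (size_t i = 0; i < nseq; i++) {
            int k = base_index(seqs[i][starts[i] + col]);
            if (k >= 0)
                ratio *= (counts[k] / n) / BACKGROUND;
        }
    }
    return ratio;
}

/*
 * Block distribution of n work items over `size` workers: worker `rank`
 * handles items in the half-open range [*lo, *hi). The first n % size
 * workers receive one extra item.
 */
void block_range(size_t rank, size_t size, size_t n, size_t *lo, size_t *hi)
{
    assert(size > 0 && rank < size);
    size_t base = n / size;
    size_t rem = n % size;
    *lo = rank * base + (rank < rem ? rank : rem);
    *hi = *lo + base + (rank < rem ? 1 : 0);
}
```

With the 20 sequences and three nodes described in the abstract, `block_range` would assign items 0-6, 7-13, and 14-19 to ranks 0, 1, and 2. What the original program actually distributes across nodes is not stated in the abstract, so this split is an assumption.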

