Abstract
With genome sequencing completed for many organisms, gene identification has become especially important. CpG islands have significant biological meaning in the genome and are useful markers for gene finding, so detecting them helps with gene prediction. A Hidden Markov Model (HMM) is therefore constructed to identify CpG islands in DNA sequences. The model is trained and tested on 94 gene sequences selected at random from an online database of human CpG islands, and high prediction accuracy is achieved. The results show that the HMM is a fast and effective method for CpG island identification.
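As a rough illustration of how an HMM can label CpG islands, the sketch below runs Viterbi decoding over a textbook-style eight-state model (one state per base inside an island, one per base outside). It is not the model from the paper: the transition tables, the `SWITCH` probability, and the names `log_trans` and `viterbi_cpg` are all assumptions chosen for illustration, and the input is assumed to contain only A, C, G and T.

```python
import math

BASES = "ACGT"

# Illustrative base-to-base transition probabilities (rows: previous base,
# columns: next base in A, C, G, T order). Assumed values, NOT the paper's
# trained parameters; C->G is common inside islands and rare outside.
ISLAND = {
    "A": [0.180, 0.274, 0.426, 0.120],
    "C": [0.171, 0.368, 0.274, 0.188],
    "G": [0.161, 0.339, 0.375, 0.125],
    "T": [0.079, 0.355, 0.384, 0.182],
}
BACKGROUND = {
    "A": [0.300, 0.205, 0.285, 0.210],
    "C": [0.322, 0.298, 0.078, 0.302],
    "G": [0.248, 0.246, 0.298, 0.208],
    "T": [0.177, 0.239, 0.292, 0.292],
}
SWITCH = 0.01  # assumed probability of changing region type at each step


def log_trans(prev_label, prev_base, label, base):
    """Log-probability of moving from state (prev_base, prev_label) to (base, label)."""
    row = (ISLAND if label == "+" else BACKGROUND)[prev_base]
    p_base = row[BASES.index(base)] / sum(row)  # normalise rounding error
    p_label = 1.0 - SWITCH if label == prev_label else SWITCH
    return math.log(p_label * p_base)


def viterbi_cpg(seq):
    """Return one label per base: '+' for island, '-' for background."""
    seq = seq.upper()
    if not seq:
        return ""
    labels = "+-"
    # Each state emits its own base, so only two states are possible per position.
    score = {lab: math.log(1.0 / 8.0) for lab in labels}  # uniform start
    back = []
    for prev_base, base in zip(seq, seq[1:]):
        new_score, pointers = {}, {}
        for lab in labels:
            candidates = {
                p: score[p] + log_trans(p, prev_base, lab, base) for p in labels
            }
            best_prev = max(candidates, key=candidates.get)
            new_score[lab] = candidates[best_prev]
            pointers[lab] = best_prev
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    lab = max(score, key=score.get)
    path = [lab]
    for pointers in reversed(back):
        lab = pointers[lab]
        path.append(lab)
    return "".join(reversed(path))
```

Calling `viterbi_cpg("AT" * 50 + "CG" * 50 + "AT" * 50)` should label the CG-rich middle stretch as island and the flanks as background. In a real application the tables would be estimated from annotated sequences, as the abstract describes for the 94 training sequences.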
Source
Computer Applications and Software (《计算机应用与软件》)
CSCD
Peking University Core Journal (北大核心)
2008, Issue 11, pp. 214-215, 231 (3 pages)