摘要
人类基因组计划完成证实,人类共有3-3.5万个编码基因,这些基因所涵盖的编码信息仅占人类30亿个碱基对中携带遗传信息的1.5%,其余超过98%的遗传信息并不直接编码蛋白质。近些年来由于测序技术的飞速发展,人们发现这部分遗传信息与调控、剪切、转录等生物过程密切相关,其中长链非编码RNA具有表观遗传学调控、转录调控、疾病调控、细胞分化和个体发育等重要的生命过程的调控等过程,因此如何寻找RAN的功能单元和预测新的长链非编码RNA成为很重要的问题。就非编码RNA的起源与进化进行阐述,综述了长链非编码RNA在癌症上的功能,综合了长链非编码RNA一些常见的数据库及使用最新的生物信息学手段和相关技术预测长链非编码RNA,并进行进一步的功能研究。
The completion of the Human Genome Project confirmed that there is a total of 30-35 thousand coding genes, the encoding information of these genes covers only 1.5% of human 3 billion base pairs carrying the genetic information, and over 98 percent of genetic information does not directly encode the proteins. In recent years, with the rapid development of sequencing technology, it is found that genetic information of non-protein-coding genes is closely related to regulation, slicing, transcription and other biological processes. Long non- coding RNA (lncRNA) regulates the critical life processes such as epigenetic regulation, transcriptional regulation, disease control, cell differentiation, and individual development, thus it become an extremely crucial issue that how to seek the RNA functional units and predict the novel long non-coding RNA. In this review, we summarize the origin and evolution of lncRNA, the functioning in cancer biology, and some common database of lneRNA, as well as using the latest bioinformatics tools and related technologies to predict hacRNAs for further functional studies
出处
《生物技术通报》
CAS
CSCD
北大核心
2016年第9期23-31,共9页
Biotechnology Bulletin
基金
四川省科技项目(2015JY0248)
西南民族大学研究生创新型科研项目(CX2015SZ103)