摘要
目的:筛选肺癌患病风险相关的关键基因,为进一步的相关性研究提供参考。方法:基于文献挖掘的方法统计出肺癌患病风险相关基因,运用生物信息学方法对上述基因进行分析。结果:近10年内的文献共涉及312个基因,其中不同分级的GO分类762种,KEGG通路81种,而ln大于0且P值≤0.001的GO分类有70种,KEGG通路8种,两者主要与细胞生理、代谢、增殖、DNA损伤修复及细胞凋亡、细胞周期、MAPK信号通路有关。在线String软件构建的蛋白网络图提示这些基因的表达产物大部分之间存在着密切的相互作用关系,同时发现上述基因中307个可翻译成蛋白并参与网络构建。进一步运用Cytoscape软件将"String"软件所构建的蛋白网络图可视化及量化同时筛选出24个关键基因。结论:肺癌患病风险相关基因种类繁多,针对关键基因的大样本跨种族高质量肺癌患病风险临床研究是未来的研究方向。
Objective: This study aimed to determine the key genes that are associated with lung cancer risk for further study. Methods: Techniques in bioinformatics were applied in lung cancer risk genes that were obtained from clinical trials. Results: We obtained 312 lung cancer risk genes that are mainly related to cellular physiological processes, metabolism, cell proliferation, DNA repair and cell cycle, apoptosis, and MAPK signaling pathway. A total of 762 functional classifications by the Gene Ontology (GO) and 81 pathways by the Kyoto _Encyclopedia of Genes and Genomes (KEGG) were involved. Among these risk genes, only 70 GO classifications and 8 KEGG pathways were significant [ ln(Bayes factor) ≥ 0 and P-value ≤ 0.001 ]. Only 307 genes can be translated into proteins; thus, the complex pro tein-protein network that was built using the Online String software also revealed 307 nodes. This network was visualized and quantified using the Cytoscape software. We determined 24 key genes from the protein-protein network. Conclusion: Selecting the key genes from a large number of lung cancer risk genes and performing a large sample interracial clinical study are important in the analysis of lung cancer key risk genes through bioinformatics.
出处
《中国肿瘤临床》
CAS
CSCD
北大核心
2012年第23期1890-1895,共6页
Chinese Journal of Clinical Oncology