一种分析全基因组上位性的新方法被引量：1

A Genome-wide Epistasis Analysis Method Based on Multiple Criteria Fusion

下载PDF

导出

摘要传统基于单位点的全基因组关联研究存在重复性低、难以解释性等缺陷,而采用基于机器学习的上位性分析中面临计算复杂度高、预测准确度不足等问题.本文提出一种分析全基因组上位性的新方法,该方法采用二阶段框架的上位性分析方法,它包含特征过滤阶段以及上位性组合优化阶段,在特征过滤阶段提出了多准则融合策略,从多个不同角度评价遗传变异位点,以保证易感的弱效位点能被保留,然后采用多准测排序融合策略剔除与疾病状态关联程度低的遗传变异,进一步在上位性组合优化阶段采用贪婪算法启发式地搜索组合空间,以降低时间复杂度,最后采用支持向量机作为上位性评价模型.实验中采用不同的连锁不平衡参数与经典算法SNPruler与ACO的性能进行对比,实验结果表明:本文方法能有效保留弱效位点,一定程度上提高了疾病预测的正确度. Traditional units of genome-wide association studies have serious defects such as low repeat- ability, difficulty to interpret, and epistasis analysis based on machine learning has troubles such as high computational complexity and insufficient prediction accuracy. This paper presented a new approach for the analysis of genome-wide epistatic. This method uses the framework of two-phase epistatic analysis meth- od. It includes a filtering stage and an epistatic combinatorial optimization stage. The characteristics of the filtering stage presents a multicriteria fusion strategy for the evaluation of genetic loci from multiple per- spectives to ensure that the weak effect of susceptibility loci can be retained, and then, this method uses the multiple criteria sorting fusion strategy to eliminate the low degree of genetic variation associated with disease states. Epistatic combinatorial optimization phase uses the greedy algorithm combination of heuristic search space in order to reduce the time complexity. Finally, a support vector machine was used as the epistatic evaluation model. Experiments with different parameters of linkage disequilibrium SNPruler with classical algorithms were compared with the performance of the ACO, and the experiment results show that the method can effectively keep weak effect locus and improve disease forecasting accuracy considerably.

作者李泽军陈敏曾利军

机构地区湖南大学信息科学与工程学院湖南工学院计算机科学与信息学院

出处《湖南大学学报（自然科学版）》 EI CAS CSCD 北大核心 2016年第10期155-160,共6页 Journal of Hunan University:Natural Sciences

基金国家自然科学基金资助项目(61672223) 湖南省自然科学基金资助项目(2016JJ4029)

关键词全基因组关联研究上位性复杂疾病智能计算 GWAS （Genome-Wide Association Study） epistasis complex diseases intelligent computing

分类号 TP39 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1吴蓉晖,卢友敏.基于蚁群算法的复杂疾病上位性分析方法[J].湖南大学学报（自然科学版）,2014,41(8):101-105. 被引量：1

二级参考文献11

1孙玉琳,赵晓航.复杂疾病基因定位策略与肿瘤易感基因鉴定[J].生物化学与生物物理进展,2005,32(9):803-810. 被引量：6
2阮晓钢,晁浩.肿瘤识别过程中特征基因的选取[J].控制工程,2007,14(4):373-375. 被引量：15
3GENIN E, COUSTET B, ALLANORE Y,et al. Epistatic In- teraction between BANKI and BLK in rheumatoid arthritis: results from a large trans-ethnic meta-analysis[J]. Plos One, 2013, 8(4):e61044. 被引量：1
4CHEN S H, SUN J,DIMITROV L, et al. A support vector machine approach for detecting gene-gene interaction[J]. Genetic Epidemiology, 2008,32 : 152- 167. 被引量：1
5ROBNIK-SIKONJA M, KONONENKO I. Theoretical and empirical analysis of relief and relief[J]. Machine Learning, 2003, 53(1): 23-69. 被引量：1
6WAN Xiang,YANG Can, YANG compositional epistasis detection studies[J]. BMC Genetics,2013, Qiang, et al. The complete in genome-wide association 14(7):1-11. 被引量：1
7WANG Y, LIU G M. An empirical comparison of several re- cent epistatic interaction detection methods[J]. Bioinformat- ics,2011, 27(21): 2936-2943. 被引量：1
8WAN Xiang, YANG Can, YANG Qiang,et al. Predictive rule inference for epistatie interaction detection in genome-wide as- sociation studies[J]. Bioinformatics, 2010,26 ( 1 ) : 30-37. 被引量：1
9文益民,王耀南,张莹.基于分类面拼接的快速模块化支持向量机研究[J].湖南大学学报（自然科学版）,2009,36(3):45-50. 被引量：1
10吴建辉,章兢,刘朝华.基于蚁群算法和免疫算法融合的TSP问题求解[J].湖南大学学报（自然科学版）,2009,36(10):81-87. 被引量：10

同被引文献4

1熊平,朱天清,王晓峰.差分隐私保护及其应用[J].计算机学报,2014,37(1):101-122. 被引量：174
2张啸剑,孟小峰.面向数据发布和分析的差分隐私保护[J].计算机学报,2014,37(4):927-949. 被引量：138
3何贤芒,王晓阳,陈华辉,董一鸿.差分隐私保护参数ε的选取研究[J].通信学报,2015,36(12):124-130. 被引量：15
4刘春,谢皓,肖奕霖,邓传远.EWT算法在ECG信号滤波中的研究[J].电子测量与仪器学报,2017,31(11):1835-1842. 被引量：21

引证文献1

1陈红松,孟彩霞,刘书雨.基于经验小波变换的基因关联隐私保护实验研究[J].湖南大学学报（自然科学版）,2020,47(2):125-133. 被引量：1

二级引证文献1

1牟星宇,陈晖,徐昕,江晓玲,李云峰,张鑫晶.基于Tangle网络的群智感知隐私保护激励方法[J].科学技术与工程,2024,24(3):1138-1145.

1吴蓉晖,卢友敏.基于蚁群算法的复杂疾病上位性分析方法[J].湖南大学学报（自然科学版）,2014,41(8):101-105. 被引量：1
2李雄.复杂疾病的组学数据挖掘方法研究[J].邵阳学院学报（自然科学版）,2017,14(2):12-18.
3WANG Shaowei,ZHU Qiuping.Fitness Landscape Analysis for Optimum Multiuser Detection Problem[J].Wuhan University Journal of Natural Sciences,2007,12(6):1073-1076.
4裘嵘,刘春宇,吴敏.面向GWAS未覆盖基因组区的数据集成[J].中南大学学报（自然科学版）,2013,44(5):1875-1880. 被引量：1
5韦振中,黄廷磊.基于支持向量机和遗传算法的特征选择[J].广西工学院学报,2006,17(2):18-21. 被引量：12
6吴红霞,吴悦,刘宗田,雷州.基于Relief和SVM-RFE的组合式SNP特征选择[J].计算机应用研究,2012,29(6):2074-2077. 被引量：17
7赵娟.基于NS-2的蠕虫病毒仿真[J].微型电脑应用,2009,25(12):21-23.
8孙发军,彭际群.并行计算中组合空间问题的均衡划分研究[J].怀化学院学报,2014,33(11):25-28.
9杨子立.心电监护系统便携式智能终端的设计[J].江苏理工学院学报,2015,21(6):1-6. 被引量：2
10M.P.JEYARAMAN,T.K.SURESH,E.KESHAVA REDDY.Strong Differential Subordination and Superordination Properties of Phi-Like Functions[J].Journal of Mathematical Research with Applications,2016,36(3):291-300.

湖南大学学报（自然科学版）

2016年第10期

浏览历史

内容加载中请稍等...

一种分析全基因组上位性的新方法被引量：1

参考文献1

二级参考文献11

同被引文献4

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种分析全基因组上位性的新方法 被引量：1

参考文献1

二级参考文献11

同被引文献4

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

一种分析全基因组上位性的新方法被引量：1