多步骤降维的肿瘤特征基因选择方法被引量：1

Tumor Gene Selection Based on a Muti-step Dimensionality Reduction Approach

导出

摘要针对基因芯片数据量大、样本数低和基因维数高的特点,提出了一种对基因芯片数据进行多步骤降维处理的分类方法.第一步,采用基因表达差异显著性分析方法(SAM)筛选得到差异表达基因子集.第二步,采用支持向量机(SVM)分类器对该差异表达基因子集进行进一步的分类降维.将该方法用来处理大肠癌和白血病数据集,得到了数量较少而分类能力较强的特征基因子集.实验结果证明该方法可以快速有效地筛选肿瘤特征基因. Microarray data has the characteristics of large quantity, low sample size and high gene dimensionality. To face this challenge, a multi-step dimensionality reduction method for classification of microarray data was proposed. In the first step, significant analysis of microarrays （SAM） was used to select a subset of differentially expressed genes （DEGs）. In the second step, a step-by-step support vector machine （SVM） classification algorithm was applied to reduce gene dimen- sionaloty of the subset of DEGs. The strategy was evaluated over three datasets of colorectal cancer and leukemia, with smaller gene numbers and higher classification accuracy. The results demonstrated the usefulness and efficiency of the approach for selection of tumor feature genes.

作者李小波

机构地区浙江教育学院信息学院

出处《复旦学报（自然科学版）》 CAS CSCD 北大核心 2008年第4期541-544,共4页 Journal of Fudan University：Natural Science

关键词基因芯片数据特征基因选择基因表达差异显著性分析方法支持向量机降维 microarray data gene sdection significant analysis of microarrays （SAM） support vector machine （SVM） dimensionality reduction

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献15

1Van'T Veer L J, Dai H, Van de Vijver M J, et al. Gene expression profiling predicts clinical outcome of breast cancer[J]. Nature ,2002,415(6871) :530-536. 被引量：1
2Golub T R, Slonim D K, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring[J]. Science, 1999,286(5439) :531-537. 被引量：1
3Guyon I, Weston J, Bamhill S, et al. Gene selection for cancer classification using support vector machines [J ]. Machine Learning, 2002,46 (1-3) : 389-422. 被引量：1
4Huerta E B,Duval B,Hao J K. A hybrid GA/SVM approach for gene selection and classification of microarray data[J ]. Lecture Notes in Computer Science. 2006,3907.34-44. 被引量：1
5Tusher V G, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response [J]. Proc Natl Acad Sci U S A ,2001,98(9) :5116-5121. 被引量：1
6Witten I H, Frank E. Data Mining: Practical machine learning tools and techniques[ M]. 2nd ed. San Francisco: Morgan Kaufmann,2005. 被引量：1
7Alon U, Barkai N, Notterman D A, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays[J ]. Proc Natl Acad Sci U S A, 1999,96 (12): 6745-6750. 被引量：1
8Barrett T, Troup D B,Wilhite S E, et al. NCBI GEO: mining tens of millions of expression profiles-database and tools update[J]. Nucleic Acids Res, 2007,35 (Database issue) : D760-765. 被引量：1
9Bolstad B M, Irizarry R A,Astrand M, et al. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J]. Bioinformatics ,2003,19(2) : 185-193. 被引量：1
10Soikkeli J, Lukk M, Nummela P, et al. Systematic search for the best gene expression markers for melanoma micrometastasis detection[J]. J Pathol, 2007,213(2) :180-189. 被引量：1

同被引文献1

1SU Yan-jun,XU Feng,YU Jin-pu,YUE Dong-sheng,REN Xiu-bao,WANG Chang-li.Up-regulation of the expression of S100A8 and S100A9 in lung adenocarcinoma and its correlation with inflammation and other clinical features[J].Chinese Medical Journal,2010(16):2215-2220. 被引量：19

引证文献1

1李小波,彭司华.多类别肿瘤分类的特征基因选择方法研究[J].复旦学报（自然科学版）,2014,53(3):305-312. 被引量：1

二级引证文献1

1喻德旷,杨谊.肿瘤特征基因选择的互信息最值过滤原则与粒子群优化算法[J].计算机应用,2018,38(2):421-426. 被引量：3

1李小波.基于SAM和GA/SVM的肿瘤基因表达谱分类算法[J].杭州师范大学学报（自然科学版）,2008,7(3):202-205. 被引量：1
2李小波,彭司华.多类别肿瘤分类的特征基因选择方法研究[J].复旦学报（自然科学版）,2014,53(3):305-312. 被引量：1
3李建更,高志坤.随机森林:一种重要的肿瘤特征基因选择法[J].生物物理学报,2009,25(1):51-56. 被引量：15
4张世芝,张明锦.基于SVM的嵌入式特征基因选择方法研究[J].计算机与应用化学,2016,33(1):85-88. 被引量：1
5刘全金,李颖新,阮晓钢.基于SVM的灵敏度分析方法选取肿瘤特征基因[J].北京工业大学学报,2007,33(9):954-958. 被引量：4
6吕江婷,陈少斌,黄宴委.基于主元分析与近邻距离的特征基因选择与去噪[J].福州大学学报（自然科学版）,2013,41(1):49-52. 被引量：1
7黄丹凤,祁云嵩,许姗娜.基于粗糙集和蚁群算法的特征基因选择方法[J].计算机技术与发展,2012,22(6):68-70. 被引量：5
8阚海俊,唐俊,苏亮亮.一种基于邻域不定性信息和记分准则相结合的肿瘤特征基因提取方法[J].安徽大学学报（自然科学版）,2014,38(1):79-83. 被引量：2
9徐久成,冯森,穆辉宇.基于信噪比与随机森林的肿瘤特征基因选择[J].河南师范大学学报（自然科学版）,2017,45(2):87-92. 被引量：11
10陈涛,洪增林,邓方安.基于优化的邻域粗糙集的混合基因选择算法[J].计算机科学,2014,41(10):291-294. 被引量：7

复旦学报（自然科学版）

2008年第4期

浏览历史

内容加载中请稍等...

多步骤降维的肿瘤特征基因选择方法被引量：1

参考文献15

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

多步骤降维的肿瘤特征基因选择方法 被引量：1

参考文献15

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

多步骤降维的肿瘤特征基因选择方法被引量：1