摘要
根据结肠癌肿瘤基因表达谱样本高维数、小样本和高噪声的特点,提出用Bhattacharyya距离对肿瘤基因进行测量,滤除分类无关基因,然后用肿瘤基因对支持向量机模型的敏感度进行二次提取.并用它的归一化值对重要基因赋权,形成只有少数重要致病肿瘤基因的新样本集.最后,支持向量机应用于对新样本集的特征基因进行分析与测试.实验证明这种分析方法提高了肿瘤诊断的准确率.
According to the characteristics of the colon cancer gene expression profiles with high dimension, small sample and great noise, a method is proposed to measure the tumor gene with the Bhattacharyya distance and remove the genes irrelevant to the classification task. The method extracts the tumor gene for the second time by utilizing the sensitivity of the tumor gene on the model. Simultaneously, a weight is added to the important genes depending on the normalization of the sensitivity and a new sample dataset is built. Finally a support vector machine is used to analyze and test the feature genes on the new sample dataset. Experimental results show that this method improves the accuracy of tumor diagnosis.
出处
《西安电子科技大学学报》
EI
CAS
CSCD
北大核心
2012年第1期191-196,共6页
Journal of Xidian University
基金
国家自然科学基金资助项目(60974082)