期刊文献+

消除随机一致性的支持向量机分类方法 被引量:12

Support Vector Machine with Eliminating the Random Consistency
下载PDF
导出
摘要 在人类自身的学习过程中,对学习结果进行科学客观的评价与反馈是关键环节.通常,由于学习者的知识缺陷或证据不足使得学习过程存在随机性,进一步可能导致学习结果与实际情况产生随机一致性.对此结果的直接反馈将严重影响学习性能的提升.同样,机器学习是以数据为驱动、以目标为导向的学习系统.由于经验历史数据有限、不平衡、含噪音等特质导致学习结果具有随机一致性.然而,以准确度为反馈准则的机器学习系统无法辨识随机一致性,这会影响学习系统的泛化能力.首先给出随机准确度和纯准确度的定义,并且进一步分析消除随机准确度的意义及必要性.然后,基于纯准确度指标,提出消除随机一致性的支持向量机分类方法PASVM,并在KEEL数据集的10种不同领域的基准测试集上验证其有效性.实验结果表明:相比于SVM、SVMperf以及其他可用于优化纯准确度指标的学习方法,PASVM泛化性能有明显提高. During the process of human learning, it is an important step to make the evaluation and feedback of the learning results objective. Usually, due to the lack of knowledge of evidence, there may exist consistency generated by the randomness in the learning results. Such rough feedback will hinder the improvement of the learning ability. Similarly, the machine learning system is a system driven by data and guided by performance measure. Due to the limitation, imbalance and noise of data, the results of machine learning also contain random consistency. However, the machine learning systems with the accuracy as the feedback index cannot discriminate the random consistency, which damages the generalization ability. In this paper, we propose the definition of the random accuracy and the pure accuracy. Further, the necessity of the elimination of random accuracy is analyzed. Then, based on the defined pure accuracy, we propose an SVM model with eliminating the random consistency, called as PASVM, and validate its efficiency on ten different benchmark data sets downloaded from KEEL. The experimental results show that the performance of the PASVM is better than that of the traditional SVM method, the SVMperf method and other methods that can optimize the pure accuracy measure.
作者 王婕婷 钱宇华 李飞江 刘郭庆 Wang Jieting;Qian Yuhua;Li Feijiang;Liu Guoqing(Institute of Big Data Science and Industry,Shanxi University,Taiyuan 030006;Key Laboratoryof Computational Intelligence and Chinese Information Processing(Shanxi University),Ministryof Education,Taiyuan 030006;School of Computer and Information Technology,Shanxi University,Taiyuan 030006)
出处 《计算机研究与发展》 EI CSCD 北大核心 2020年第8期1581-1593,共13页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61672332) 山西省三晋学者支持计划项目 山西省回国留学人员科研项目(2017023)。
关键词 随机一致性 纯准确度 支持向量机 分类 泛化能力 random consistency pure accuracy support vector machine(SVM) classification generalization ability
  • 相关文献

参考文献3

二级参考文献79

  • 1王丽丽,苏德富.基于群体智能的选择性决策树分类器集成[J].计算机技术与发展,2006,16(12):55-57. 被引量:3
  • 2Thompson S. Pruning boosted classifiers with a real valued genetic algorithm. Knowledge-Based Systems, 1999, 12(5-6): 277-284. 被引量:1
  • 3Zhou Z H, Tang W. Selective ensemble of decision trees// Proceedings of the 9th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. Chongqing, China, 2003:476-483. 被引量:1
  • 4Hernandez-Lobato D, Hernandez-Lobato J M, Ruiz-Torrubiano R, Valle A. Pruning adaptive boosting ensembles by means of a genetic algorithm//Corchado et al. International Conference on Intelligent Data Engineering and Automated Learning. Berlin Heidelberg: Springer-Verlag, 2006: 322- 329. 被引量:1
  • 5Zhang Y, Burer S, Street W N. Ensemble pruning via semidefinite programming. Journal of Machine Learning Research, 2006, 7: 1315-1338. 被引量:1
  • 6Chen H H, Tino P, Yao X. Predictive ensemble pruning by expectation propagation. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(7): 999-1013. 被引量:1
  • 7Dos Santos E M, Sahourin R, Maupin P. Overfitting cautious selection of classifier ensembles with genetic algorithms. Information Fusion, 2009, 10(2): 150-162. 被引量:1
  • 8Li N, Zhou Z H. Selective ensemble under regularization framework//Benediksson J A, Kittler J, Roll F. Multiple Classifier Systems. Berlin Heidelberg: Springer-Verlag, 2009:293-303. 被引量:1
  • 9Reid S, Grudic G. Regularized linear models in stacked generalization//Benediksson J A, Kittler J, Roli F. Multiple Classifier Systems. Berlin Heidelberg: Springer-Verlag, 2009:112-121. 被引量:1
  • 10Zhang L, Zhou W D. Sparse ensembles using weighted combination methods based on linear programming. Pattern Recognition, 2011, 44(1): 97-106. 被引量:1

共引文献207

同被引文献87

引证文献12

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部