Imbalanced Multi-label Learning Algorithm Based on Classification Interval Enhanced (基于分类间隔增强的不平衡多标签学习算法)

Cited by: 2
Abstract: Traditional multi-label learning algorithms generally do not account for label imbalance, and so ignore its effect on classification. Statistics show, however, that the commonly used multi-label datasets all exhibit label imbalance, and that minority-class labels are often the more important ones. This paper therefore proposes an imbalanced multi-label learning algorithm based on classification interval enhanced (MLCIE). It reconstructs the classification interval of each label to improve how efficiently the classifier learns from samples of minority-class labels and to raise the quality of sample labels, thereby reducing the effect of label imbalance on classifier accuracy. First, the uncertainty coefficient of each label is computed from its label density and conditional entropy. Then a classification interval enhancement matrix is constructed, which folds the density information specific to each label into the original label matrix to obtain a balanced label space. Finally, an extreme learning machine is used as a linear classifier for classification. In experiments on 11 standard multi-label datasets against 7 other multi-label learning algorithms, the results show that the proposed algorithm is effective to some extent at addressing label imbalance.
Authors: CHENG Yusheng (程玉胜); CAO Tiancheng (曹天成). Affiliations: University Key Laboratory of Intelligent Perception and Computing of Anhui Province (Anqing Normal University), Anqing 246133, China; Innovation Team of Anqing Normal University, Anqing 246133, China
Source: Journal of Data Acquisition and Processing (《数据采集与处理》), CSCD, Peking University Core Journal, 2021, No. 3, pp. 519-528 (10 pages)
Keywords: multi-label learning; label imbalance; classification interval; label density; extreme learning machine
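
The pipeline outlined in the abstract (per-label density, a density-aware rewrite of the label matrix, then an extreme learning machine) can be illustrated with a minimal sketch. This is not the authors' MLCIE implementation: the abstract gives no formulas, so the `enhance_labels` rule below is a hypothetical stand-in for the classification interval enhancement matrix, the conditional-entropy step is omitted, and all function names, the sigmoid hidden layer, and the ridge parameter `reg` are assumptions made for illustration.

```python
import numpy as np

def label_density(Y):
    """Fraction of samples carrying each label; Y is (n_samples, n_labels) with entries in {0, 1}."""
    return Y.mean(axis=0)

def enhance_labels(Y, density):
    """Hypothetical margin enhancement (NOT the paper's formula).

    Labels are mapped to {-1, +1}; positive targets of rarer labels are
    pushed further from zero so the regression fit weights them more.
    """
    signed = 2.0 * Y - 1.0
    boost = 1.0 + (1.0 - density)  # rarer label -> larger positive target
    return np.where(signed > 0, signed * boost, signed)

class ELM:
    """Basic extreme learning machine: random hidden layer, ridge-solved output weights."""

    def __init__(self, n_hidden=50, reg=1.0, seed=0):
        self.n_hidden = n_hidden
        self.reg = reg
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation over a fixed random projection.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, T):
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Regularized least squares: (H^T H + reg * I) beta = H^T T
        A = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(A, H.T @ T)
        return self

    def predict(self, X):
        # A label is predicted present when its real-valued output is positive.
        return (self._hidden(X) @ self.beta > 0).astype(int)
```

With a label matrix `Y` and features `X`, one would call `ELM().fit(X, enhance_labels(Y, label_density(Y)))` and then `predict` on new samples. The only point of the sketch is that the balancing happens in the target matrix, so the classifier itself stays an unmodified ELM.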
