Weakly supervised learning from the view of uncertainty (不确定性视角下的弱监督学习)

Abstract: The era of big data has provided machine intelligence with richer data and a more comprehensive basis for learning. However, because acquiring accurate, high-quality labels requires substantial manpower and material resources, the amount of labeled data that can be used effectively during learning remains limited. The large number of unlabeled or poorly labeled instances carries considerable uncertainty. Weak supervision means that the supervision information is uncertain, and weakly supervised learning is a training and inference paradigm built on formalizing the uncertainty in the supervision signals. This paper analyzes the problem formulations of weakly supervised learning and the corresponding solutions from the perspective of uncertainty modeling, and surveys the relationship between several weakly supervised learning paradigms and uncertainty modeling.

Authors: ZHOU Xinlei (周欣蕾); WANG Xizhao (王熙照). Department of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China

Affiliation: College of Computer Science and Software Engineering, Shenzhen University (深圳大学计算机与软件学院)

Source: Journal of Northwest University (Natural Science Edition) (《西北大学学报（自然科学版）》), 2022, No. 5, pp. 813-823 (11 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list (北大核心).

Funding: National Natural Science Foundation of China (61976141, 61732011).

Keywords: supervision information; weakly supervised learning; uncertainty modeling

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]


