基于反绎学习的裁判文书量刑情节识别被引量：3

Recognition of sentencing circumstances in adjudication documents based on abductive learning

下载PDF

导出

摘要针对司法领域标记数据匮乏、标注质量不高、存在强逻辑性导致裁判文书量刑情节识别效果不佳的问题,提出一种基于反绎学习的量刑情节识别模型ABL-CON。首先结合神经网络与领域逻辑推理,通过半监督学习方法,使用置信学习方法表征情节识别置信度;然后修正无标签数据经过神经网络产生的不合逻辑的错误情节,重新训练识别模型,以提高识别精度。在自构建的司法数据集上的实验结果表明,使用50%标注数据与50%无标注数据的ABL-CON模型在Macro_F1值和Micro_F1值上分别达到了90.35%和90.58%,优于同样条件下的BERT和SS-ABL,也超越了使用100%标注数据的BERT模型。ABL-CON模型通过逻辑反绎修正不符合逻辑的标签能够有效提高标签的逻辑合理性以及标签的识别能力。 Aiming at the problem of poor recognition of sentencing circumstances in adjudication documents caused by the lack of labeled data,low quality of labeling and existence of strong logicality in judicial field,a sentencing circumstance recognition model based on abductive learning named ABL-CON(ABductive Learning in CONfidence)was proposed.Firstly,combining with neural network and domain logic inference,through the semi-supervised method,a confidence learning method was used to characterize the confidence of circumstance recognition.Then,the illogical error circumstances generated by neural network of the unlabeled data were corrected,and the recognition model was retrained to improve the recognition accuracy.Experimental results on the self-constructed judicial dataset show that the ABL-CON model using 50%labeled data and 50%unlabeled data achieves 90.35%and 90.58%in Macro_F1 and Micro_F1,respectively,which is better than BERT(Bidirectional Encoder Representations from Transformers)and SS-ABL(Semi-Supervised ABductive Learning)under the same conditions,and also surpasses the BERT model using 100%labeled data.The ABL-CON model can effectively improve the logical rationality of labels as well as the recognition ability of labels by correcting illogical labels through logical abductive correctness.

作者李锦烨黄瑞章秦永彬陈艳平田小瑜 LI Jinye;HUANG Ruizhang;QIN Yongbin;CHEN Yanping;TIAN Xiaoyu(College of Computer Science and Technology,Guizhou University,Guiyang Guizhou 550025,China;State Key Laboratory of Public Big Data(Guizhou University),Guiyang Guizhou 550025,China)

机构地区贵州大学计算机科学与技术学院公共大数据国家重点实验室(贵州大学)

出处《计算机应用》 CSCD 北大核心 2022年第6期1802-1807,共6页 journal of Computer Applications

基金国家自然科学基金资助项目(62066008) 贵州省科学技术基金重点项目(黔科合基础[2020]1Z055)。

关键词量刑情节识别半监督学习多标签分类反绎学习置信学习 sentencing circumstance recognition semi-supervised learning multi-label classification abductive learning confidence learning

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1Zhi-Hua ZHOU.Abductive learning: towards bridging machine learning and logical reasoning[J].Science China(Information Sciences),2019,62(7):220-222. 被引量：22
2马建刚,张鹏,马应龙.基于知识块摘要和词转移距离的高效司法文档分类[J].计算机应用,2019,39(5):1293-1298. 被引量：5
3马建刚,马应龙.语义驱动的司法文档学习分类方法[J].计算机应用,2019,39(6):1696-1700. 被引量：2
4张洛阳,毛嘉莉,刘斌,吴涛.基于贝叶斯模型的多标签分类算法[J].计算机应用,2016,36(1):52-56. 被引量：4

二级参考文献22

1ZHANG M, ZHOU Z. A review on multi-label learning algorithms [J]. IEEE transactions on knowledge and data engineering, 2014, 26(8): 1819-1837. 被引量：1
2READ J. A pruned problem transformation method for multi-label classification [C]// Proceedings of the 2008 New Zealand Computer Science Research Student Conference. Hamilton, New Zealand: [s.n.], 2008: 143-150. 被引量：1
3TSOUMAKAS G, KATAKIS I, VLAHAVAS I. Random k-labelsets for multilabel classification [J]. IEEE transactions on knowledge and data engineering, 2011, 23(7): 1079-1089. 被引量：1
4READ J, PFAHRINGER B, HOLMES G, et al. Classifier chains for multi-label classification [J]. Machine learning, 2011, 85(3): 333-359. 被引量：1
5READ J, PFAHRINGER B, HOLMES G, et al. Classifiers chains for multi-label classification [C]// Proceedings of the 2009 European Conference on Machine Learning and Knowledge Discovery in Databases. Berlin: Springer, 2009: 254-269. 被引量：1
6CHENG W, HVLLERMEIER E, DEMBCZYNSKI K J. An analysis of chaining in multi-label classification [C]// Proceedings of the 20th European Conference on Artificial Intelligence. Amsterdam: IOS Press, 2012: 294-299. 被引量：1
7CHENG W, HüLLERMEIER E, DEMBCZYNSKI K J. Bayes optimal multilabel classification via probabilistic classifier chains [C]// Proceedings of the 27th International Conference on Machine Learning. New York: ACM, 2010: 279-286. 被引量：1
8SUCAR L E, BIELZA C, MORALES E F, et al. Multi-label classification with Bayesian network-based chain classifiers [J]. Pattern recognition letters, 2014, 41(9):12-22. 被引量：1
9YU Y, PEDRYCZ W, MIAO D. Multi-label classification by exploiting label correlations [J]. Expert systems with applications, 2014, 41(6): 2989-3004. 被引量：1
10ZHANG M L, ZHOU Z H. ML-KNN: a lazy learning approach to multilabel learning [J]. Pattern recognition, 2007, 40(7): 2038-2048. 被引量：1

共引文献29

1梁睿博,王思远,李壮,刘亚松.基于RAKEL算法的商品评论多标签分类研究与实现[J].软件工程,2019,22(1):8-11. 被引量：3
2马建刚,马应龙.语义驱动的司法文档学习分类方法[J].计算机应用,2019,39(6):1696-1700. 被引量：2
3陈长建,姜流,雷娜,刘世霞.基于众包学习的交互式特征选择方法[J].中国科学：信息科学,2020,50(6):794-812. 被引量：4
4陈卓,杜昊,吴雨菲,徐童,陈恩红.基于视觉–文本关系对齐的跨模态视频片段检索[J].中国科学：信息科学,2020,50(6):862-876. 被引量：7
5原恺涓,傅四维,葛晓东,巫英才.动态图序列演变模式的可视化[J].计算机辅助设计与图形学学报,2020,32(10):1655-1662. 被引量：2
6Qi XIE,Qian ZHAO,Zongben XU,Deyu MENG.Color and direction-invariant nonlocal self-similarity prior and its application to color image denoising[J].Science China(Information Sciences),2020,63(12):83-99. 被引量：1
7Bin-Bin JIA,Min-Ling ZHANG.Multi-dimensional classification via stacked dependency exploitation[J].Science China(Information Sciences),2020,63(12):100-113. 被引量：3
8Tingting LV,Zhilei REN,Xiaochen LI,Guojun GAO,He JIANG.Toward accurate detection on change barriers[J].Science China(Information Sciences),2021,64(3):84-100.
9张姝楠,曹峰,郭倩,钱宇华.一种基于时序关系网络的逻辑推理方法[J].计算机科学,2021,48(5):239-246. 被引量：3
10Yan-Ping SUN,Min-Ling ZHANG.Compositional metric learning for multi-label classification[J].Frontiers of Computer Science,2021,15(5):1-12.

同被引文献6

1Heung-yeung SHUM,Xiao-dong HE,Di LI.From Eliza to XiaoIce：challenges and opportunities with social chatbots[J].Frontiers of Information Technology & Electronic Engineering,2018,19(1):10-26. 被引量：19
2肖琳,陈博理,黄鑫,刘华锋,景丽萍,于剑.基于标签语义注意力的多标签文本分类[J].软件学报,2020,31(4):1079-1089. 被引量：60
3郑泳智,朱定局,吴惠粦,彭小荣.知识图谱问答领域综述[J].计算机系统应用,2022,31(4):1-13. 被引量：16
4魏鑫炀,秦永彬,唐向红,黄瑞章,陈艳平.融合法条的司法裁判文书摘要生成方法[J].计算机工程与设计,2023,44(9):2844-2850. 被引量：2
5余帅,宋玉梅,秦永彬,黄瑞章,陈艳平.基于审判逻辑步骤的裁判文书摘要生成方法[J].计算机工程与应用,2024,60(4):113-121. 被引量：2
6Zhi-Hua ZHOU.Abductive learning: towards bridging machine learning and logical reasoning[J].Science China(Information Sciences),2019,62(7):220-222. 被引量：22

引证文献3

1肖新正,黄瑞章,陈艳平,秦永彬,宋玉梅,周裕林.Corrective-Net:面向多标签文本分类的标签关联学习模块[J].计算机工程与科学,2024,46(6):1092-1100.
2张鹏,郝国生,王霞,许文阳,祝义.反绎学习支持下的自动问答及其应用[J].计算机工程与应用,2024,60(17):139-147.
3李佳沂,黄瑞章,陈艳平,林川,秦永彬.结合提示学习和Qwen大语言模型的裁判文书摘要方法[J].清华大学学报（自然科学版）,2024,64(12):2007-2018.

1李会勤,魏燕.NBASS-APS在重症颅脑损伤患儿术后镇痛护理中的应用[J].保健医学研究与实践,2022,19(5):117-120. 被引量：3
2李艳翠,冯继克,来纯晓,冯洪玉,冯文贺.汉英篇章衔接对齐语料库构建研究[J].中文信息学报,2022,36(4):39-47.
3周明月,龚晨,李正华,张民.数据标注方法比较研究:以依存句法树标注为例[J].清华大学学报（自然科学版）,2022,62(5):908-916. 被引量：4
4本刊(编译).威猛一站式注射成型解决方案助力卡赫持续增长[J].现代塑料,2022(2):39-40.
5王东,李书琦.智能合约在著作权交易与保护中应用的法律适用冲突与解决路径[J].行政与法,2022(5):85-91. 被引量：1
6韩玉民,郝晓燕.基于子词嵌入和相对注意力的材料实体识别[J].计算机应用,2022,42(6):1862-1868. 被引量：3
7赵曲川,池添雨.基于U-Net深度学习慢性萎缩性胃炎模型的应用与研究[J].胃肠病学和肝病学杂志,2022,31(6):656-661. 被引量：2
8陶怀川,徐寅晨.智慧法院体系建设场景下算法可解性机制的构建[J].河南广播电视大学学报,2022,35(2):60-65.
9Simplice Innocent Moussouami,Eddie Janvier Bouhika,Florent Nsompi,Jean Michel Bazaba Kayilou,François Mbemba,Alphonse Massamba.Prevalence and Risk Factors of Cardiovascular Diseases in the Congo-Brazzaville Pygmies[J].World Journal of Cardiovascular Diseases,2016,6(7):211-217.
10Leonel Pereira.Therapeutic and nutritional uses of marine algae:a pharmacy in the ocean[J].Traditional Medicine Research,2022,7(3):124-127.

计算机应用

2022年第6期

浏览历史

内容加载中请稍等...

基于反绎学习的裁判文书量刑情节识别被引量：3

参考文献4

二级参考文献22

共引文献29

同被引文献6

引证文献3

相关作者

相关机构

相关主题

浏览历史

基于反绎学习的裁判文书量刑情节识别 被引量：3

参考文献4

二级参考文献22

共引文献29

同被引文献6

引证文献3

相关作者

相关机构

相关主题

浏览历史

基于反绎学习的裁判文书量刑情节识别被引量：3