Machine Learning for Malicious Traffic Detection in Medical and Health Application Scenarios


Abstract

Objective: To detect malicious traffic in medical and health application scenarios using two machine learning models, random forest and decision tree.

Methods: The CICIDS2017 dataset served as the training and validation data. After preprocessing in Python it contained 1708979 records, of which 80% (1367183 records) formed the training set and 20% (341795 records) the validation set. The random forest and decision tree models were trained and their parameters tuned in sklearn. A further 500 network flows captured in a constructed medical and health application scenario were then used as the test set to evaluate how well the models generalize.

Results: According to the confusion matrices of the two models, the decision tree model reached a prediction accuracy of 95% for slow denial-of-service attacks and cross-site scripting attacks; in particular, when predicting slow denial-of-service attacks it confused them with cross-site scripting attacks. The random forest model reached a prediction accuracy of 99% for slow denial-of-service attacks and correctly predicted most of them. Overall, the random forest model performed well in medical and health application scenarios.

Conclusion: Both models detected malicious traffic in medical and health application scenarios with good accuracy, but the traditional decision tree model was less accurate than the random forest model. The random forest model is therefore better suited to malicious traffic detection in medical and health scenarios and can serve as a reference for network security research in these scenarios.
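The results above are read from confusion matrices. As a minimal sketch of how a per-class prediction accuracy (the fraction of samples of a class that the model assigns to that class) can be derived from true and predicted labels, the following uses only the Python standard library. The function names, the label names, and the toy label lists are illustrative assumptions, not code or data from the study; in the workflow the abstract describes, the predicted labels would come from the decision tree or random forest model trained in sklearn.

```python
from collections import Counter


def confusion_counts(y_true, y_pred):
    """Count how often each (true label, predicted label) pair occurs."""
    return Counter(zip(y_true, y_pred))


def per_class_accuracy(y_true, y_pred):
    """For each true class, the fraction of its samples predicted correctly."""
    counts = confusion_counts(y_true, y_pred)
    totals = Counter(y_true)
    return {label: counts[(label, label)] / totals[label] for label in totals}


# Hypothetical toy labels (assumption): 20 slow DoS flows, 10 XSS flows,
# 10 benign flows. Two slow DoS flows are misclassified as XSS.
y_true = ["slow_dos"] * 20 + ["xss"] * 10 + ["benign"] * 10
y_pred = ["slow_dos"] * 18 + ["xss"] * 2 + ["xss"] * 10 + ["benign"] * 10

counts = confusion_counts(y_true, y_pred)
accuracy = per_class_accuracy(y_true, y_pred)

for label, value in accuracy.items():
    print(f"{label}: {value:.2f}")
print("slow_dos predicted as xss:", counts[("slow_dos", "xss")])
```

The off-diagonal count (`slow_dos` predicted as `xss`) is the kind of confusion the abstract reports for the decision tree model. With sklearn available, `sklearn.metrics.confusion_matrix` returns the same counts as an array.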

Authors: GAO Jianyun; LIU Yingying; DAI Yilan; LI Shu

Affiliations: Institute of Medical Device Control, China Academy of Food and Drug Control, Beijing 102629, China; School of Medical Devices, Shenyang Pharmaceutical University, Shenyang, Liaoning 117004, China; Beijing Medical and Health Technology Development Center, Beijing 100035, China

Source: China Medical Devices (《中国医疗设备》), 2024, No. 1, pp. 12-17 (6 pages)

Funding: National Key Research and Development Program of China (2020YFC2007104).

Keywords: medical and health application scenarios; machine learning; decision tree; random forest; network security

Classification code: R197.39 [Medicine and Health: Health Services Management]

