摘要
匿名通信工具在进行用户隐私保护的同时也为违法犯罪提供了便利,使得网络环境净化与监管愈发困难。对匿名网络信息交换产生的匿名流量进行分类可以细化网络监管范围。文章针对现有匿名流量分类方法存在流量分类粒度不细致和应用层匿名流量分类准确率偏低等问题,提出一种基于机器学习的匿名流量分类方法。该方法包括基于自动编码器和随机森林的特征提取模型以及基于卷积神经网络和XGBoost的匿名流量多分类模型两个模型,通过特征重构和模型结合的方式提升分类效果。最后在Anon17公开匿名流量数据集上进行了验证,证明了模型的可用性、有效性和准确性。
Anonymous communication tools not only protect users’privacy,but also provide shelter for crimes,making it more difficult to purify and supervise the network environment.Classification of anonymous traffic generated during information exchange in anonymous networks can refine the scope of network supervision.Aiming at the problems of insufficient granularity of traffic classification and low accuracy of anonymous traffic classification in the application layer in the existing anonymous traffic classification field,this paper proposed an application layer multi classification method for anonymous traffic based on machine learning.It included the feature extraction model based on auto-encoder and random forest,and the anonymous traffic multi classification model based on convolutional neural networks and XGBoost.The classification effect is improved through feature reconstruction and model combination,and is verified on Anon17 public anonymous traffic dataset,proving the usability,effectiveness and accuracy of the designed model.
作者
赵小林
王琪瑶
赵斌
薛静锋
ZHAO Xiaolin;WANG Qiyao;ZHAO Bin;XUE Jingfeng(School of Computer Science&Technology,Beijing Institute of Technology,Beijing 100081,China)
出处
《信息网络安全》
CSCD
北大核心
2023年第5期1-10,共10页
Netinfo Security
基金
国家重点研发计划[2020YFB1712104]
山东省重点研发计划(重大科技创新工程)[2020CXGC010116]。
关键词
机器学习
匿名流量
自动编码器
特征提取
卷积神经网络
machine learning
anonymous traffic
auto-encoder
feature extraction
convolutional neural networks