基于双向时间卷积网络的半监督日志异常检测

Semi-supervised log anomaly detection based on bidirectional temporal convolution network

下载PDF

导出

摘要由于日志解析准确率不高以及标记样本不足降低了异常检测的准确率,所以提出了一种新的基于日志的半监督异常检测方法。首先,通过改进字典的日志解析方法,保留了日志事件中的部分参数信息,从而提高日志信息的利用率和日志解析的准确率;然后,使用BERT对模板中的语义信息进行编码,获得日志的语义向量;接着采用聚类的方法进行标签估计,缓解了数据标注不足的问题,有效提高了模型对不稳定数据的检测;最后,使用带有残差块的双向时间卷积网络(Bi-TCN)从两个方向捕获上下文信息,提高了异常检测的精度和效率。为了评估该方法的性能,在两个数据集上进行了评估,最终实验结果表明,该方法与最新的三个基准模型LogBERT、PLELog和LogEncoder相比,F 1值平均提高了7%、14.1%和8.04%,能够高效精准地进行日志解析和日志异常检测。 Because the accuracy of log parsing is not high and the lack of tag samples reduces the accuracy of anomaly detection,this paper proposed a new semi-supervised anomaly detection method based on logs.Firstly,the method enhanced the log parsing method of the dictionary to retain parameter information in log events,improving the utilization and accuracy of log resolution.Next,the method utilized BERT to encode semantic information in the template,obtaining the semantic vector of the log.Then,the method employed the clustering method to estimate the tag,which effectively alleviated the problem of insufficient data labeling and enhanced the model’s ability of detecting unstable data.Finally,the method captured context information from two directions based on the bidirectional temporal convolution network(Bi-TCN)with residual blocks,which enhanced the accuracy and efficiency of anomaly detection.To evaluate the method’s performance,it conducted extensive experiments on two datasets.The results demonstrate that the proposed method achieves an average improvement of 7%,14.1%and 8.04%in F 1 value compared to the latest three benchmark models,LogBERT,PLELog and LogEncoder,enabling efficient and accurate log parsing and log anomaly detection.

作者尹春勇孔娴 Yin Chunyong;Kong Xian(School of Computer,Nanjing University of Information Science&Technology,Nanjing 210044,China)

机构地区南京信息工程大学计算机学院

出处《计算机应用研究》 CSCD 北大核心 2024年第7期2110-2117,共8页 Application Research of Computers

基金国家自然科学基金面上项目(6177282)。

关键词日志解析异常检测半监督学习双向时间卷积网络上下文相关性 log parsing anomaly detection semi-supervised learning bidirectional temporal convolution network contextual correlation

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1贾统,李影,吴中海.基于日志数据的分布式软件系统故障诊断综述[J].软件学报,2020,31(7):1997-2018. 被引量：26
2张颖君,刘尚奇,杨牧,张海霞,黄克振.基于日志的异常检测技术综述[J].网络与信息安全学报,2020,6(6):1-12. 被引量：9

二级参考文献6

1饶翔,王怀民,陈振邦,周扬帆,蔡华,周琦,孙廷韬.云计算系统中基于伴随状态追踪的故障检测机制[J].计算机学报,2012,35(5):856-870. 被引量：23
2MI HaiBo,WANG HuaiMin,ZHOU YangFan,LYU Michael,CAI Hua.Localizing root causes of performance anomalies in cloud computing systems by analyzing request trace logs[J].Science China(Information Sciences),2012,55(12):2757-2773. 被引量：4
3廖湘科,李姗姗,董威,贾周阳,刘晓东,周书林.大规模软件系统日志研究综述[J].软件学报,2016,27(8):1934-1947. 被引量：37
4崔元,张琢.基于大规模网络日志的模板提取研究[J].计算机科学,2017,44(B11):448-452. 被引量：7
5梅御东,陈旭,孙毓忠,牛逸翔,肖立,王海荣,冯百明.一种基于日志信息和CNN-text的软件系统异常检测方法[J].计算机学报,2020,43(2):366-380. 被引量：35
6M.Ablikim,M.N.Achasov,P.Adlarson,S.Ahmed,M.Albrecht,M.Alekseev,A.Amoroso,F.F.An,Q.An,Y.Bai,O.Bakina,R.Baldini Ferroli,Y.Ban,K.Begzsuren,J.V.Bennett,N.Berger,M.Bertani,D.Bettoni,F.Bianchi,J Biernat,J.Bloms,I.Boyko,R.A.Briere,L.Calibbi,H.Cai,X.Cai,A.Calcaterra,G.F.Cao,N.Cao,S.A.Cetin,J.Chai,J.F.Chang,W.L.Chang,J.Charles,G.Chelkov,Chen,G.Chen,H.S.Chen,J.C.Chen,M.L.Chen,S.J.Chen,Y.B.Chen,H.Y.Cheng,W.Cheng,G.Cibinetto,F.Cossio,X.F.Cui,H.L.Dai,J.P.Dai,X.C.Dai,A.Dbeyssi,D.Dedovich,Z.Y.Deng,A.Denig,Denysenko,M.Destefanis,S.Descotes-Genon,F.De Mori,Y.Ding,C.Dong,J.Dong,L.Y.Dong,M.Y.Dong,Z.L.Dou,S.X.Du,S.I.Eidelman,J.Z.Fan,J.Fang,S.S.Fang,Y.Fang,R.Farinelli,L.Fava,F.Feldbauer,G.Felici,C.Q.Feng,M.Fritsch,C.D.Fu,Y.Fu,Q.Gao,X.L.Gao,Y.Gao,Y.Gao,Y.G.Gao,Z.Gao,B.Garillon,I.Garzia,E.M.Gersabeck,A.Gilman,K.Goetzen,L.Gong,W.X.Gong,W.Gradl,M.Greco,L.M.Gu,M.H.Gu,Y.T.Gu,A.Q.Guo,F.K.Guo,L.B.Guo,R.P.Guo,Y.P.Guo,A.Guskov,S.Han,X.Q.Hao,F.A.Harris,K.L.He,F.H.Heinsius,T.Held,Y.K.Heng,Y.R.Hou,Z.L.Hou,H.M.Hu,J.F.Hu,T.Hu,Y.Hu,G.S.Huang,J.S.Huang,X.T.Huang,X.Z.Huang,Z.L.Huang,N.Huesken,T.Hussain,W.Ikegami Andersson,W.Imoehl,M.Irshad,Q.Ji,Q.P.Ji,X.B.Ji,X.L.Ji,H.L.Jiang,X.S.Jiang,X.Y.Jiang,J.B.Jiao,Z.Jiao,D.P.Jin,S.Jin,Y.Jin,T.Johansson,N.Kalantar-Nayestanaki,X.S.Kang,R.Kappert,M.Kavatsyuk,B.C.Ke,I.K.Keshk,T.Khan,A.Khoukaz,P.Kiese,R.Kiuchi,R.Kliemt,L..Future Physics Programme of BESⅢ[J].Chinese Physics C,2020,44(4). 被引量：538

共引文献33

1潘磊.基于FP-Growth的电力系统故障预测方法[J].软件导刊,2020,19(10):152-155. 被引量：3
2林静怀,刘治宇,李军良,高欣,李泽科,唐志军,余斯航,徐建航.面向不平衡数据分类的高维超球体过采样方法[J].微电子学与计算机,2021,38(5):65-72. 被引量：1
3姚杰,程春玲,韩静,刘峥.云工作流中基于多任务时序卷积网络的异常检测方法[J].计算机应用,2021,41(6):1701-1708. 被引量：3
4戴振邦,江恩杰,刘力嘉,甘江伟.基于分布式管道模式的管道服务框架设计与实现[J].现代信息科技,2021,5(7):44-49.
5沈亮,周敏.基于云架构的"未来医院"信息平台构建及云迁移实践[J].中华医院管理杂志,2021,37(4):293-299. 被引量：15
6陈自力.一种基于K-means聚类的软件测试数据异常检测方法[J].太原师范学院学报（自然科学版）,2021,20(3):38-42. 被引量：2
7牛国臣,吕波漾.融合Cox回归与维纳过程的设备状态评估方法[J].计算机测量与控制,2021,29(11):213-218.
8刘春波,梁孟孟,侯晶雯,顾兆军,王志.面向不稳定日志的一致性异常检测方法[J].湖南大学学报（自然科学版）,2022,49(4):89-99. 被引量：1
9周建国,戴华,杨庚,周倩,王俊.基于并列GRU分类模型的日志异常检测方法[J].南京理工大学学报,2022,46(2):198-204. 被引量：3
10欧萍.数据挖掘技术在软件工程中的应用[J].长江信息通信,2022,35(5):71-73. 被引量：3

计算机应用研究

2024年第7期

浏览历史

内容加载中请稍等...

基于双向时间卷积网络的半监督日志异常检测

参考文献2

二级参考文献6

共引文献33

相关作者

相关机构

相关主题

浏览历史