BSDG: Anomaly Detection of Microservice Trace Based on Dual Graph Convolutional Neural Network

被引:1
作者
Shi, Kuanzhi [1 ]
Li, Jing [1 ]
Liu, Yuecan [2 ]
Chang, Yuzhu [2 ]
Li, Xuyang [2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Artificial Intelligence, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] State Grid Informat Telecommun Branch, Beijing, Peoples R China
来源
SERVICE-ORIENTED COMPUTING (ICSOC 2022) | 2022年 / 13740卷
关键词
Microservices; Trace; dualGCN; Anomaly detection; FAULT-DIAGNOSIS;
D O I
10.1007/978-3-031-20984-0_12
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microservice architecture has been widely used by more and more developers in recent years. Accurate anomaly detection is crucial for system maintenance. Trace data can reflect the microservice dependency relationship and response time, which has been adopted for microservice anomaly detection now. However, due to the lack of unification modeling framework of response time and call path, the performance of anomaly detection degrades, and difficult to adapt to downstream tasks. To address the above issues, we propose BSDG, a trace anomaly detection method based on a dual graph convolutional neural network (dual-GCN). First, BSDG extracts the microservice call dependencies, combing the learnable node attributes generated by Bi-directional Long Short-Term Memory(BiLSTM) to build an attribute dependency graph combined response time and call path. Then, a self-attention mapping graph is constructed and we use a dualGCN with mutual attention to generate effective feature embedding representation. Finally, BSDG adopts a multilayer perceptron with a new classification loss function to train the model in an end-to-end way for anomaly detection. The experimental results on public benchmarks show that the proposed BDSG outperforms baseline methods. We also conduct experiments on our constructed microservice trace dataset to validate the robustness of BSDG. Experiments show that the BSDG outperforms existing methods in microservice trace anomaly detection.
引用
收藏
页码:171 / 185
页数:15
相关论文
共 20 条
[1]   Self-Supervised Anomaly Detection from Distributed Traces [J].
Bogatinovski, Jasmin ;
Nedelkoski, Sasho ;
Cardoso, Jorge ;
Kao, Odej .
2020 IEEE/ACM 13TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2020), 2020, :342-347
[2]   Trace-based Intelligent Fault Diagnosis for Microservices with Deep Learning [J].
Chen, Hao ;
Wei, Kegang ;
Li, An ;
Wang, Tao ;
Zhang, Wenbo .
2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, :884-893
[3]  
Deng AL, 2021, AAAI CONF ARTIF INTE, V35, P4027
[4]   DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning [J].
Du, Min ;
Li, Feifei ;
Zheng, Guineng ;
Srikumar, Vivek .
CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, :1285-1298
[5]   Seer: Leveraging Big Data to Navigate the Complexity of Performance Debugging in Cloud Microservices [J].
Gan, Yu ;
Zhang, Yanqi ;
Hu, Kelvin ;
Cheng, Dailun ;
He, Yuan ;
Pancholi, Meghna ;
Delimitrou, Christina .
TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, :19-33
[6]  
Li R, 2021, P 59 ANN M ASS COMP, V1, P6319, DOI DOI 10.18653/V1/2021.ACL-LONG.494
[7]   Practical Root Cause Localization for Microservice Systems via Trace Analysis [J].
Li, Zeyan ;
Chen, Junjie ;
Jiao, Rui ;
Zhao, Nengwen ;
Wang, Zhijun ;
Zhang, Shuwei ;
Wu, Yanjun ;
Jiang, Long ;
Yan, Leiqin ;
Wang, Zikai ;
Chen, Zhekang ;
Zhang, Wenchi ;
Nie, Xiaohui ;
Sui, Kaixin ;
Pei, Dan .
2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
[8]   Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks [J].
Liu, Ping ;
Xu, Haowen ;
Ouyang, Qianyu ;
Jiao, Rui ;
Chen, Zhekang ;
Zhang, Shenglin ;
Yang, Jiahai ;
Mo, Linlin ;
Zeng, Jice ;
Xue, Wenman ;
Pei, Dan .
2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, :48-58
[9]   DualGCN: a dual graph convolutional network model to predict cancer drug response [J].
Ma, Tianxing ;
Liu, Qiao ;
Li, Haochen ;
Zhou, Mu ;
Jiang, Rui ;
Zhang, Xuegong .
BMC BIOINFORMATICS, 2022, 23 (SUPPL 4)
[10]   Localizing Faults in Cloud Systems [J].
Mariani, Leonardo ;
Monni, Cristina ;
Pezze, Mauro ;
Riganelli, Oliviero ;
Xin, Rui .
2018 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2018, :262-273