Sieve: Attention-based Sampling of End-to-End Trace Data in Distributed Microservice Systems

被引:11
作者
Huang, Zicheng [1 ]
Chen, Pengfei [1 ]
Yu, Guangba [1 ]
Chen, Hongyang [1 ]
Zheng, Zibin [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021 | 2021年
基金
中国国家自然科学基金;
关键词
End-to-end tracing; Weighted sampling; Microservice; Robust Random Cut Forest;
D O I
10.1109/ICWS53863.2021.00063
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
End-to-end tracing plays an important role in understanding and monitoring distributed microservice systems. The trace data are valuable to help find out the anomalous or erroneous behavior of the system. However, the volume of trace data is huge leading to a heavy burden on analyzing and storing them. To reduce the volume of trace data, the sampling technique is widely adopted. However, existing uniform sampling approaches are unable to capture uncommon traces that are more interesting and informative. To tackle this problem, we design and implement Sieve, an online sampler that aims to bias sampling towards uncommon traces by taking advantage of the attention mechanism. The evaluation results on the trace datasets collected from real-world and experimental microservice systems show that Sieve is effective to increase sampling probabilities of the structurally and temporally uncommon traces and reduce the storage space to a large extent by taking a low sampling rate.
引用
收藏
页码:436 / 446
页数:11
相关论文
共 27 条
[11]   Sifter: Scalable Sampling for Distributed Traces, without Feature Engineering [J].
Las-Casas, Pedro ;
Papakerashvili, Giorgi ;
Anand, Vaastav ;
Mace, Jonathan .
PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, :312-324
[12]   Weighted Sampling of Execution Traces: Capturing More Needles and Less Hay [J].
Las-Casas, Pedro ;
Mace, Jonathan ;
Guedes, Dorgival ;
Fonseca, Rodrigo .
PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, :326-332
[13]   Microscope: Pinpoint Performance Issues with Causal Graphs in Micro-service Environments [J].
Lin, Jinjin ;
Chen, Pengfei ;
Zheng, Zibin .
SERVICE-ORIENTED COMPUTING (ICSOC 2018), 2018, 11236 :3-20
[14]   Isolation-Based Anomaly Detection [J].
Liu, Fei Tony ;
Ting, Kai Ming ;
Zhou, Zhi-Hua .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (01)
[15]   JCallGraph: Tracing Microservices in Very Large Scale Container Cloud Platforms [J].
Liu, Haifeng ;
Zhang, Jinjun ;
Shan, Huasong ;
Li, Min ;
Chen, Yuan ;
He, Xiaofeng ;
Li, Xiaowei .
CLOUD COMPUTING - CLOUD 2019, 2019, 11513 :287-302
[16]   Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks [J].
Liu, Ping ;
Xu, Haowen ;
Ouyang, Qianyu ;
Jiao, Rui ;
Chen, Zhekang ;
Zhang, Shenglin ;
Yang, Jiahai ;
Mo, Linlin ;
Zeng, Jice ;
Xue, Wenman ;
Pei, Dan .
2020 IEEE 31ST INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2020), 2020, :48-58
[17]   AutoMAP: Diagnose Your Microservice-based Web Applications Automatically [J].
Ma, Meng ;
Wang, Ping ;
Xu, Jingmin ;
Wang, Yuan ;
Chen, Pengfei ;
Zhang, Zonghua .
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, :246-258
[18]  
Narasimhan P., P IEEE IFIP INT C DE, P1, DOI [10.1109/dsn.2012.6263927, DOI 10.1109/DSN.2012.6263927]
[19]  
Reynolds P, 2006, USENIX ASSOCIATION PROCEEDINGS OF THE 3RD SYMPOSIUM ON NETWORKED SYSTEMS DESIGN & IMPLEMENTATION (NSDI 06), P115
[20]  
Sambasivan R. R., 2011, P 8 USENIX C NETWORK, P43