SiEVE: Semantically Encoded Video Analytics on Edge and Cloud

被引:11
作者
Elgamal, Tarek [1 ]
Shi, Shu [2 ,4 ]
Gupta, Varun [3 ]
Jana, Rittwik [2 ]
Nahrstedt, Klara [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] AT&T Labs Res, Atlanta, GA USA
[3] Facebook Inc, Menlo Pk, CA USA
[4] ByteDance, Beijing, Peoples R China
来源
2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS) | 2020年
关键词
D O I
10.1109/ICDCS47774.2020.00182
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in computer vision and neural networks have made it possible for more surveillance videos to be automatically searched and analyzed by algorithms rather than humans. This happened in parallel with advances in edge computing where videos are analyzed over hierarchical clusters that contain edge devices, close to the video source. However, the current video analysis pipeline has several disadvantages when dealing with such advances. For example, video encoders have been designed for a long time to please human viewers and be agnostic of the downstream analysis task (e.g., object detection). Moreover, most of the video analytics systems leverage 2-tier architecture where the encoded video is sent to either a remote cloud or a private edge server but does not efficiently leverage both of them. In response to these advances, we present SIEVE, a 3-tier video analytics system to reduce the latency and increase the throughput of analytics over video streams. In SIEVE, we present a novel technique to detect objects in compressed video streams. We refer to this technique as semantic video encoding because it allows video encoders to be aware of the semantics of the downstream task (e.g., object detection). Our results show that by leveraging semantic video encoding, we achieve close to 100% object detection accuracy with decompressing only 3.5% of the video frames which results in more than 100x speedup compared to classical approaches that decompress every video frame.
引用
收藏
页码:1383 / 1388
页数:6
相关论文
共 8 条
[1]  
[Anonymous], 2017, NOSCOPE GITHUB
[2]   Chameleon: Scalable Adaptation of Video Analytics [J].
Jiang, Junchen ;
Ananthanarayanan, Ganesh ;
Bodik, Peter ;
Sen, Siddhartha ;
Stoica, Ion .
PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, :253-266
[3]   In-Datacenter Performance Analysis of a Tensor Processing Unit [J].
Jouppi, Norman P. ;
Young, Cliff ;
Patil, Nishant ;
Patterson, David ;
Agrawal, Gaurav ;
Bajwa, Raminder ;
Bates, Sarah ;
Bhatia, Suresh ;
Boden, Nan ;
Borchers, Al ;
Boyle, Rick ;
Cantin, Pierre-luc ;
Chao, Clifford ;
Clark, Chris ;
Coriell, Jeremy ;
Daley, Mike ;
Dau, Matt ;
Dean, Jeffrey ;
Gelb, Ben ;
Ghaemmaghami, Tara Vazir ;
Gottipati, Rajendra ;
Gulland, William ;
Hagmann, Robert ;
Ho, C. Richard ;
Hogberg, Doug ;
Hu, John ;
Hundt, Robert ;
Hurt, Dan ;
Ibarz, Julian ;
Jaffey, Aaron ;
Jaworski, Alek ;
Kaplan, Alexander ;
Khaitan, Harshit ;
Killebrew, Daniel ;
Koch, Andy ;
Kumar, Naveen ;
Lacy, Steve ;
Laudon, James ;
Law, James ;
Le, Diemthu ;
Leary, Chris ;
Liu, Zhuyuan ;
Lucke, Kyle ;
Lundin, Alan ;
MacKean, Gordon ;
Maggiore, Adriana ;
Mahony, Maire ;
Miller, Kieran ;
Nagarajan, Rahul ;
Narayanaswami, Ravi .
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, :1-12
[4]   NoScope: Optimizing Neural Network Queries over Video at Scale [J].
Kang, Daniel ;
Emmons, John ;
Abuzaid, Firas ;
Bailis, Peter ;
Zaharia, Matei .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (11) :1586-1597
[5]   Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge [J].
Kang, Yiping ;
Hauswald, Johann ;
Gao, Cao ;
Rovinski, Austin ;
Mudge, Trevor ;
Mars, Jason ;
Tang, Lingjia .
ACM SIGPLAN NOTICES, 2017, 52 (04) :615-629
[6]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[7]  
Ravindra P., 2017, ICSOC
[8]   The Case for VM-Based Cloudlets in Mobile Computing [J].
Satyanarayanan, Mahadev ;
Bahl, Paramvir ;
Caceres, Ramon ;
Davies, Nigel .
IEEE PERVASIVE COMPUTING, 2009, 8 (04) :14-23