Analytics in Motion High Performance Event-Processing AND Real-Time Analytics in the Same Database

被引:29
作者
Braun, Lucas [1 ]
Etter, Thomas [1 ]
Gasparis, Georgios [1 ]
Kaufmann, Martin [1 ]
Kossmann, Donald [1 ]
Widmer, Daniel [1 ]
Avitzur, Aharon [2 ]
Iliopoulos, Anthony [2 ]
Levy, Eliezer [2 ]
Liang, Ning [2 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Syst Grp, Zurich, Switzerland
[2] Huawei Technol, European Res Ctr, Shenzhen, Peoples R China
来源
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2015年
关键词
OLTP/OLAP Engine; Analytics; Event-Processing;
D O I
10.1145/2723372.2742783
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern data-centric flows in the telecommunications industry require real time analytical processing over a rapidly changing and large dataset. The traditional approach of separating OLTP and OLAP workloads cannot satisfy this requirement. Instead, a new class of integrated solutions for handling hybrid workloads is needed. This paper presents an industrial use case and a novel architecture that integrates key-value-based event processing and SQL-based analytical processing on the same distributed store while minimizing the total cost of ownership. Our approach combines several well-known techniques such as shared scans, delta processing, a PAX-fashioned storage layout, and an interleaving of scanning and delta merging in a completely new way. Performance experiments show that our system scales out linearly with the number of servers. For instance, our system sustains event streams of 100,000 events per second while simultaneously processing 100 ad-hoc analytical queries per second, using a cluster of 12 commodity servers. In doing so, our system meets all response time goals of our telecommunication customers; that is, 10 milliseconds per event and 100 milliseconds for an ad-hoc analytical query. Moreover, our system beats commercial competitors by a factor of 2.5 in analytical and two orders of magnitude in update performance.
引用
收藏
页码:251 / 264
页数:14
相关论文
共 33 条
[1]  
Ailamaki A., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P169
[2]   H2O: A Hands-free Adaptive Store [J].
Alagiannis, Ioannis ;
Idreos, Stratos ;
Ailamaki, Anastasia .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :1103-1114
[3]  
Ali Mohamed., 2010, Proceedings of the 1st International Conference and Exhibition on Computing for Geospatial Research Application, P66
[4]  
[Anonymous], 2012, IEEE DATA ENG B
[5]  
[Anonymous], 2009, Proceedings of the VLDB Endowment, DOI DOI 10.14778/1687627.1687671
[6]  
[Anonymous], 2008, P 2008 ACM SIGMOD IN, DOI [10.1145/1376616.1376713, DOI 10.1145/1376616.1376713]
[7]  
[Anonymous], 2011, P 4 INT WORKSH TES D
[8]  
[Anonymous], 2010, P USENIX WORKSH HOT
[9]  
Apache Foundation, AP STORM SYST PROC S
[10]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748