TDAG: A Tunable Distributed Data Processing Model for Data Stream

被引:0
作者
Tang, Jintao [1 ]
Lin, Xuelian [1 ]
Shen, Yang [1 ]
Wo, Tianyu [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Inst Adv Comp Technol, Beijing, Peoples R China
来源
2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017) | 2017年
关键词
Data Stream; Data Processing Model; Tunable;
D O I
10.1109/ISPA/IUCC.2017.00070
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the Internet of Vehicles (IoV) becomes flourishing and the data generated by sensors be ubiquitous, there exist various kinds of IoV applications with different performance requirements. Hence, different distributed data processing systems (DDPS) clusters will coexist, e.g., a stream processing system cluster for real-time tasks and a batch one for statistics based data mining tasks, to meet the requirements of such IoV applications. However, it is not an economical or convenient way to maintain varied systems clusters, as the developers and/or administrators have to be familiar with all of these DDPSs, and of course, the deployment of multiple DDPS means a waste of resources compared to the deployment of one DDPS. Based on these observations, this paper proposes the TDAG as a solution. TDAG allows users to adjust the data processing from the streaming style to the batch style by encapsulating the input data with specific packing strategies. We have implemented TDAG in a prototype called TStream. The experimental tests show that our TStream is both effective and efficient.
引用
收藏
页码:433 / 437
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 2011, P 6 INT WORKSH NETW
[2]  
Carbone P., 2015, IEEE DATA ENG B
[3]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[4]   Twitter Heron: Stream Processing at Scale [J].
Kulkarni, Sanjeev ;
Bhagat, Nikunj ;
Fu, Maosong ;
Kedigehalli, Vikas ;
Kellogg, Christopher ;
Mittal, Sailesh ;
Patel, Jignesh M. ;
Ramasamy, Karthik ;
Taneja, Siddarth .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :239-250
[5]   The ganglia distributed monitoring system: design, implementation, and experience [J].
Massie, ML ;
Chun, BN ;
Culler, DE .
PARALLEL COMPUTING, 2004, 30 (07) :817-840
[6]  
Shvachko K, 2010, IEEE S MASS STOR SYS
[7]  
Somasundaram N., 2014, APACHE SAMZA STREAM
[8]   Storm @Twitter [J].
Toshniwa, Ankit ;
Taneja, Siddarth ;
Shukla, Amit ;
Ramasamy, Karthik ;
Patel, Jignesh M. ;
Kulkarni, Sanjeev ;
Jackson, Jason ;
Gade, Krishna ;
Fu, Maosong ;
Donham, Jake ;
Bhagat, Nikunj ;
Mittal, Sailesh ;
Ryaboy, Dmitriy .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :147-156
[9]   ZEST: a Hybrid Model on Predicting Passenger Demand for Chauffeured Car Service [J].
Wei, Hua ;
Wang, Yuandong ;
Wo, Tianyu ;
Liu, Yaxiao ;
Xu, Jie .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :2203-2208
[10]  
Zaharia M., 2010, 2 USENIX WORKSHOP HO, V10, P95