Benchmarking large-scale data management for Internet of Things

被引:7
|
作者
Hendawi, Abdeltawab [1 ,2 ]
Gupta, Jayant [3 ]
Liu, Jiayi [6 ]
Teredesai, Ankur [7 ]
Ramakrishnan, Naveen [8 ]
Shah, Mohak [6 ]
El-Sappagh, Shaker [4 ,5 ]
Kwak, Kyung-Sup [4 ]
Ali, Mohamed [7 ]
机构
[1] Univ Rhode Isl, Dept Comp Sci & Stat, Kingston, RI 02881 USA
[2] Cairo Univ, Fac Comp & Informat, Giza, Egypt
[3] Univ Minnesota, Comp Sci & Engn, Minneapolis, MN USA
[4] Inha Univ, Dept Informat & Commun Engn, Incheon, South Korea
[5] Benha Univ, Fac Comp & Informat, Informat Syst Dept, Kaliobeya, Egypt
[6] LG Elect, Seoul, South Korea
[7] Univ Washington, Ctr Data Sci, Tacoma, WA USA
[8] Robert Bosch LLC, Ctr AI, Palo Alto, CA USA
来源
JOURNAL OF SUPERCOMPUTING | 2019年 / 75卷 / 12期
基金
新加坡国家研究基金会;
关键词
Benchmarking; NoSQL; Distributed data management; Parallel data management; Internet of things (IoT); MongoDB; Cassandra; HBase; CHALLENGES;
D O I
10.1007/s11227-019-02984-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era of the Internet of Things (IoT), massive number of sensors are used in our daily lives. Sensors are everywhere around us. They exist in our homes, work places, streets, cars, and even ourselves. Examples include home appliances, wearable devices, and medical sensors. These sensors generate huge amount of dynamic, heterogeneous, and unstructured data that need special handling beyond the capabilities of conventional relational databases. Thus, identification of suitable data management platform to store and query this data is necessary. Despite of its popularity and efficiency in processing various types of big data, there is no single-guided study of how NoSQL data stores will behave with the Internet of Things (IoT) datasets. IoT data have its own characteristics that make it special. IoT data come from various sensors, with a wide range of formats, high velocity, and require high throughput processing with low latency. NoSQL data stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data store performs the best from the two scalability aspects (scale-up and scale-out) in a distributed and parallel processing environment. This paper benchmarks the commonly used NoSQL data stores (MongoDB, Cassandra, and HBase), and compares their performance with real industrial IoT dataset. In addition, we focus on comparing the throughput, latency, and run time of the evaluated NoSQL data stores.
引用
收藏
页码:8207 / 8230
页数:24
相关论文
共 50 条
  • [41] Operating Systems for Internet of Things Low-End Devices: Analysis and Benchmarking
    Silva, Miguel
    Cerdeira, David
    Pinto, Sandro
    Gomes, Tiago
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) : 10375 - 10383
  • [42] Data management and internet of things : A methodological review in smart farming
    Debauche, Olivier
    Trani, Jean-Philippe
    Mahmoudi, Said
    Manneback, Pierre
    Bindelle, Jerome
    Mahmoudi, Sidi Ahmed
    Guttadauria, Adriano
    Lebeau, Frederic
    INTERNET OF THINGS, 2021, 14
  • [43] Benchmarking large-scale subset selection in evolutionary multi-objective optimization
    Shang, Ke
    Shu, Tianye
    Ishibuchi, Hisao
    Nan, Yang
    Pang, Lie Meng
    INFORMATION SCIENCES, 2023, 622 : 755 - 770
  • [44] Hadoop-HBase for Large-Scale Data
    Vora, Mehul Nalin
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 601 - 605
  • [45] A Data Management Strategy for Property Management Information System Based on the Internet of Things
    Dai F.
    Ingenierie des Systemes d'Information, 2020, 25 (03): : 337 - 343
  • [46] Facility Information Management on HBase: Large-Scale Storage for Time-Series Data
    Ochiai, Hideya
    Ikegami, Hiroyuki
    Teranishi, Yuuichi
    Esaki, Hiroshi
    2014 38TH ANNUAL IEEE INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW 2014), 2014, : 306 - 311
  • [47] Large-scale data envelopment analysis (DEA) implementation: a strategic performance management approach
    Medina-Borja, A.
    Pasupathy, K. S.
    Triantis, K.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2007, 58 (08) : 1084 - 1098
  • [48] DATA REGULATION IN THE INTERNET OF THINGS
    Kroenke, Christoph
    FRONTIERS OF LAW IN CHINA, 2018, 13 (03) : 367 - 379
  • [49] Two Time-Scale Resource Management for Green Internet of Things Networks
    Zhang, Deyu
    Qiao, Ying
    She, Liang
    Shen, Ruyin
    Ren, Ju
    Zhang, Yaoxue
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (01): : 545 - 556
  • [50] A Response-Aware Traffic Offloading Scheme Using Regression Machine Learning for User-Centric Large-Scale Internet of Things
    Manogaran, Gunasekaran
    Srivastava, Gautam
    Muthu, Bala Anand
    Baskar, S.
    Shakeel, P. Mohamed
    Hsu, Ching-Hsien
    Bashir, Ali Kashif
    Kumar, Priyan M.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05): : 3360 - 3368