Benchmarking large-scale data management for Internet of Things

被引:7
|
作者
Hendawi, Abdeltawab [1 ,2 ]
Gupta, Jayant [3 ]
Liu, Jiayi [6 ]
Teredesai, Ankur [7 ]
Ramakrishnan, Naveen [8 ]
Shah, Mohak [6 ]
El-Sappagh, Shaker [4 ,5 ]
Kwak, Kyung-Sup [4 ]
Ali, Mohamed [7 ]
机构
[1] Univ Rhode Isl, Dept Comp Sci & Stat, Kingston, RI 02881 USA
[2] Cairo Univ, Fac Comp & Informat, Giza, Egypt
[3] Univ Minnesota, Comp Sci & Engn, Minneapolis, MN USA
[4] Inha Univ, Dept Informat & Commun Engn, Incheon, South Korea
[5] Benha Univ, Fac Comp & Informat, Informat Syst Dept, Kaliobeya, Egypt
[6] LG Elect, Seoul, South Korea
[7] Univ Washington, Ctr Data Sci, Tacoma, WA USA
[8] Robert Bosch LLC, Ctr AI, Palo Alto, CA USA
来源
JOURNAL OF SUPERCOMPUTING | 2019年 / 75卷 / 12期
基金
新加坡国家研究基金会;
关键词
Benchmarking; NoSQL; Distributed data management; Parallel data management; Internet of things (IoT); MongoDB; Cassandra; HBase; CHALLENGES;
D O I
10.1007/s11227-019-02984-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era of the Internet of Things (IoT), massive number of sensors are used in our daily lives. Sensors are everywhere around us. They exist in our homes, work places, streets, cars, and even ourselves. Examples include home appliances, wearable devices, and medical sensors. These sensors generate huge amount of dynamic, heterogeneous, and unstructured data that need special handling beyond the capabilities of conventional relational databases. Thus, identification of suitable data management platform to store and query this data is necessary. Despite of its popularity and efficiency in processing various types of big data, there is no single-guided study of how NoSQL data stores will behave with the Internet of Things (IoT) datasets. IoT data have its own characteristics that make it special. IoT data come from various sensors, with a wide range of formats, high velocity, and require high throughput processing with low latency. NoSQL data stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data store performs the best from the two scalability aspects (scale-up and scale-out) in a distributed and parallel processing environment. This paper benchmarks the commonly used NoSQL data stores (MongoDB, Cassandra, and HBase), and compares their performance with real industrial IoT dataset. In addition, we focus on comparing the throughput, latency, and run time of the evaluated NoSQL data stores.
引用
收藏
页码:8207 / 8230
页数:24
相关论文
共 50 条
  • [31] TinyML Algorithms for Big Data Management in Large-Scale IoT Systems
    Karras, Aristeidis
    Giannaros, Anastasios
    Karras, Christos
    Theodorakopoulos, Leonidas
    Mammassis, Constantinos S.
    Krimpas, George A.
    Sioutas, Spyros
    FUTURE INTERNET, 2024, 16 (02)
  • [32] Benchmarking Internet of Things deployments in Smart Cities
    Le Gall, Franck
    Chevillard, Sophie Vallet
    Gluhak, Alex
    Zhang Xueli
    2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 1319 - 1324
  • [33] Approximate to Be Great: Communication Efficient and Privacy-Preserving Large-Scale Distributed Deep Learning in Internet of Things
    Du, Wei
    Li, Ang
    Zhou, Pan
    Xu, Zichuan
    Wang, Xiumin
    Jiang, Hao
    Wu, Dapeng
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (12) : 11678 - 11692
  • [34] ReapIoT: Reliable, Energy-Aware Network Protocol for Large-Scale Internet-of-Things (IoT) Applications
    Badi, Ahmed
    Mahgoub, Imad
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17): : 13582 - 13592
  • [35] Blockchain Based Data Integrity Verification for Large-Scale IoT Data
    Wang, Haiyan
    Zhang, Jiawei
    IEEE ACCESS, 2019, 7 : 164996 - 165006
  • [36] Survey on Data Management for Healthcare using Internet of Things
    Chaudhari, Dipalee A.
    Umamaheswari, E.
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [37] Internet of Things (IoT) Architecture for Flood Data Management
    Ghapar, Azimah Abdul
    Yussof, Salman
    Abu Bakar, Asmidar
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2018, 11 (01): : 55 - 62
  • [38] EventDB: A Large-Scale Semi-structured Scientific Data Management System
    Zhao, Wenjia
    Qi, Yong
    Hou, Di
    Wang, Peijian
    Gao, Xin
    Du, Zirong
    Zhang, Yudong
    Zong, Yongfang
    BIG SCIENTIFIC DATA MANAGEMENT, 2019, 11473 : 105 - 115
  • [39] Modeling Distributed and Configurable Hierarchical Blockchain over SDN and Fog-Based Networks for Large-Scale Internet of Things
    Salman Azeez Syed
    Deepak Kumar Sharma
    Gautam Srivastava
    Journal of Grid Computing, 2023, 21
  • [40] Modeling Distributed and Configurable Hierarchical Blockchain over SDN and Fog-Based Networks for Large-Scale Internet of Things
    Syed, Salman Azeez
    Sharma, Deepak Kumar
    Srivastava, Gautam
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)