Benchmarking large-scale data management for Internet of Things

Cited by: 7
Authors
Hendawi, Abdeltawab [1, 2]
Gupta, Jayant [3]
Liu, Jiayi [6]
Teredesai, Ankur [7]
Ramakrishnan, Naveen [8]
Shah, Mohak [6]
El-Sappagh, Shaker [4, 5]
Kwak, Kyung-Sup [4]
Ali, Mohamed [7]
Affiliations
[1] Univ Rhode Isl, Dept Comp Sci & Stat, Kingston, RI 02881 USA
[2] Cairo Univ, Fac Comp & Informat, Giza, Egypt
[3] Univ Minnesota, Comp Sci & Engn, Minneapolis, MN USA
[4] Inha Univ, Dept Informat & Commun Engn, Incheon, South Korea
[5] Benha Univ, Fac Comp & Informat, Informat Syst Dept, Kaliobeya, Egypt
[6] LG Elect, Seoul, South Korea
[7] Univ Washington, Ctr Data Sci, Tacoma, WA USA
[8] Robert Bosch LLC, Ctr AI, Palo Alto, CA USA
Source
JOURNAL OF SUPERCOMPUTING | 2019 / Vol. 75 / Issue 12
Funding
National Research Foundation, Singapore;
Keywords
Benchmarking; NoSQL; Distributed data management; Parallel data management; Internet of things (IoT); MongoDB; Cassandra; HBase; CHALLENGES;
DOI
10.1007/s11227-019-02984-6
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
In the current era of the Internet of Things (IoT), a massive number of sensors are used in our daily lives. Sensors are everywhere around us: in our homes, workplaces, streets, cars, and even on ourselves. Examples include home appliances, wearable devices, and medical sensors. These sensors generate huge amounts of dynamic, heterogeneous, and unstructured data that need special handling beyond the capabilities of conventional relational databases. Thus, identifying a suitable data management platform to store and query these data is necessary. Despite the popularity and efficiency of NoSQL data stores in processing various types of big data, there is no single guiding study of how they behave with IoT datasets. IoT data have their own distinguishing characteristics: they come from various sensors, in a wide range of formats and at high velocity, and they require high-throughput processing with low latency. NoSQL data stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data store performs best in terms of the two scalability aspects (scale-up and scale-out) in a distributed and parallel processing environment. This paper benchmarks three commonly used NoSQL data stores (MongoDB, Cassandra, and HBase) and compares their performance on a real industrial IoT dataset. In particular, we focus on comparing the throughput, latency, and run time of the evaluated NoSQL data stores.
Pages: 8207 - 8230
Number of pages: 24
Related papers
50 in total
  • [21] Big Spatial Data Management for the Internet of Things: A Survey
    Al Jawarneh, Isam Mashhour
    Bellavista, Paolo
    Corradi, Antonio
    Foschini, Luca
    Montanari, Rebecca
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2020, 28 (04) : 990 - 1035
  • [22] Geographically distributed data management to support large-scale data analysis
    Emara, Tamer Z.
    Trinh, Thanh
    Huang, Joshua Zhexue
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [23] Recommendation Based on Large-Scale Many-Objective Optimization for the Intelligent Internet of Things System
    Cao, Bin
    Zhang, Yatian
    Zhao, Jianwei
    Liu, Xin
    Skonieczny, Lukasz
    Lv, Zhihan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (16) : 15030 - 15038
  • [24] Research on the Large-scale Database Optimization Algorithm under the Environment of Cloud Computing and Internet of Things
    Chen, Liwei
    PROCEEDINGS OF THE 2015 CONFERENCE ON INFORMATIZATION IN EDUCATION, MANAGEMENT AND BUSINESS, 2015, 20 : 17 - 21
  • [25] DEMAND SIDE MANAGEMENT OF INTERNET OF THINGS DATA
    Oprea, Simona Vasilica
    Tudorica, Bogdan George
    Belciu, Anda
    Botha, Iuliana
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY (IE 2017): EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2017, : 311 - 316
  • [26] The research on abnormal signal retrieval methods for differences equipments under the framework of large-scale Internet of Things
    Li Liantian
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1060 - 1063
  • [27] Technical configurations of the Internet of Things for environmental monitoring at large-scale coal-fired power plants
    Zhu, Guoxun
    Tian, Ye
    Zhou, Yuyong
    Dong, Rencai
    INTERNATIONAL JOURNAL OF SUSTAINABLE DEVELOPMENT AND WORLD ECOLOGY, 2017, 24 (05) : 450 - 455
  • [28] BDIM: A Blockchain-Based Decentralized Identity Management Scheme for Large Scale Internet of Things
    Xiong, Ruoting
    Ren, Wei
    Hao, Xiaohan
    He, Jie
    Choo, Kim-Kwang Raymond
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (24) : 22581 - 22590
  • [29] VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository
    Hu, Kevin
    Gaikwad, Snehalkumar 'Neil' S.
    Hulsebos, Madelon
    Bakker, Michiel A.
    Zgraggen, Emanuel
    Hidalgo, Cesar
    Kraska, Tim
    Li, Guoliang
    Satyanarayan, Arvind
    Demiralp, Cagatay
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [30] Configure and Management of Internet of Things
    Rao, M. Varaprasad
    Raju, K. Srujan
    Murthy, G. Vishnu
    Rani, B. Kavitha
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 163 - 172