Detection outliers on internet of things using big data technology

Cited by: 26
Authors
Ghallab, Haitham [1]
Fahmy, Hanan [1]
Nasr, Mona [1]
Affiliations
[1] Helwan Univ, Dept Informat Syst, Cairo, Egypt
Keywords
Internet of things; IoT; Big data; Data quality; Outliers Detection; DBSCAN; RDDs; CLUSTERING-ALGORITHM; MR;
DOI
10.1016/j.eij.2019.12.001
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Internet of Things (IoT) is a fundamental concept behind a new technology that promises to be significant in various fields. IoT is a vision in which things or objects equipped with sensors, actuators, and processors communicate with each other over the internet to achieve a meaningful goal. Unfortunately, one of the major challenges affecting IoT is data quality and uncertainty: as data volume increases, noise, inconsistency, and redundancy within the data also increase, causing serious problems for IoT technologies. Because IoT consists of a massive number of heterogeneous, networked embedded devices that generate big data, computing and analyzing such data is very complex. This paper therefore introduces a new model named NRDD-DBSCAN, based on the DBSCAN algorithm and using resilient distributed datasets (RDDs), to detect outliers that affect the data quality of IoT technologies. NRDD-DBSCAN was applied to three datasets of different dimensionality (2-D, 3-D, and 25-D), and the results were promising. Finally, NRDD-DBSCAN was compared with previous approaches, namely the RDD-DBSCAN model and the DBSCAN algorithm. These comparisons showed that NRDD-DBSCAN overcomes the low-dimensionality limitation of the RDD-DBSCAN model as well as the inability of the DBSCAN algorithm to handle IoT data. The conclusion is that the proposed NRDD-DBSCAN model can detect outliers in N-dimensional datasets by using RDDs, and that it can enhance the quality of the data in IoT applications and technologies. (C) 2019 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.
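As a rough illustration of the underlying idea only, the following sketch flags the points that plain DBSCAN would label as noise in an N-dimensional dataset. It is not the NRDD-DBSCAN model from the paper and does not use RDDs or any distributed processing; the function name `dbscan_outliers`, the parameter names `eps` and `min_pts`, and the sample data are assumptions made for this example.

```python
from math import dist  # Euclidean distance for points of any dimension (Python 3.8+)


def dbscan_outliers(points, eps, min_pts):
    """Return indices of points that plain DBSCAN would label as noise.

    Hypothetical helper for illustration: a brute-force O(n^2) version,
    not the distributed NRDD-DBSCAN model described in the abstract.
    """
    n = len(points)
    # eps-neighborhood of every point (each point counts as its own neighbor).
    neighbors = [
        [j for j in range(n) if dist(points[i], points[j]) <= eps]
        for i in range(n)
    ]
    # A core point has at least min_pts points in its eps-neighborhood.
    core = [len(nb) >= min_pts for nb in neighbors]
    # Noise: neither a core point nor within eps of any core point.
    return [
        i for i in range(n)
        if not core[i] and not any(core[j] for j in neighbors[i])
    ]


# Assumed sample data: four readings close together and one far away.
readings = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)]
print(dbscan_outliers(readings, eps=1.5, min_pts=3))  # prints [4]
```

Because the distance function accepts tuples of any length, the same sketch works unchanged for 3-D or 25-D points, although the brute-force neighbor search would not scale to IoT-sized data, which is the gap the paper addresses with RDDs.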
Pages: 131 - 138
Page count: 8
Related papers
50 in total
  • [31] Random forest for big data classification in the internet of things using optimal features
    Lakshmanaprabu, S. K.
    Shankar, K.
    Ilayaraja, M.
    Nasir, Abdul Wahid
    Vijayakumar, V.
    Chilamkurti, Naveen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (10) : 2609 - 2618
  • [32] Simulation of Internet of Things Network for Big Data Analytics
    Manujakshi, B. C.
    Ramesh, K. B.
    Garg, Lalit
    Shashidhar, T. M.
    INFORMATION SYSTEMS AND MANAGEMENT SCIENCE, ISMS 2021, 2023, 521 : 37 - 48
  • [33] Algorithms for Big Data Delivery over the Internet of Things
    Plageras, Andreas P.
    Psannis, Kostas E.
    2017 IEEE 19TH CONFERENCE ON BUSINESS INFORMATICS (CBI), VOL 1, 2017, 1 : 202 - 206
  • [34] Business Information Architecture for Big Data and Internet of Things
    Hadj Sassi, M. Saifeddine
    Chaari Fourati, Lamia
    Ghozzi Jedidi, Faiza
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1749 - 1756
  • [35] The role of big data analytics in industrial Internet of Things
    Rehman, Muhammad Habib Ur
    Yaqoob, Ibrar
    Salah, Khaled
    Imran, Muhammad
    Jayaraman, Prem Prakash
    Perera, Charith
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 99 : 247 - 259
  • [36] A New Architecture for Cognitive Internet of Things and Big Data
    Sassi, Mohamed Saifeddine Hadj
    Jedidi, Faiza Ghozzi
    Fourati, Lamia Chaari
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 534 - 543
  • [37] Big Data, the Internet of Things, and the Revised Knowledge Pyramid
    Jennex, Murray E.
    DATA BASE FOR ADVANCES IN INFORMATION SYSTEMS, 2017, 48 (04) : 69 - 79
  • [38] Internet of Things, big data and the economics of networked vehicles
    Knieps, Guenter
    TELECOMMUNICATIONS POLICY, 2019, 43 (02) : 171 - 181
  • [39] BIG DATA AND INTERNET OF THINGS IN THE PRODUCTION OF ORGANIC BANANAS
    Vite Cevallos, Harry
    Townsend Valencia, Jose
    Carvajal Romero, Hector
    REVISTA UNIVERSIDAD Y SOCIEDAD, 2020, 12 (04) : 192 - 200
  • [40] The Optimization of Big Data Platform under the Internet of Things
    Wang, Suzhen
    Zhang, Yanpiao
    Zhang, Lu
    Cao, Ning
    2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 126 - 129