Data-driven Curation, Learning and Analysis for Inferring Evolving loT Botnets in the Wild

被引:3
作者
Pour, Morteza Safaei [1 ]
Mangino, Antonio [1 ]
Friday, Kurt [1 ]
Rathbun, Matthias [1 ]
Bou-Harb, Elias [1 ]
Iqbal, Farkhund [2 ]
Shaban, Khaled [3 ]
Erradi, Abdelkarim [3 ]
机构
[1] Florida Atlantic Univ, Cyber Threat Intelligence Lab, Boca Raton, FL 33431 USA
[2] Zayed Univ, Coll Technol Innovat, Dubai, U Arab Emirates
[3] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar
来源
14TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY (ARES 2019) | 2019年
基金
美国国家科学基金会;
关键词
Internet-of-Things; IoT botnets; network security; network telescopes; Internet measurements; deep learning; INTERNET;
D O I
10.1145/3339252.3339272
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The insecurity of the Internet-of-Things (IoT) paradigm continues to wreak havoc in consumer and critical infrastructure realms. Several challenges impede addressing IoT security at large, including, the lack of IoT-centric data that can be collected, analyzed and correlated, due to the highly heterogeneous nature of such devices and their widespread deployments in Internet-wide environments. To this end, this paper explores macroscopic, passive empirical data to shed light on this evolving threat phenomena. This not only aims at classifying and inferring Internet-scale compromised IoT devices by solely observing such one-way network traffic, but also endeavors to uncover, track and report on orchestrated "in the wild" IoT botnets. Initially, to prepare the effective utilization of such data, a novel probabilistic model is designed and developed to cleanse such traffic from noise samples (i.e., misconfiguration traffic). Subsequently, several shallow and deep learning models are evaluated to ultimately design and develop a multi-window convolution neural network trained on active and passive measurements to accurately identify compromised IoT devices. Consequently, to infer orchestrated and unsolicited activities that have been generated by well-coordinated IoT botnets, hierarchical agglomerative clustering is deployed by scrutinizing a set of innovative and efficient network feature sets. By analyzing 3.6 TB of recent darknet traffic, the proposed approach uncovers a momentous 440,000 compromised IoT devices and generates evidence -based artifacts related to 350 IoT botnets. While some of these detected botnets refer to previously documented campaigns such as the Hide and Seek, Ha j ime and Fbot, other events illustrate evolving threats such as those with cryptojacking capabilities and those that are targeting industrial control system communication and control services.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Deep learning algorithm for data-driven simulation of noisy dynamical system
    Yeo, Kyongmin
    Melnyk, Igor
    JOURNAL OF COMPUTATIONAL PHYSICS, 2019, 376 : 1212 - 1231
  • [32] A data-driven deep learning pipeline for quantitative susceptibility mapping (QSM)
    Wang, Zuojun
    Xia, Peng
    Huang, Fan
    Wei, Hongjiang
    Hui, Edward Sai-Kam
    Mak, Henry Ka-Fung
    Cao, Peng
    MAGNETIC RESONANCE IMAGING, 2022, 88 : 89 - 100
  • [33] Heterogeneous Multi-Party Learning With Data-Driven Network Sampling
    Gong, Maoguo
    Gao, Yuan
    Wu, Yue
    Zhang, Yuanqiao
    Qin, A. K.
    Ong, Yew-Soon
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13328 - 13343
  • [34] Classification of machine learning frameworks for data-driven thermal fluid models
    Chang, Chih-Wei
    Dinh, Nam T.
    INTERNATIONAL JOURNAL OF THERMAL SCIENCES, 2019, 135 : 559 - 579
  • [35] A Survey on Data-Driven Learning for Intelligent Network Intrusion Detection Systems
    Abdelmoumin, Ghada
    Whitaker, Jessica
    Rawat, Danda B.
    Rahman, Abdul
    ELECTRONICS, 2022, 11 (02)
  • [36] Data-driven prediction of unsteady pressure distributions based on deep learning
    Rozov, Vladyslav
    Breitsamter, Christian
    JOURNAL OF FLUIDS AND STRUCTURES, 2021, 104
  • [37] Data-Driven Deep Learning for Automatic Modulation Recognition in Cognitive Radios
    Wang, Yu
    Liu, Miao
    Yang, Jie
    Gui, Guan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (04) : 4074 - 4077
  • [38] Optimization: data-driven management using deep learning in cloud computing
    Karim, Sajida
    He, Hui
    2022 23RD ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS 2022), 2022, : 423 - 426
  • [39] Data-driven hedging of stock index options via deep learning
    Chen, Jie
    Li, Lingfei
    OPERATIONS RESEARCH LETTERS, 2023, 51 (04) : 408 - 413
  • [40] Data-Driven Intelligence System for General Recommendations of Deep Learning Architectures
    Noveski, Gjorgji
    Eftimov, Tome
    Mishev, Kostadin
    Simjanoska, Monika
    IEEE ACCESS, 2021, 9 : 148710 - 148720