A crawler architecture for harvesting the clear, social, and dark web for IoT-related cyber-threat intelligence

Cited by: 16
Authors
Koloveas, Paris [1]
Chantzios, Thanasis [1]
Tryfonopoulos, Christos [1]
Skiadopoulos, Spiros [1]
Affiliations
[1] Univ Peloponnese, GR-22131 Tripolis, Greece
Source
2019 IEEE WORLD CONGRESS ON SERVICES (IEEE SERVICES 2019) | 2019
Keywords
IoT; cyber-security; cyber-threat intelligence; crawling architecture; machine learning; language models
DOI
10.1109/SERVICES.2019.00016
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The clear, social, and dark web have lately been identified as rich sources of valuable cyber-security information that, given the appropriate tools and methods, may be identified, crawled, and subsequently leveraged into actionable cyber-threat intelligence. In this work, we focus on the information gathering task, and present a novel crawling architecture for transparently harvesting data from security websites in the clear web, security forums in the social web, and hacker forums/marketplaces in the dark web. The proposed architecture adopts a two-phase approach to data harvesting. In the first phase, a machine learning-based crawler is used to direct the harvesting towards websites of interest, while in the second phase, state-of-the-art statistical language modelling techniques are used to represent the harvested information in a latent low-dimensional feature space and rank it based on its potential relevance to the task at hand. The proposed architecture is realised using exclusively open-source tools, and a preliminary evaluation with crowdsourced results demonstrates its effectiveness.
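The two-phase idea in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the in-memory `PAGES` graph, the `TOPIC` string, and every function name are hypothetical. A keyword-overlap score stands in for the machine learning-based page classifier, and bag-of-words cosine similarity stands in for the latent low-dimensional language-model representation.

```python
import heapq
import math
from collections import Counter

# Hypothetical in-memory "web": url -> (page text, outgoing links).
PAGES = {
    "seed": ("security news portal", ["a", "b"]),
    "a": ("iot botnet exploit targets router firmware", ["c"]),
    "b": ("cooking recipes and travel tips", ["d"]),
    "c": ("iot camera vulnerability exploit", []),
    "d": ("holiday photos", []),
}
TOPIC = "iot exploit vulnerability botnet"  # assumed topic description


def tokens(text):
    return text.lower().split()


def relevance(text, topic=TOPIC):
    """Stand-in for a learned classifier: fraction of topic terms on the page."""
    topic_terms = set(tokens(topic))
    return len(topic_terms & set(tokens(text))) / len(topic_terms)


def focused_crawl(seed, budget=3):
    """Phase 1: best-first crawl. Links inherit the score of the page they
    were found on, so links from on-topic pages are fetched first."""
    frontier = [(-1.0, seed)]  # min-heap on negated priority
    seen = {seed}
    harvested = []
    while frontier and len(harvested) < budget:
        _, url = heapq.heappop(frontier)
        text, links = PAGES[url]
        score = relevance(text)
        harvested.append((url, text))
        for link in links:
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score, link))
    return harvested


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank(harvested, topic=TOPIC):
    """Phase 2: order harvested pages by similarity to the topic vector."""
    query = Counter(tokens(topic))
    scored = [(cosine(Counter(tokens(text)), query), url) for url, text in harvested]
    return [url for _, url in sorted(scored, key=lambda p: (-p[0], p[1]))]


if __name__ == "__main__":
    pages = focused_crawl("seed", budget=3)
    print([url for url, _ in pages])
    print(rank(pages))
```

With a budget of three pages, the crawl in this toy graph follows the on-topic branch and never fetches the off-topic pages; a real system would replace both scoring functions with trained models.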
Pages: 3-8
Page count: 6
Related papers
14 in total
  • [1] Dark-Net Ecosystem Cyber-Threat Intelligence (CTI) Tool
    Arnold, Nolan
    Ebrahimi, Mohammadreza
    Zhang, Ning
    Lazarine, Ben
    Patton, Mark
    Chen, Hsinchun
    Samtani, Sagar
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 92 - 97
  • [2] Towards Safe Cyber Practices: Developing a Proactive Cyber-Threat Intelligence System for Dark Web Forum Content by Identifying Cybercrimes
    Sangher, Kanti Singh
    Singh, Archana
    Pandey, Hari Mohan
    Kumar, Vivek
    INFORMATION, 2023, 14 (06)
  • [3] Dark-Web Cyber Threat Intelligence: From Data to Intelligence to Prediction
    Shakarian, Paulo
    INFORMATION, 2018, 9 (12)
  • [4] Exploring the Dark Web for Cyber Threat Intelligence using Machine Leaning
    Kadoguchi, Masashi
    Hayashi, Shota
    Hashimoto, Masaki
    Otsuka, Akira
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 200 - 202
  • [5] Extracting Threat Intelligence Related IoT Botnet From Latest Dark Web Data Collection
    Furumoto, Keisuke
    Umizaki, Mitsuhiro
    Fujita, Akira
    Nagata, Takahiko
    Takahashi, Takeshi
    Inoue, Daisuke
    IEEE CONGRESS ON CYBERMATICS / 2021 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS (ITHINGS) / IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) / IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) / IEEE SMART DATA (SMARTDATA), 2021, : 138 - 145
  • [6] INTIME: A Machine Learning-Based Framework for Gathering and Leveraging Web Data to Cyber-Threat Intelligence
    Koloveas, Paris
    Chantzios, Thanasis
    Alevizopoulou, Sofia
    Skiadopoulos, Spiros
    Tryfonopoulos, Christos
    ELECTRONICS, 2021, 10 (07)
  • [7] Deep Self-Supervised Clustering of the Dark Web for Cyber Threat Intelligence
    Kadoguchi, Masashi
    Kobayashi, Hanae
    Hayashi, Shota
    Otsuka, Akira
    Hashimoto, Masaki
    2020 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2020, : 163 - 168
  • [8] Threats from the Dark: A Review over Dark Web Investigation Research for Cyber Threat Intelligence
    Basheer, Randa
    Alkhatib, Bassel
    JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2021, 2021
  • [9] Cyber Security Training with Generative Artificial Intelligence Supported Web Platform Using IoT Cyber Threat Scenarios
    Hatipoglu, Zehra
    Yaman, Busra
    Ceylan, Sedanur
    Kose, Utku
    2024 CYBER AWARENESS AND RESEARCH SYMPOSIUM, CARS 2024, 2024
  • [10] Navigating the Shadows: Manual and Semi-Automated Evaluation of the Dark Web for Cyber Threat Intelligence
    Kuehn, Philipp
    Wittorf, Kyra
    Reuter, Christian
    IEEE ACCESS, 2024, 12 : 118903 - 118922