Anomaly Detection Based on GCNs and DBSCAN in a Large-Scale Graph

被引:3
作者
Emane, Christopher Retiti Diop [1 ]
Song, Sangho [1 ]
Lee, Hyeonbyeong [1 ]
Choi, Dojin [2 ]
Lim, Jongtae [1 ]
Bok, Kyoungsoo [3 ]
Yoo, Jaesoo [1 ]
机构
[1] Chungbuk Natl Univ, Dept Informat & Commun Engn, Chungdae ro 1, Cheongju 28644, South Korea
[2] Changwon Natl Univ, Dept Comp Engn, Changwondaehak ro 20, Chang Won 51140, South Korea
[3] Wonkwang Univ, Dept Artificial Intelligence Convergence, Iksandae 460, Iksan 54538, South Korea
基金
新加坡国家研究基金会;
关键词
anomaly detection; GCNs; DBSCAN; deep learning; clustering algorithms; large-scale graph;
D O I
10.3390/electronics13132625
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection is critical across domains, from cybersecurity to fraud prevention. Graphs, adept at modeling intricate relationships, offer a flexible framework for capturing complex data structures. This paper proposes a novel anomaly detection approach, combining Graph Convolutional Networks (GCNs) and Density-Based Spatial Clustering of Applications with Noise (DBSCAN). GCNs, a specialized deep learning model for graph data, extracts meaningful node and edge representations by incorporating graph topology and attribute information. This facilitates learning expressive node embeddings capturing local and global structural patterns. For anomaly detection, DBSCAN, a density-based clustering algorithm effective in identifying clusters of varying densities amidst noise, is employed. By defining a minimum distance threshold and a minimum number of points within that distance, DBSCAN proficiently distinguishes normal graph elements from anomalies. Our approach involves training a GCN model on a labeled graph dataset, generating appropriately labeled node embeddings. These embeddings serve as input to DBSCAN, identifying clusters and isolating anomalies as noise points. The evaluation on benchmark datasets highlights the superior performance of our approach in anomaly detection compared to traditional methods. The fusion of GCNs and DBSCAN demonstrates a significant potential for accurate and efficient anomaly detection in graphs. This research contributes to advancing graph-based anomaly detection, with promising applications in domains where safeguarding data integrity and security is paramount.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] AMAD: Active learning-based multivariate time series anomaly detection for large-scale IT systems
    Yu, Rongwei
    Wang, Yong
    Wang, Wang
    COMPUTERS & SECURITY, 2024, 137
  • [32] Anomaly detection in large-scale networks: A state-space decision process
    Alghuried, Abdullah
    Moghaddass, Ramin
    JOURNAL OF QUALITY TECHNOLOGY, 2022, 54 (01) : 65 - 92
  • [33] Research on Anomaly Detection Method Based on DBSCAN Clustering Algorithm
    Deng, Dingsheng
    2020 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, COMPUTER TECHNOLOGY AND TRANSPORTATION (ISCTT 2020), 2020, : 439 - 442
  • [34] PVEL-AD: A Large-Scale Open-World Dataset for Photovoltaic Cell Anomaly Detection
    Su, Binyi
    Zhou, Zhong
    Chen, Haiyong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 404 - 413
  • [35] KNN-BLOCK DBSCAN: Fast Clustering for Large-Scale Data
    Chen, Yewang
    Zhou, Lida
    Pei, Songwen
    Yu, Zhiwen
    Chen, Yi
    Liu, Xin
    Du, Jixiang
    Xiong, Naixue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (06): : 3939 - 3953
  • [36] Enhancing anomaly detection with adaptive node inspection in large-scale networks with binary sensors
    Xu, Feiran
    Moghaddass, Ramin
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 189
  • [37] A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
    Abbasi, Ahmed
    Javed, Abdul Rehman Rehman
    Yasin, Amanullah
    Jalil, Zunera
    Kryvinska, Natalia
    Tariq, Usman
    IEEE ACCESS, 2022, 10 : 38885 - 38894
  • [38] HADES: a Hybrid Anomaly Detection System for Large-Scale Cyber-Physical Systems
    Alwan, Ahmed Abdulhasan
    Ciupala, Mihaela Anca
    Baravalle, Andres
    Falcarin, Paolo
    2020 FIFTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2020, : 136 - 142
  • [39] A scalable Bayesian framework for large-scale sensor-driven network anomaly detection
    Xu, Feiran
    Moghaddass, Ramin
    IISE TRANSACTIONS, 2023, 55 (05) : 445 - 462
  • [40] ADF: An Anomaly Detection Framework for Large-Scale PM2.5 Sensing Systems
    Chen, Ling-Jyh
    Ho, Yao-Hua
    Hsieh, Hsin-Hung
    Huang, Shih-Ting
    Lee, Hu-Cheng
    Mahajan, Sachit
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (02): : 559 - 570