Isolation Forest Based Anomaly Detection Framework on Non-IID Data

被引:10
|
作者
Xiang, Haolong [1 ]
Wang, Jiayu [2 ]
Ramamohanarao, Kotagiri [2 ]
Salcic, Zoran [1 ]
Dou, Wanchun [3 ]
Zhang, Xuyun [4 ]
机构
[1] Univ Auckland, Auckland 1010, New Zealand
[2] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[3] Nanjing Univ, Nanjing 210093, Peoples R China
[4] Macquarie Univ, Sydney, NSW 2109, Australia
基金
澳大利亚研究理事会;
关键词
Anomaly detection; Data mining; Measurement; Extraterrestrial measurements; Hash functions; Intelligent systems;
D O I
10.1109/MIS.2021.3057914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomaly detection is a significant but challenging data mining task in a wide range of applications. Different domains usually use different ways to measure the characteristics of data and to define the anomaly types. As a result, it is a big challenge to develop a versatile anomaly detection framework that can be universally applied with satisfactory performance in most, if not all, applications. In this article, we propose a generic isolation forest based ensemble framework named EDBHiForest, which can be universally applied to data spaces with arbitrary distance measures. It is realized through embedding the isolation forest structure with extended distance-based hashing (EDBH), which can significantly enhance the versatility and applicability of isolation forest based anomaly detection. This framework overcomes the limitations of existing isolation forest based methods that can only be applied to datasets with a very limited range of distance measure types. Extensive experiments on various non-independent and identically distributed datasets demonstrate the effectiveness and efficiency of our approach.
引用
收藏
页码:31 / 40
页数:10
相关论文
共 50 条
  • [1] Homophily outlier detection in non-IID categorical data
    Guansong Pang
    Longbing Cao
    Ling Chen
    Data Mining and Knowledge Discovery, 2021, 35 : 1163 - 1224
  • [2] Homophily outlier detection in non-IID categorical data
    Pang, Guansong
    Cao, Longbing
    Chen, Ling
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (04) : 1163 - 1224
  • [3] Secure and decentralized federated learning framework with non-IID data based on blockchain
    Zhang, Feng
    Zhang, Yongjing
    Ji, Shan
    Han, Zhaoyang
    HELIYON, 2024, 10 (05)
  • [4] FedPD: A Federated Learning Framework With Adaptivity to Non-IID Data
    Zhang, Xinwei
    Hong, Mingyi
    Dhople, Sairaj
    Yin, Wotao
    Liu, Yang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 (69) : 6055 - 6070
  • [5] Federated Learning-Based IoT Intrusion Detection on Non-IID Data
    Huang, Wenxuan
    Tiropanis, Thanassis
    Konstantinidis, George
    INTERNET OF THINGS, GIOTS 2022, 2022, 13533 : 326 - 337
  • [6] Fedward: Flexible Federated Backdoor Defense Framework with Non-IID Data
    Chen, Zekai
    Wang, Fuyi
    Zheng, Zhiwei
    Liu, Ximeng
    Lin, Yujie
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 348 - 353
  • [7] FedCD: A Classifier Debiased Federated Learning Framework for Non-IID Data
    Long, Yunfei
    Xue, Zhe
    Chu, Lingyang
    Zhang, Tianlong
    Wu, Junjiang
    Zang, Yu
    Du, Junping
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8994 - 9002
  • [8] Federated Learning Framework for IID and Non-IID datasets of Medical Images
    Srinivasasn, Kavitha
    Prasanna, Sainath
    Midha, Rohit
    Mohan, Shraddhaa
    EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2023, 11 (01) : 1 - 20
  • [9] A Straggler-resilient Federated Learning Framework for Non-IID Data Based on Harmonic Coding
    Tang, Weiheng
    Chen, Lin
    Chen, Xu
    2023 19TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN 2023, 2023, : 512 - 519
  • [10] Federated Learning Framework for Brain Tumor Detection Using MRI Images in Non-IID Data Distributions
    Muntaqim, MD. Zahin
    Smrity, Tangin Amir
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,