An Efficient Adaptive Multi-Kernel Learning With Safe Screening Rule for Outlier Detection

被引:0
作者
Wang, Xinye [1 ]
Duan, Lei [1 ]
He, Chengxin [1 ]
Chen, Yuanyuan [1 ]
Wu, Xindong [2 ]
机构
[1] Sichuan Univ, Sch Comp Sci, Chengdu 610065, Peoples R China
[2] Zhejiang Lab, Res Ctr Knowledge Engn, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-kernel learning; safe screening rule (SSR); support vector data description (SVDD); outlier detection;
D O I
10.1109/TKDE.2023.3330708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in multi-kernel-based methods for outlier detection have positioned them as an attractive way to detect instances that are markedly different from the remaining data in a dataset. Currently, most outlier detection approaches based on multi-kernel learning are simply a convex combination of various kernels with handcrafted weights, meaning that these weights may not be suitable. Meanwhile, this combination of weights does not sufficiently consider the intrinsic correlations of instances when fusing different kernels. Thus, a key challenge is how to adaptively learn an appropriate combination of weights for capturing a new feature space in which outliers can be better detected than the original space. Simultaneously, it is still a burning issue to get the optimal combination of weights due to considerable computational cost and memory usage when the feature or instance size is large. In this paper, we propose a novel method for efficient adaptive multi-kernel for outlier detection (EAMOD), which automatically learns the optimal weight for each training instance under different kernels using a non-negative function. In addition, we design a safe screening rule (SSR) for EAMOD to improve its training efficiency without any loss of accuracy. To the best of our knowledge, it is the first attempt to develop SSR for multi-kernel-based outlier detection methods. Extensive experiments show that EAMOD is effective and efficient.
引用
收藏
页码:3656 / 3669
页数:14
相关论文
共 65 条
  • [1] Aggarwal C. C., 2001, SIGMOD Record, V30, P37, DOI 10.1145/376284.375668
  • [2] Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
  • [3] EasyMKL: a scalable multiple kernel learning algorithm
    Aiolli, Fabio
    Donini, Michele
    [J]. NEUROCOMPUTING, 2015, 169 : 215 - 224
  • [4] A Novel Outlier Detection Method for Multivariate Data
    Almardeny, Yahya
    Boujnah, Noureddine
    Cleary, Frances
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4052 - 4062
  • [5] [Anonymous], 2002, P 6 PACIFIC ASIA C A, DOI DOI 10.1007/3-540-47887-6_53
  • [6] Arning A., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P164
  • [7] Atamt A., 2020, P 37 INT C MACHINE L, V119, P421
  • [8] Isolation-based anomaly detection using nearest-neighbor ensembles
    Bandaragoda, Tharindu R.
    Ting, Kai Ming
    Albrecht, David
    Liu, Fei Tony
    Zhu, Ye
    Wells, Jonathan R.
    [J]. COMPUTATIONAL INTELLIGENCE, 2018, 34 (04) : 968 - 998
  • [9] Bao RX, 2020, PR MACH LEARN RES, V119
  • [10] LOF: Identifying density-based local outliers
    Breunig, MM
    Kriegel, HP
    Ng, RT
    Sander, J
    [J]. SIGMOD RECORD, 2000, 29 (02) : 93 - 104