A novel outlier detecting algorithm based on the outlier turning points

被引:14
|
作者
Huang, Jinlong [1 ]
Cheng, Dongdong [1 ]
Zhang, Sulan [1 ]
机构
[1] Yangtze Normal Univ, Coll Big Data & Intelligent Engn, Chongqing 408100, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Outlier detection; Local outliers; Outlier clusters; Outlier turning points; NATURAL NEIGHBORHOOD GRAPH; CLUSTER;
D O I
10.1016/j.eswa.2023.120799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection is one of the hot research in data mining, and has been applied to various fields such as network anomaly detection, image abnormal analysis, etc. In recent years, many outlier detecting algorithms have been proposed. However, these outlier detecting algorithms are hard to effectively detect global outliers, local outliers and outlier clusters at the same time. In this paper, we propose a novel outlier detecting algorithm based on the following ideas: (1) the density distribution should not be changed dramatically on local area; (2) the ratio of the number of k nearest neighbors and the number of reverse k nearest neighbors should not be very big. Based on above ideas, the proposed algorithm aims to find outlier turning points, then regards all outlier turning points and its sparse neighbors as outliers. Furthermore, the proposed algorithm use natural neighbors to obtain the neighborhood parameter k adaptively. The formal analysis and extensive experiments demonstrate that this technique can detect global outliers, local outliers and outlier clusters without neighborhood parameter k.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Hypothesis testing for detecting outlier evaluators
    Xu, Li
    Zucker, David M.
    Wang, Molin
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (02) : 419 - 431
  • [22] Outlier detection based on the distribution of distances between data points
    Saltenis, V
    INFORMATICA, 2004, 15 (03) : 399 - 410
  • [23] A Novel Autoencoder Deep Architecture for Detecting the Outlier in Heterogeneous Data Sets
    Uttarkabat, Satarupa
    Sharma, Abhaya
    Patra, Bidyut Kumar
    ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2022, 14 (03N04)
  • [24] Global High Dimension Outlier Algorithm for Efficient Clustering & Outlier Detection
    Nigam, Nidhi
    Saxena, Tripti
    Richhariya, Vineet
    2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [25] A novel outlier cluster detection algorithm without top-n parameter
    Huang, Jinlong
    Zhu, Qingsheng
    Yang, Lijun
    Cheng, DongDong
    Wu, Quanwang
    KNOWLEDGE-BASED SYSTEMS, 2017, 121 : 32 - 40
  • [26] Outlier detection and quasi-periodicity optimization algorithm: Frequency domain based outlier detection (FOD)
    Erkus, Ekin Can
    Purutcuoglu, Vilda
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (02) : 560 - 574
  • [27] Outlier Detection Algorithm Based on Robust Component Analysis
    Zheng Cha
    Ji Lixin
    Gao Chao
    Li Shaomei
    Wang Yanchuan
    THIRD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2018, 10828
  • [28] A Frequency-Based Algorithm for Workflow Outlier Mining
    Chuang, Yu-Cheng
    Hsu, PingYu
    Wang, MinTzu
    Chen, Sin-Cheng
    FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 191 - 207
  • [29] An Outlier Detection Algorithm Based on Arbitrary Shape Clustering
    Su, Xiaoke
    Lan, Yang
    Wan, Renxia
    Qin, Yuming
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 627 - +
  • [30] Relative Density-Based Outlier Detection Algorithm
    Ning, Jin
    Chen, Leiting
    Chen, Junwei
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 227 - 231