An Optimized Pruning-based Outlier Detecting algorithm

被引:0
作者
Wang, Jinghua [1 ]
Zhao, Xinxiang [1 ]
Jin, Peng [1 ]
Zhang, Guoyan [1 ]
机构
[1] Cent China Normal Univ, Acad Comp Sci, Wuhan, Hubei, Peoples R China
来源
INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4 | 2013年 / 411-414卷
关键词
Data mining; Outlier detection; Pruning; Clustering;
D O I
10.4028/www.scientific.net/AMM.411-414.1076
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
An Optimized Pruning-based Outlier Detecting algorithm is proposed based on the density-based outlier detecting algorithm (LOF algorithm). The calculation accuracy and the time complexity of LOP algorithm are not ideal, so two steps are taken to reduce the amount of calculation and improve the calculation accuracy for LOF algorithm. Firstly, using cluster pruning technique to preprocess data set, at the same time filtering the non-outliers based on the differences of cluster models to avoid the error pruning of outliers located at the edge of clusters, different cluster models are output by inputing multiple parameters in the DBSCAN algorithm. Secondly,optimize the query process of the neighborhood (epsilon - neighbor and k- neighbor). After pruning, local outlier factors are calculated only for the data objects out of clusters. Experimental results show that the algorithm proposed in this paper can improve the outlier detection accuracy, reduce the time complexity and realize the effective local outlier detection.
引用
收藏
页码:1076 / 1080
页数:5
相关论文
共 50 条
  • [21] An Outlier Detection Algorithm Based on Arbitrary Shape Clustering
    Su, Xiaoke
    Lan, Yang
    Wan, Renxia
    Qin, Yuming
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 627 - +
  • [22] Outlier detection algorithm based on fluctuation of centroid projection
    Zhang Z.
    Zhang Y.
    Liu W.
    Deng Y.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (12): : 3869 - 3878
  • [23] KNN Based Outlier Detection Algorithm in Large Dataset
    Yang, Peng
    Huang, Biao
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 611 - 613
  • [24] A Data Stream Outlier Detection Algorithm Based on Grid
    Yu Xiang
    Lei Guohua
    Xu Xiandong
    Lin Liandong
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4136 - 4141
  • [25] ERDOF: outlier detection algorithm based on entropy weight distance and relative density outlier factor
    Zhang Z.
    Liu W.
    Zhang Y.
    Deng Y.
    Wei M.
    Tongxin Xuebao/Journal on Communications, 2021, 42 (09): : 133 - 143
  • [26] An Outlier Mining Algorithm Based on Dissimilarity
    Zhou, Ming-jian
    Chen, Xue-jiao
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL II, 2011, : 289 - 291
  • [27] An Improved KNN Based Outlier Detection Algorithm for Large Datasets
    Wang, Qian
    Zheng, Min
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 585 - 592
  • [28] An Outlier Mining Algorithm Based on Dissimilarity
    Zhou, Ming-jian
    Chen, Xue-jiao
    2011 INTERNATIONAL CONFERENCE OF ENVIRONMENTAL SCIENCE AND ENGINEERING, VOL 12, PT B, 2012, 12 : 810 - 814
  • [29] An Outlier Detection Algorithm in Wireless Sensor Network Based on Clustering
    Niu, Kun
    Zhao, Fang
    Qiao, Xiuquan
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 433 - 437
  • [30] Cell-based outlier detection algorithm: A fast outlier detection algorithm for large datasets
    Wan, You
    Bian, Fuling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 1042 - 1048