Correlation-based detection of attribute outliers

被引:0
|
作者
Koh, Judice L. Y. [1 ,2 ]
Lee, Mong Li [2 ]
Hsu, Wynne [2 ]
Lam, Kai Tak [3 ]
机构
[1] Inst Infocomm Res, Singapore 119613, Singapore
[2] Natl Univ Singapore, Sch Comput, Singapore 117548, Singapore
[3] Inst High Performance Comput, Singapore 117528, Singapore
来源
ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS | 2007年 / 4443卷
关键词
outlier detection; data cleaning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An outlier is an object that does not conform to the normal behavior of the data set. In data cleaning, outliers are identified for data noise reduction. In applications such as fraud detection, and stock market analysis, outliers suggest abnormal behavior requiring further investigation. Existing outlier detection methods have focused on class outliers and research on attribute outliers is limited, despite the equal role attribute outliers play in depreciating data quality and reducing data mining accuracy. In this paper, we propose a novel method to detect attribute outliers from the deviating correlation behavior of attributes. We formulate three metrics to evaluate outlier-ness of attributes, and introduce an adaptive factor to distinguish outliers from non-outliers. Experiments with both synthetic and real-world data sets indicate that the proposed method is effective in detecting attribute outliers.
引用
收藏
页码:164 / +
页数:2
相关论文
共 50 条
  • [41] Grid Anchor Lane Detection Based on Attribute Correlation
    Feng, Qiaohui
    Chi, Cheng
    Chen, Fei
    Shen, Jianhao
    Xu, Gang
    Wen, Huajie
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [42] Correlation-based feature partition regression method for unsupervised anomaly detection
    Liu, Zhiyu
    Gao, Xin
    Jia, Xin
    Xue, Bing
    Fu, Shiyuan
    Li, Kangsheng
    Huang, Xu
    Huang, Zijian
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15074 - 15090
  • [43] Damage detection and correlation-based localization using wireless mote sensors
    Clayton, EH
    Koh, BH
    Xing, GL
    Fok, CL
    Dyke, SJ
    Lu, CY
    2005 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL & 13TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1 AND 2, 2005, : 304 - 309
  • [44] CORRELATION-BASED LAW OF EFFECT
    BAUM, WM
    JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 1973, 20 (01) : 137 - 153
  • [45] Correlation-Based Transformer Tracking
    Zhong, Minghan
    Chen, Fanglin
    Xu, Jun
    Lu, Guangming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 85 - 96
  • [46] Malware Detection Using Deep Learning and Correlation-Based Feature Selection
    Alomari, Esraa Saleh
    Nuiaa, Riyadh Rahef
    Alyasseri, Zaid Abdi Alkareem
    Mohammed, Husam Jasim
    Sani, Nor Samsiah
    Esa, Mohd Isrul
    Musawi, Bashaer Abbuod
    SYMMETRY-BASEL, 2023, 15 (01):
  • [47] Non-interactive watermark detection for a correlation-based watermarking scheme
    Adelsbach, A
    Rohe, M
    Sadeghi, AR
    COMMUNICATIONS AND MULTIMEDIA SECURITY, 2005, 3677 : 129 - 139
  • [48] Linear Correlation-Based Feature Selection for Network Intrusion Detection Model
    Eid, Heba F.
    Hassanien, Aboul Ella
    Kim, Tai-hoon
    Banerjee, Soumya
    ADVANCES IN SECURITY OF INFORMATION AND COMMUNICATION NETWORKS, 2013, 381 : 240 - +
  • [49] On Optimal Correlation-Based Prediction
    Bottai, Matteo
    Kim, Taeho
    Lieberman, Benjamin
    Luta, George
    Pena, Edsel
    AMERICAN STATISTICIAN, 2022, 76 (04): : 313 - 321
  • [50] A Multi-level Correlation-Based Feature Selection for Intrusion Detection
    Mahendra Prasad
    Rahul Kumar Gupta
    Sachin Tripathi
    Arabian Journal for Science and Engineering, 2022, 47 : 10719 - 10729