A label noise filtering method for regression based on adaptive threshold and noise score

被引:9
作者
Li, Chuang [1 ]
Mao, Zhizhong [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
关键词
Noise filter; Real-valued label noise; Adaptive noise determination; Noise score; Ensemble filtering; Iterative filtering; CLASSIFICATION; PERFORMANCE; SELECTION; PREDICTION; RANKING; FUSION; TESTS; SET;
D O I
10.1016/j.eswa.2023.120422
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The quality of training data plays a decisive role in the establishment of intelligent models. Since raw data obtained from the real world are usually entwined with noise due to variety of causes, noise filtering has become an important aspect of machine learning techniques. In contrast with the extensive research conducted on noise elimination for classification purposes, papers addressing this problem for regression tasks are rather scarce. In this paper, we propose a novel noise filter to clean noisy instances with real-valued label noise. Aiming at the deficiency of the existing noise determination criterion, a new adaptive threshold-based method is first proposed. It allows a noisy instance to be adaptively defined according to the fitting difficulty levels of different datasets, and areas with different densities. Embedded with this criterion, an effective noise filtering procedure is also designed. An ensemble filtering scheme and an iterative filtering process are combined to detect as many po-tential noisy samples as possible from the original training set. According to the acquire noise detection infor-mation, a noise score for evaluating the noise level is specifically developed. The potential noisy samples whose scores exceed a reasonable threshold are further filtered, which can compensate for the possible errors incurred during the previous procedure, and contribute to more reliable filtering results. The validity of the proposed method is studied in exhaustive experiments. We discuss reasonable hyperparameters, and compare the devel-oped method with several state-of-the-art noise filters. The outcomes show that the prediction accuracy of the utilized regressor can greatly benefit from preprocessing the given raw dataset by using our method. Simulta-neously, the method is able to acquire a good balance between the elimination of noisy samples and the retention of clean samples, and consistently achieves a better noise filtering performance.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Noise adaptive filtering model integrating spatio-temporal feature for soft sensor
    Hu, Xuan
    Zhang, Tianyu
    Geng, Zhiqiang
    Han, Yongming
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 239
  • [42] Lorentzian Based Adaptive Filters for Impulsive Noise Environments
    Das, Rajib Lochan
    Narwaria, Manish
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2017, 64 (06) : 1529 - 1539
  • [43] An adaptive seismic random noise attenuation method based on Engl criterion using curvelet transform
    Yin, Hanjun
    Cao, Jingjie
    Yang, Helong
    Chen, Xue
    JOURNAL OF APPLIED GEOPHYSICS, 2024, 227
  • [44] Error-Based Noise Filtering During Neural Network Training
    Alharbi, Fahad
    Hindi, Khalil El
    Al-Ahmadi, Saad
    IEEE ACCESS, 2020, 8 : 156996 - 157004
  • [45] Experimental Study on the Curve Evolution Method for Noise Filtering in the Inertial State Estimation
    Huang, L.
    Lotti, B.
    2017 3RD INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2017, : 95 - 98
  • [46] A mathematical programming approach to SVM-based classification with label noise
    Blanco, Victor
    Japon, Alberto
    Puerto, Justo
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 172
  • [47] A two-stage ensemble method for the detection of class-label noise
    Sabzevari, Maryam
    Martinez-Munoz, Gonzalo
    Suarez, Alberto
    NEUROCOMPUTING, 2018, 275 : 2374 - 2383
  • [48] Bearing fault diagnosis method based on improved meta-ResNet and sample weighting under noise label
    Xie, Suchao
    Wang, Jiacheng
    Li, Yaxin
    Yang, Lingzhi
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2024,
  • [49] KalmanTune: A Kalman Filter Based Tuning Method to Make Boosted Ensembles Robust to Class-Label Noise
    Pakrashi, Arjun
    Namee, Brian Mac
    IEEE ACCESS, 2020, 8 : 145887 - 145897
  • [50] Adaptive Tap-Length Based Sub-band Mean M-Estimate Filtering for Active Noise Cancellation
    Kar, Asutosh
    Shoba, S.
    Burra, Srikanth
    Goel, Pankaj
    Kumar, Sanjeev
    Vasundhara, Vladimir
    Mladenovic, Vladimir
    Sooraksa, Pitikhate
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (09) : 5912 - 5932