Impact of Data Leakage in Vibration Signals Used for Bearing Fault Diagnosis

被引:0
作者
Wheat, Lesley [1 ,2 ]
Mohrenschildt, Martin V. [1 ]
Habibi, Saeid [2 ]
Al-Ani, Dhafar [3 ]
机构
[1] McMaster Univ, Dept Comp & Software, Hamilton, ON L8S 4L7, Canada
[2] McMaster Univ, CMHT, Dept Mech Engn, Hamilton, ON L8S 4L8, Canada
[3] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4L7, Canada
来源
IEEE ACCESS | 2024年 / 12卷
基金
加拿大自然科学与工程研究理事会;
关键词
Vibrations; Fault diagnosis; Training; Soft sensors; Rotating machines; Vibration measurement; Data models; Robustness; Time factors; Principal component analysis; Bearing; fault diagnosis; vibration analysis; domain shift; data leakage; machine learning; train-test split; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/ACCESS.2024.3497716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bearing fault diagnosis is a well-developed field and an active area of research in which the combination of model-free machine learning techniques with vibration data has become a popular approach. However, vibration data from rotating machines has the potential to contain domain shifts beyond the accepted causes in this research area (different part models, operating conditions and sensor locations) which can enable data leakage between training and test datasets. To demonstrate the impact of data leakage, six common bearing diagnosis methods are applied to two datasets using three data splitting methods to compare classification performance. Diagnosis is preformed using Principal Component Analysis (PCA), Supervised Principal Component Analysis (SPCA) and Linear Discriminant Analysis (LDA) in combination with frequency analysis and envelope analysis feature extraction methods. Datasets from McMaster University and Paderborn University are used as experimental data sources, and produce vastly differing results (over a 40% drop in accuracy) depending on the selected dataset splitting method, revealing a previously unknown domain shift. Despite great results for diagnosis methods using frequency response analysis on the data from McMaster, these results are not expected to generalize due to possible data leakage. Out of fifty-five previous works using the Paderborn dataset, ten are identified as likely to be affected and only six properly address the problem. Recommendations are given for future experiment design, model creation and model evaluation.
引用
收藏
页码:169879 / 169895
页数:17
相关论文
共 50 条
  • [31] Experimental Vibration Data in Fault Diagnosis: A Machine Learning Approach to Robust Classification of Rotor and Bearing Defects in Rotating Machines
    Almutairi, Khalid M.
    Sinha, Jyoti K.
    MACHINES, 2023, 11 (10)
  • [32] Fault Diagnosis of RV Reducers Used in Industrial Robots Based on Vibration Analysis
    Han, Huanqing
    Xu, Qirong
    Li, Dongqin
    Li, Bing
    Sun, Xiuquan
    Gu, Fengshou
    PROCEEDINGS OF TEPEN 2022, 2023, 129 : 306 - 317
  • [33] Rolling Bearing Fault Diagnosis Based on Improved GAN and 2-D Representation of Acoustic Emission Signals
    Pham, Minh Tuan
    Kim, Jong-Myon
    Kim, Cheol Hong
    IEEE ACCESS, 2022, 10 : 78056 - 78069
  • [34] A Novel Fault Diagnosis Method for Marine Blower with Vibration Signals
    Yan, Guohua
    Hu, Yihuai
    Jiang, Jiawei
    POLISH MARITIME RESEARCH, 2022, 29 (02) : 77 - 86
  • [35] Roller bearing fault diagnosis using Hilbert vibration decomposition
    Zhu, K.-H., 1600, Chinese Vibration Engineering Society (33): : 160 - 164
  • [36] Wind Turbine Bearing Fault Diagnosis Based on Sparse Representation of Condition Monitoring Signals
    Wang, Jun
    Qiao, Wei
    Qu, Liyan
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2019, 55 (02) : 1844 - 1852
  • [37] Wind Turbine Bearing Fault Diagnosis Based on Sparse Representation of Condition Monitoring Signals
    Wang, Jun
    Qiao, Wei
    Qu, Liyan
    2017 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2017, : 3696 - 3702
  • [38] An intelligent fault diagnosis method for rolling bearing using motor stator current signals
    Ye, Xiangbiao
    Li, Guofu
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (08)
  • [39] Vibration-Based Bearing Fault Diagnosis Using Reflection Coefficients of the Autoregressive Model
    Heydarzadeh, Mehrdad
    Nourani, Mehrdad
    Azimi, Vahid
    Kashani-Pour, Amir R.
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 5801 - 5806
  • [40] Fast Frequency Sparsity Learning Approach for Missing Data-Resistant Bearing Fault Diagnosis
    Cao, Zheng
    Dai, Jisheng
    Xu, Weichao
    Xiong, Weizu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74