Research on bearing fault diagnosis method based on cjbm with semi-supervised and imbalanced data

被引:1
作者
Li, Sai [1 ]
Peng, Yanfeng [1 ]
Bin, Guangfu [1 ]
Shen, Yiping [1 ]
Guo, Yong [1 ]
Li, Baoqing [1 ]
Jiang, Yongzheng [1 ]
Fan, Chao [1 ]
机构
[1] Hunan Univ Sci & Technol, Hunan Prov Key Lab Hlth Maintenance Mech Equipment, Xiangtan 411201, Peoples R China
基金
中国国家自然科学基金;
关键词
Bearing; Fault diagnosis; Semi-supervised learning; Imbalanced data; CJBM; LightGBM;
D O I
10.1007/s11071-024-10073-4
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Data-driven intelligent methods have been widely used in bearing fault diagnosis. However, it is observed that previous studies on bearing fault diagnosis always assume that the label samples are sufficient and that the number of normal and fault samples is the same or similar, which is challenging to meet in practical engineering applications. This assumption reduces the accuracy and stability of the semi-supervised imbalanced bearing data fault diagnosis model in practical working conditions. The complex training and weak interpretation problems of transfer learning methods are analyzed, and a center jumping boosting machine method for bearing intelligent fault recognition with semi-supervised and imbalanced data is proposed. First, a modified density peak clustering (DPC) algorithm is used to classify unlabeled samples and select subsamples, and aiming at the DPC problem, a gamma\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma $$\end{document} DPC algorithm based on the gamma\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma $$\end{document} jumping phenomenon is proposed to determine the number of clusters and intercept distance automatically. Second, combined with the synthetic minority oversampling technique, some minority class samples are added to achieve a balanced bearing dataset. Then, a few known faults are used to assign pseudo-labels to unknown samples. Finally, to diagnose the new data and reduce the amount of calculation in actual production, the balanced data after processing are used to train the bottom light gradient boosting machine model to solve intelligent classification and recognition of bearing vibration data. In addition, by using three bearing datasets with different balance ratios and comparing them with other methods, the superiority of the proposed method is verified in bearing condition identification.
引用
收藏
页码:19759 / 19781
页数:23
相关论文
共 32 条
  • [11] Arshad A., Riaz S., Jiao L., Semi-supervised deep fuzzy C-mean clustering for imbalanced multi-class classification, IEEE Access, 7, pp. 28100-28112, (2019)
  • [12] An Y., Zhang K., Chai Y., Zhu Z., Liu Q., Gaussian mixture variational based transformer domain adaptation fault diagnosis method and its application in bearing fault diagnosis, IEEE Trans. Industr. Inf, 20, 1, pp. 615-625, (2023)
  • [13] Xu Z., Shen D., Nie T., Kou Y., Yin N., Han X., A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data, Inf. Sci, 572, pp. 574-589, (2021)
  • [14] Tao X., Chen W., Zhang X., Guo W., Qi L., Fan Z., SVDD boundary and DPC clustering technique-based oversampling approach for handling imbalanced and overlapped data, Knowl.-Based Syst, 234, (2021)
  • [15] Tang M., Meng C., Wu H., Zhu H., Yi J., Tang J., Wang Y., Fault Detection for Wind Turbine Blade Bolts Based on GSG Combined with CS-LightGBM, Sensors, 22, 18, (2022)
  • [16] Rodriguez A., Laio A., Clustering by fast search and find of density peaks, Science, 344, 6191, pp. 1492-1496, (2014)
  • [17] Liu R., Wang H., Yu X., Shared-nearest-neighbor-based clustering by fast search and find of density peaks, Inf. Sci, 450, pp. 200-226, (2018)
  • [18] Liu Y., Yu Z., Chen C., Han Y., Yu B., Prediction of protein crotonylation sites through LightGBM classifier based on SMOTE and elastic net, Anal. Biochem, 609, (2020)
  • [19] Wang S., Liu S., Zhang J., Che X., Yuan Y., Wang Z., Kong D., A new method of diesel fuel brands identification: SMOTE oversampling combined with XGBoost ensemble learning, Fuel, 282, (2020)
  • [20] Chawla N.V., Bowyer K.W., Hall L.O., Kegelmeyer W.P., SMOTE: synthetic minority over-sampling technique, J. Artif. Intell. Res, 16, pp. 321-357, (2002)