Handling Imbalanced Datasets for Robust Deep Neural Network-Based Fault Detection in Manufacturing Systems

被引:7
作者
Kafunah, Jefkine [1 ]
Ali, Muhammad Intizar [2 ]
Breslin, John G. [1 ]
机构
[1] Natl Univ Ireland, Data Sci Inst, Galway H91 TK33, Ireland
[2] Dublin City Univ, Sch Elect Engn, Dublin 9, Ireland
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 21期
基金
爱尔兰科学基金会;
关键词
fault detection; imbalanced datasets; deep neural networks; QUANTITATIVE MODEL; DIAGNOSIS; CLASSIFICATION;
D O I
10.3390/app11219783
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Over the recent years, Industry 4.0 (I4.0) technologies such as the Industrial Internet of Things (IIoT), Artificial Intelligence (AI), and the presence of Industrial Big Data (IBD) have helped achieve intelligent Fault Detection (FD) in manufacturing. Notably, data-driven approaches in FD apply Deep Learning (DL) techniques to help generate insights required for monitoring complex manufacturing processes. However, due to the ratio of instances where actual faults occur, FD datasets tend to be imbalanced, leading to training challenges that result in inefficient DL-based FD models. In this paper, we propose Dual Logits Weights Perturbation (DLWP) loss, a method featuring weight vectors for improved dataset generalization in FD systems. The weight vectors act as hyperparameters adjusted on a case-by-case basis to regulate focus accorded to individual minority classes during training. In particular, our proposed method is suitable for imbalanced datasets from safety-related FD tasks as it generates DL models that minimize false negatives. Subsequently, we integrate human experts into the workflow as a strategy to help safeguard the system. A subset of the results, model predictions with uncertainties exceeding a preset threshold, are considered a preliminary output subject to cross-checking by human experts. We demonstrate that DLWP achieves improved Recall, AUC, F1 scores.
引用
收藏
页数:24
相关论文
共 90 条
[1]  
Achille A., 2019, P INT C LEARN REPR N
[2]  
Adam A., 2011, Proceedings of the 2011 11th International Conference on Hybrid Intelligent Systems (HIS 2011), P133, DOI 10.1109/HIS.2011.6122093
[3]  
Anantrasirichai N., 2019, P 2019 IEEE INT C IM
[4]   Deep Over-sampling Framework for Classifying Imbalanced Data [J].
Ando, Shin ;
Huang, Chun Yuan .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I, 2017, 10534 :770-785
[5]  
[Anonymous], 2008, P 7 AUSTR DAT MIN C
[6]   An improved plant-wide fault detection scheme based on PCA and adaptive threshold for reliable process monitoring: Application on the new revised model of Tennessee Eastman process [J].
Bakdi, Azzeddine ;
Kouadri, Abdelmalek .
JOURNAL OF CHEMOMETRICS, 2018, 32 (05)
[7]  
Berger J., 2013, Springer Series in Statistics
[8]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[9]   MetaNet: The theory of independent judges [J].
Buscema, M .
SUBSTANCE USE & MISUSE, 1998, 33 (02) :439-461
[10]  
Buscema M., 2010, 2010 ANN M N AM FUZZ, P1, DOI DOI 10.1109/NAFIPS.2010.5548298