Novel deep-learning model for chemical process fault detection based on DCW transformer

被引:0
作者
Xie, Ying [1 ,2 ]
Hu, Fanchao [1 ,2 ]
Zhu, Yuan [1 ,2 ]
机构
[1] Shenyang Univ Chem Technol, Coll Informat Engn, Shenyang, Peoples R China
[2] Shenyang Univ Chem Technol, Liaoning Key Lab Ind Environm Resource Collaborat, Shenyang, Peoples R China
关键词
attention mechanism; DCW transformer; feature extraction; local and global information; NEURAL-NETWORKS;
D O I
10.1002/cjce.25507
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
In this study, a double-channel convolutional neural network and weighted (DCW) transformer model is proposed to address the problem of insufficient extraction of local information and no attention to channel-step information in the traditional transformer model. First, a double-channel information extraction method is proposed, so both the channel-step and time-step information achieve attention; second, the local information in the time and channel dimension of the data is extracted from deep and multiple scales, improving the feature extraction capability for local information; third, the long distance dependency relationship of the data is preserved by the attention mechanism, hence, the global correlation of the data is extracted effectively; finally, using the Gumbel-SoftMax function, the weights of the time-step and channel-step feature information are assigned, so the extracted feature information has been optimized. The proposed method was applied for the penicillin fermentation process to verify its efficacy. Experimental results show that the proposed method achieved a better fault detection accuracy, outperforming the existing models. Further ablation experiments were conducted to demonstrate the effectiveness of each component of the proposed model.
引用
收藏
页数:18
相关论文
共 46 条
  • [1] Bi J., presented at 2021 IEEE Int. Conf. Comput. Sci. Elect. Inform. Eng. Intell. Control Technol. (CEI), Fuzhou, China, 2426 September 2021
  • [2] Traffic transformer: Capturing the continuity and periodicity of time series for traffic forecasting
    Cai, Ling
    Janowicz, Krzysztof
    Mai, Gengchen
    Yan, Bo
    Zhu, Rui
    [J]. TRANSACTIONS IN GIS, 2020, 24 (03) : 736 - 755
  • [3] Chen W., 2022, P 28 ACM SIGKDD C KN
  • [4] Chowdhury R. R., 2022, in Proc. 28th ACM SIGKDD Conf. Knowl. Discov. Data Mining
  • [5] Dong L., presented at 2018 IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), Calgary, Canada, 1520 April 2018
  • [6] Guo Q., 2019, arXiv, DOI DOI 10.48550/ARXIV.1902.09113
  • [7] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [8] Herrmann C., 2020, in Eur. Conf. Comput. Vis., Springer, Cham
  • [9] Hu J., 2018, Proc. IEEE Conf. Comput. Vis. Pattern Recogn. (CVPR), Washington
  • [10] An evolutionary deep learning soft sensor model based on random forest feature selection technique for penicillin fermentation process
    Hua, Lei
    Zhang, Chu
    Sun, Wei
    Li, Yiman
    Xiong, Jinlin
    Nazir, Muhammad Shahzad
    [J]. ISA TRANSACTIONS, 2023, 136 : 139 - 151