Prediction of tandem cold-rolled strip flatness based on Attention-LSTM model

被引:24
作者
Chen, Yafei [1 ]
Peng, Lianggui [1 ]
Wang, Yu [1 ]
Zhou, Yilin [2 ]
Li, Changsheng [1 ]
机构
[1] Northeastern Univ, State Key Lab Rolling & Automat, Shenyang 110819, Peoples R China
[2] Cold Rolling Mill Panzhihua Steel & Vanadium Co Lt, Pangang Grp, Panzhihua 617023, Peoples R China
关键词
Tandem cold-rolled strip; Flatness; Long-short-term memory model; Attention mechanism; Interpretation; RECURRENT NEURAL-NETWORKS; TENSION;
D O I
10.1016/j.jmapro.2023.02.048
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Flatness is a key quality indicator of tandem cold-rolled strip. Tandem cold-rolled production is a multi-stand simultaneous rolling process. The prediction of flatness is a typical spatial sequence data prediction problem by considering not only the complex generation mechanism but also the spatial dimension dependence. How-ever, the currently available mechanistic and machine learning models for flatness prediction only focus on its highly nonlinear characteristics and ignore its complex spatial correlation. In addition, the over-parameterized "black box" nature of machine learning models has often been questioned. Based on this, a long-short-term memory model with an attention mechanism (Attention-LSTM) is proposed in this paper. The model is struc-tured as a two-layer LSTM network to fully learn the highly nonlinear and complex spatial data correlation of tandem cold-rolled strip flatness. Furthermore, attention mechanisms are also added to the spatial dimension and flatness feature vector dimension, respectively, to enhance the interpretability of the model. The superiority of the proposed model is verified by comparing the error back propagation neural network model (BPNN), the extreme gradient boosting algorithm model (XGBoost), the deep neural network model (DNN), the LSTM model without the attention mechanism, and the Attention-LSTM model with the same dataset and the same optimum training method. The interpretation of the attention mechanism weights is consistent with the mechanistic model, which enhances the credibility of the model. The results of the ablative experiments also validate the effectiveness of incorporating the attention mechanism in the model.
引用
收藏
页码:110 / 121
页数:12
相关论文
共 26 条
  • [1] Post-buckling modeling for strips under tension and residual stresses using asymptotic numerical method
    Abdelkhalek, Sami
    Zahrouni, Hamid
    Legrand, Nicolas
    Potier-Ferry, Michel
    [J]. INTERNATIONAL JOURNAL OF MECHANICAL SCIENCES, 2015, 104 : 126 - 137
  • [2] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [3] Cho K., 2014, WORKSHOP SYNTAX SEMA
  • [4] RECURRENT NEURAL NETWORKS AND ROBUST TIME-SERIES PREDICTION
    CONNOR, JT
    MARTIN, RD
    ATLAS, LE
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 240 - 254
  • [5] Application of neural networks for predicting hot-rolled strip crown
    Deng, Jifei
    Sun, Jie
    Peng, Wen
    Hu, Yaohui
    Zhang, Dianhua
    [J]. APPLIED SOFT COMPUTING, 2019, 78 : 119 - 131
  • [6] Interpretable spatio-temporal attention LSTM model for flood forecasting
    Ding, Yukai
    Zhu, Yuelong
    Feng, Jun
    Zhang, Pengcheng
    Cheng, Zirun
    [J]. NEUROCOMPUTING, 2020, 403 : 348 - 359
  • [7] Experimental and numerical modeling of flatness defects in strip cold rolling
    Dinh Cuong Tran
    Tardif, Nicolas
    Limam, Ali
    [J]. INTERNATIONAL JOURNAL OF SOLIDS AND STRUCTURES, 2015, 69-70 : 343 - 349
  • [8] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
  • [9] Edwards WJ, 1973, AUTOMATION TANDEM MI, P177
  • [10] Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947