Big Data as a Tool for Building a Predictive Model of Mill Roll Wear

被引:34
|
作者
Vasilyeva, Natalia [1 ]
Fedorova, Elmira [1 ]
Kolesnikov, Alexandr [2 ]
机构
[1] St Petersburg Min Univ, Dept Econ Org & Management, St Petersburg 199106, Russia
[2] M Auezov South Kazakhstan Univ, NonProfit Joint Stock Co, Shymkent 160012, Kazakhstan
来源
SYMMETRY-BASEL | 2021年 / 13卷 / 05期
关键词
big data; rolling mill; rolled steel; rolling mill roll wear; mathematical model; correlation coefficient; TECHNOLOGY;
D O I
10.3390/sym13050859
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Big data analysis is becoming a daily task for companies all over the world as well as for Russian companies. With advances in technology and reduced storage costs, companies today can collect and store large amounts of heterogeneous data. The important step of extracting knowledge and value from such data is a challenge that will ultimately be faced by all companies seeking to maintain their competitiveness and place in the market. An approach to the study of metallurgical processes using the analysis of a large array of operational control data is considered. Using the example of steel rolling production, the development of a predictive model based on processing a large array of operational control data is considered. The aim of the work is to develop a predictive model of rolling mill roll wear based on a large array of operational control data containing information about the time of filling and unloading of rolls, rolled assortment, roll material, and time during which the roll is in operation. Preliminary preparation of data for modeling was carried out, which includes the removal of outliers, uncharacteristic and random measurement results (misses), as well as data gaps. Correlation analysis of the data showed that the dimensions and grades of rolled steel sheets, as well as the material from which the rolls are made, have the greatest influence on the wear of rolling mill rolls. Based on the processing of a large array of operational control data, various predictive models of the technological process were designed. The adequacy of the models was assessed by the value of the mean square error (MSE), the coefficient of determination (R-2), and the value of the Pearson correlation coefficient (R) between the calculated and experimental values of the mill roll wear. In addition, the adequacy of the models was assessed by the symmetry of the values predicted by the model relative to the straight line Ypredicted = Yactual. Linear models constructed using the least squares method and cross-validation turned out to be inadequate (the coefficient of determination R-2 does not exceed 0.3) to the research object. The following regressions were built on the basis of the same operational control database: Linear Regression multivariate, Lasso multivariate, Ridge multivariate, and ElasticNet multivariate. However, these models also turned out to be inadequate to the object of the research. Testing these models for symmetry showed that, in all cases, there is an underestimation of the predicted values. Models using algorithm composition have also been built. The methods of random forest and gradient boosting are considered. Both methods were found to be adequate for the object of the research (for the random forest model, the coefficient of determination is R-2 = 0.798; for the gradient boosting model, the coefficient of determination is R-2 = 0.847). However, the gradient boosting algorithm is recognized as preferable thanks to its high accuracy compared with the random forest algorithm. Control data for symmetry in reference to the straight line Ypredicted = Yactual showed that, in the case of developing the random forest model, there is a tendency to underestimate the predicted values (the calculated values are located below the straight line). In the case of developing a gradient boosting model, the predicted values are located symmetrically regarding the straight line Ypredicted = Yactual. Therefore, the gradient boosting model is preferred. The predictive model of mill roll wear will allow rational use of rolls in terms of minimizing overall roll wear. Thus, the proposed model will make it possible to redistribute the existing work rolls between the stands in order to reduce the total wear of the rolls.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Big Data Predictive Analysis: Using R Analytical Tool
    Shinde, Priyanka P.
    Oza, Kavita S.
    Kamat, R. K.
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 839 - 842
  • [2] Building an Ontology Model of Big Spectrum Data
    Yi, Xu
    Guo, Daoxing
    You, Wei
    Lei, Wangchun
    2016 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND INFORMATION TECHNOLOGY ENGINEERING (ICMITE 2016), 2016, : 434 - 438
  • [3] A Framework for Big Data Driven On-Line Monitoring of Tool Wear
    Gui, Yong
    Leng, Sheng
    Dai, Zhiqiang
    Wu, Jiyuan
    2021 THE 8TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS-EUROPE, ICIEA 2021-EUROPE, 2021, : 1 - 5
  • [4] Big Data Tool Integration in Physical Design Process Find hidden patterns, predictive analysis and classifying Big Data
    Ahmed, Waseem
    Fan, Lisa
    PROCEEDINGS OF 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2015, : 339 - 345
  • [5] Predicting Big Data Adoption in Companies With an Explanatory and Predictive Model
    Villarejo-Ramos, Angel F.
    Cabrera-Sanchez, Juan-Pedro
    Lara-Rubio, Juan
    Liebana-Cabanillas, Francisco
    FRONTIERS IN PSYCHOLOGY, 2021, 12
  • [6] Building an Implementation Model of IoT and Big Data and Its Improvement
    Jonny
    Kriswanto
    Toshio, Matsumura
    INTERNATIONAL JOURNAL OF TECHNOLOGY, 2021, 12 (05) : 1000 - 1008
  • [7] Big Building Data - a Big Data Platform for Smart Buildings
    Linder, Lucy
    Vionnet, Damien
    Bacher, Jean-Philippe
    Hennebert, Jean
    CISBAT 2017 INTERNATIONAL CONFERENCE FUTURE BUILDINGS & DISTRICTS - ENERGY EFFICIENCY FROM NANO TO URBAN SCALE, 2017, 122 : 589 - 594
  • [8] Big data and predictive analytics: A sytematic review of applications
    Jamarani, Amirhossein
    Haddadi, Saeid
    Sarvizadeh, Raheleh
    Kashani, Mostafa Haghi
    Akbari, Mohammad
    Moradi, Saeed
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)
  • [9] Predictive Big Data Analytics Using Multiple Linear Regression Model
    Khine, Kyi Lai Lai
    Nyunt, Thi Thi Soe
    BIG DATA ANALYSIS AND DEEP LEARNING APPLICATIONS, 2019, 744 : 9 - 19
  • [10] Real Time Big Data Analytics for Tool Wear Protection with Deep Learning in Manufacturing Industry
    Cakir, Altan
    Ozkaya, Emre
    Akkus, Fatih
    Kucukbas, Ezgi
    Yilmaz, Okan
    INTELLIGENT AND FUZZY SYSTEMS: DIGITAL ACCELERATION AND THE NEW NORMAL, INFUS 2022, VOL 2, 2022, 505 : 148 - 155