Interpretation of nonlinear relationships between process variables by use of random forests

被引:176
作者
Auret, Lidia [2 ]
Aldrich, Chris [1 ]
机构
[1] Curtin Univ Technol, Western Australian Sch Mines, Perth, WA 6845, Australia
[2] Univ Stellenbosch, Dept Proc Engn, ZA-7602 Stellenbosch, South Africa
关键词
Modelling; Pyrometallurgy; Comminution; MODELS;
D O I
10.1016/j.mineng.2012.05.008
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Better understanding of process phenomena is dependent on the interpretation of models capturing the relationships between the process variables. Although linear regression is used routinely in the mineral process industries for this purpose, it may not be useful where the relationships between variables are nonlinear or complex. Under these circumstances, nonlinear methods, such as neural networks or decision trees can be used to develop reliable models, without necessarily giving any particular or explicit insight into the relationships between the process and the target variables. This is a major drawback in situations where such information would be very important, such as in fault identification or gaining a better understanding of the fundamentals of a process. In this paper, the use of variable importance measures and partial dependency plots generated by random forest models are proposed as a practical tool that can be used to surmount this problem. In particular, it is shown that important variables can be flagged by appropriate threshold generated by inclusion of dummy variables in the system. Moreover, the results of the study indicate that random forest models can reliably identify the influence of individual variables, even in the presence of high levels of additive noise. This would make it a useful tool in continuous process improvement and root cause analysis of abnormal process behaviour. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:27 / 42
页数:16
相关论文
共 30 条
[1]   Monitoring of metallurgical reactors by the use of topographic mapping of process data [J].
Aldrich, C ;
Reuter, MA .
MINERALS ENGINEERING, 1999, 12 (11) :1301-1312
[2]  
[Anonymous], METRIKA
[3]  
[Anonymous], ENV MONITORING ASSES
[4]  
[Anonymous], P 3 INT WORKSH DISTR
[5]  
[Anonymous], 2003, Manual-Setting Up, Using
[6]   Empirical characterization of random forest variable importance measures [J].
Archer, Kelfie J. ;
Kirnes, Ryan V. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) :2249-2260
[7]   Forecasting murder within a population of probationers and parolees: a high stakes application of statistical learning [J].
Berk, Richard ;
Sherman, Lawrence ;
Barnes, Geoffrey ;
Kurtz, Ellen ;
Ahlman, Lindsay .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 :191-211
[8]   MODELING AND CHARACTERIZING OF THE THIXOFORMING OF STEEL PROCESS PARAMETERS - THE CASE OF FORMING LOAD [J].
Berrado, A. ;
Rassili, A. .
INTERNATIONAL JOURNAL OF MATERIAL FORMING, 2010, 3 :735-738
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32