Support vector regression modeling of coal flotation based on variable importance measurements by mutual information method

被引:37
作者
Chelgani, S. Chehreh [1 ]
Shahbazi, B. [2 ]
Hadavandi, E. [3 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Tarbiat Modares Univ, Dept Min Engn, Tehran, Iran
[3] Birjand Univ Technol, Dept Ind Engn, Birjand, Iran
关键词
Variable importance measurement; Flotation rate constant; Recovery; Mutual information; Support vector regression; GROSS CALORIFIC VALUE; FEATURE-SELECTION; FINE COAL; EXPLAINING RELATIONSHIPS; PARAMETERS; INDEX;
D O I
10.1016/j.measurement.2017.09.025
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Support vector regression (SVR) modeling was used to predict the coal flotation responses (recovery (R*) and flotation rate constant (k)) as a function of measured particle properties and hydrodynamic flotation variables. Coal flotation is a complicated multifaceted separation process and many measurable and unmeasurable variables can be considered for its modeling. Therefore, feature selection can be used to save time and cost of measuring irrelevant parameters. Mutual information (MI) as a powerful variable selection tool was used through laboratory measured variables to assess interactions and choose the most effective ones for predictions of R* and k. Feature selection by MI through variables indicated that the best arrangements for the R* and k predictions are the sets of particle Reynolds number-energy dissipation and particle size-bubble Reynolds number, respectively. Correlation of determination (R-2) and difference between laboratory measured and SVR predicted values based on MI selected variables indicated that the SVR can model R* and k quite accurately with R-2 = 0.93 and R-2 = 0.72, respectively. These results demonstrated that the MI-SVR combination can quite satisfactorily measure the importance of variables, increase interpretability, reduce the risk of overfitting, decrease complexity and generate predictive models for high dimension of variables based on selected features for complicated processing systems.
引用
收藏
页码:102 / 108
页数:7
相关论文
共 46 条
[1]   A study on the effect of particle size on coal flotation kinetics using fuzzy logic [J].
Abkhoshk, Emad ;
Kor, Mohammad ;
Rezai, Bahram .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (07) :5201-5207
[2]   Flotation of a refuse tailing fine coal slurry [J].
Barraza, Juan ;
Guerrero, Juan ;
Pineres, Jorge .
FUEL PROCESSING TECHNOLOGY, 2013, 106 :498-500
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]   Modeling of free swelling index based on variable importance measurements of parent coal properties by random forest method [J].
Chelgani, S. Chehreh ;
Matin, S. S. ;
Makaremi, S. .
MEASUREMENT, 2016, 94 :416-422
[5]   Explaining relationships between coke quality index and coal properties by Random Forest method [J].
Chelgani, S. Chehreh ;
Matin, S. S. ;
Hower, James C. .
FUEL, 2016, 182 :754-760
[6]   Microwave irradiation pretreatment and peroxyacetic acid desulfurization of coal and application of GRNN simultaneous predictor [J].
Chelgani, S. Chehreh ;
Jorjani, E. .
FUEL, 2011, 90 (11) :3156-3163
[7]   Study Relationship between Inorganic and Organic Coal Analysis with Gross Calorific Value by Multiple Regression and ANFIS [J].
Chelgani, S. Chehreh ;
Hart, Brian ;
Grady, William C. ;
Hower, James C. .
INTERNATIONAL JOURNAL OF COAL PREPARATION AND UTILIZATION, 2011, 31 (01) :9-19
[8]  
Do H., 2010, THESIS VIRGINIA TECH, P1
[9]   Variable selection using random forests [J].
Genuer, Robin ;
Poggi, Jean-Michel ;
Tuleau-Malot, Christine .
PATTERN RECOGNITION LETTERS, 2010, 31 (14) :2225-2236
[10]   Information-theoretic feature selection for functional data classification [J].
Gomez-Verdejo, Vanessa ;
Verleysen, Michel ;
Fleury, Jerome .
NEUROCOMPUTING, 2009, 72 (16-18) :3580-3589