Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

被引：7

作者：

Bezerra, Francisco Elanio ^{[1
]}

de Oliveira Neto, Geraldo Cardoso ^{[2
]}

Cervi, Gabriel Magalhaes ^{[3
]}

Mazetto, Rafaella Francesconi ^{[3
]}

de Faria, Aline Mariane ^{[3
]}

Vido, Marcos ^{[4
]}

Lima, Gustavo Araujo ^{[5
]}

de Araujo, Sidnei Alves ^{[5
]}

Sampaio, Mauro ^{[6
]}

Amorim, Marlene ^{[7
]}

机构：

[1] Univ Sao Paulo, Polytech Sch, Dept Energy Engn & Elect Automat, 158 Prof Luciano Gualberto Ave, BR-05508010 Sao Paulo, Brazil

[2] Fed Univ ABC, Alameda Univ, Ind Engn Post Grad Program, S-n Bairro Anchieta, BR-09606045 Sao Bernardo Do Campo, SP, Brazil

[3] FEI Univ, Business Adm Postgrad Program, Tamandare St 688,5 Floor, BR-01525000 Sao Paulo, Brazil

[4] Nove de Julho Univ UNINOVE, Ind Engn Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil

[5] Nove de Julho Univ UNINOVE, Informat & Knowledge Management Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil

[6] FEI Univ, Ind Engn Postgrad Program, Ave Humberto Alencar Castelo Branco 3972-B, BR-09850901 Sao Bernardo Do Campo, Brazil

[7] Univ Aveiro, GOVCOPP DEGEIT, P-3810193 Aveiro, Portugal

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 08期

关键词：

machine learning; machine failure; feature selection; predictive maintenance; sensor selection; CLASSIFICATION; NETWORKS;

D O I：

10.3390/app14083337

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In the context of Industry 4.0, managing large amounts of data is essential to ensure informed decision-making in intelligent production environments. It enables, for example, predictive maintenance, which is essential for anticipating and identifying causes of failures in machines and equipment, optimizing processes, and promoting proactive management of human, financial, and material resources. However, generating accurate information for decision-making requires adopting suitable data preprocessing and analysis techniques. This study explores the identification of machine failures based on synthetic industrial data. Initially, we applied the feature selection techniques Principal Component Analysis (PCA), Minimum Redundancy Maximum Relevance (mRMR), Neighborhood Component Analysis (NCA), and Denoising Autoencoder (DAE) to the collected data and compared their results. In the sequence, a comparison among three widely known machine learning classifiers, namely Random Forest (RF), Support Vector Machine (SVM), and Multilayer Perceptron neural network (MLP), was conducted, with and without considering feature selection. The results showed that PCA and RF were superior to the other techniques, allowing the classification of failures with rates of 0.98, 0.97, and 0.98 for the accuracy, precision, and recall metrics, respectively. Thus, this work contributes by solving an industrial problem and detailing techniques to identify the most relevant variables and machine learning algorithms for predicting machine failures that negatively impact production planning. The findings provided by this study can assist industries in giving preference to employing sensors and collecting data that can contribute more effectively to machine failure predictions.

引用

页数：14

共 34 条

[1] A PCA-based variable ranking and selection approach for electric energy load forecasting [J].

Bezerra, Francisco Elanio ;

Grassi, Flavio ;

Dias, Clcber Gustavo ;

Pereira, Fabio Henrique .

INTERNATIONAL JOURNAL OF ENERGY SECTOR MANAGEMENT, 2022, 16 (06) :1172-1191

[2] Random forests [J].

Breiman, L .

MACHINE LEARNING, 2001, 45 (01) :5-32

[3] A Hybrid Intelligent Approach for Classification of Incipient Faults in Transmission Network [J].

Chang, Gary W. ;

Hong, Yong-Han ;

Li, Guan-Yi .

IEEE TRANSACTIONS ON POWER DELIVERY, 2019, 34 (04) :1785-1794

[4] Prediction model of the failure mode of beam-column joints using machine learning methods [J].

Gao, Xiangling ;

Lin, Chen .

ENGINEERING FAILURE ANALYSIS, 2021, 120

[5] Dissecting a data-driven prognostic pipeline: A powertrain use case [J].

Giordano, Danilo ;

Pastor, Eliana ;

Giobergia, Flavio ;

Cerquitelli, Tania ;

Baralis, Elena ;

Mellia, Marco ;

Neri, Alessandra ;

Tricarico, Davide .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 180

[6]

Javaid M., 2021, Sens. Int., V2, DOI DOI 10.1016/J.SINTL.2021.100110

[7] Feature Selection Engineering for Credit Risk Assessment in Retail Banking [J].

Jemai, Jaber ;

Zarrad, Anis .

INFORMATION, 2023, 14 (03)

[8]

Kariuki H., 2022, INT J DATA SCI ANAL, V8, P72, DOI DOI 10.11648/J.IJDSA.20220803.11

[9]

Khalid S, 2014, 2014 SCIENCE AND INFORMATION CONFERENCE (SAI), P372, DOI 10.1109/SAI.2014.6918213

[10] Effects of data set features on the performances of classification algorithms [J].

Kwon, Ohbyung ;

Sim, Jae Mun .

EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) :1847-1857

← 1 2 3 4 →