Impacts of Feature Selection on Predicting Machine Failures by Machine Learning Algorithms

被引:4
作者
Bezerra, Francisco Elanio [1 ]
de Oliveira Neto, Geraldo Cardoso [2 ]
Cervi, Gabriel Magalhaes [3 ]
Mazetto, Rafaella Francesconi [3 ]
de Faria, Aline Mariane [3 ]
Vido, Marcos [4 ]
Lima, Gustavo Araujo [5 ]
de Araujo, Sidnei Alves [5 ]
Sampaio, Mauro [6 ]
Amorim, Marlene [7 ]
机构
[1] Univ Sao Paulo, Polytech Sch, Dept Energy Engn & Elect Automat, 158 Prof Luciano Gualberto Ave, BR-05508010 Sao Paulo, Brazil
[2] Fed Univ ABC, Alameda Univ, Ind Engn Post Grad Program, S-n Bairro Anchieta, BR-09606045 Sao Bernardo Do Campo, SP, Brazil
[3] FEI Univ, Business Adm Postgrad Program, Tamandare St 688,5 Floor, BR-01525000 Sao Paulo, Brazil
[4] Nove de Julho Univ UNINOVE, Ind Engn Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil
[5] Nove de Julho Univ UNINOVE, Informat & Knowledge Management Postgrad Program, Vergueiro St 235-249, BR-01504001 Sao Paulo, Brazil
[6] FEI Univ, Ind Engn Postgrad Program, Ave Humberto Alencar Castelo Branco 3972-B, BR-09850901 Sao Bernardo Do Campo, Brazil
[7] Univ Aveiro, GOVCOPP DEGEIT, P-3810193 Aveiro, Portugal
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 08期
关键词
machine learning; machine failure; feature selection; predictive maintenance; sensor selection; CLASSIFICATION; NETWORKS;
D O I
10.3390/app14083337
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the context of Industry 4.0, managing large amounts of data is essential to ensure informed decision-making in intelligent production environments. It enables, for example, predictive maintenance, which is essential for anticipating and identifying causes of failures in machines and equipment, optimizing processes, and promoting proactive management of human, financial, and material resources. However, generating accurate information for decision-making requires adopting suitable data preprocessing and analysis techniques. This study explores the identification of machine failures based on synthetic industrial data. Initially, we applied the feature selection techniques Principal Component Analysis (PCA), Minimum Redundancy Maximum Relevance (mRMR), Neighborhood Component Analysis (NCA), and Denoising Autoencoder (DAE) to the collected data and compared their results. In the sequence, a comparison among three widely known machine learning classifiers, namely Random Forest (RF), Support Vector Machine (SVM), and Multilayer Perceptron neural network (MLP), was conducted, with and without considering feature selection. The results showed that PCA and RF were superior to the other techniques, allowing the classification of failures with rates of 0.98, 0.97, and 0.98 for the accuracy, precision, and recall metrics, respectively. Thus, this work contributes by solving an industrial problem and detailing techniques to identify the most relevant variables and machine learning algorithms for predicting machine failures that negatively impact production planning. The findings provided by this study can assist industries in giving preference to employing sensors and collecting data that can contribute more effectively to machine failure predictions.
引用
收藏
页数:14
相关论文
共 34 条
  • [1] A PCA-based variable ranking and selection approach for electric energy load forecasting
    Bezerra, Francisco Elanio
    Grassi, Flavio
    Dias, Clcber Gustavo
    Pereira, Fabio Henrique
    [J]. INTERNATIONAL JOURNAL OF ENERGY SECTOR MANAGEMENT, 2022, 16 (06) : 1172 - 1191
  • [2] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [3] A Hybrid Intelligent Approach for Classification of Incipient Faults in Transmission Network
    Chang, Gary W.
    Hong, Yong-Han
    Li, Guan-Yi
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 2019, 34 (04) : 1785 - 1794
  • [4] Prediction model of the failure mode of beam-column joints using machine learning methods
    Gao, Xiangling
    Lin, Chen
    [J]. ENGINEERING FAILURE ANALYSIS, 2021, 120
  • [5] Dissecting a data-driven prognostic pipeline: A powertrain use case
    Giordano, Danilo
    Pastor, Eliana
    Giobergia, Flavio
    Cerquitelli, Tania
    Baralis, Elena
    Mellia, Marco
    Neri, Alessandra
    Tricarico, Davide
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 180
  • [6] Javaid M., 2021, Sens. Int., V2, DOI DOI 10.1016/J.SINTL.2021.100110
  • [7] Feature Selection Engineering for Credit Risk Assessment in Retail Banking
    Jemai, Jaber
    Zarrad, Anis
    [J]. INFORMATION, 2023, 14 (03)
  • [8] Kariuki H., 2022, INT J DATA SCI ANAL, V8, P72, DOI DOI 10.11648/J.IJDSA.20220803.11
  • [9] Khalid S, 2014, 2014 SCIENCE AND INFORMATION CONFERENCE (SAI), P372, DOI 10.1109/SAI.2014.6918213
  • [10] Effects of data set features on the performances of classification algorithms
    Kwon, Ohbyung
    Sim, Jae Mun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1847 - 1857