Green AI in the finance industry: Exploring the impact of feature engineering on the accuracy and computational time of Machine Learning models

被引:0
作者
Machado, Marcos R. [1 ]
Asadi, Amin [1 ]
de Souza, Renato William R. [2 ]
Ugulino, Wallace C. [3 ]
机构
[1] Univ Twente, Dept Ind Engn & Business Informat Syst IEBIS, NL-7500 AE Enschede, Netherlands
[2] Fed Inst Educ Sci & Technol Ceara, Rod Pres Juscelino Kubitschek, BR-63870000 Boa Viagem, Ceara, Brazil
[3] Univ Twente, Dept Semant Cybersecur & Serv SCS, NL-7500 AE Enschede, Netherlands
关键词
Feature engineering; Green AI; Machine Learning; Hybrid Machine Learning; Customer loyalty; Finance industry; CUSTOMER LOYALTY; CLASSIFICATION; ALGORITHM; PREDICTION; SYSTEMS;
D O I
10.1016/j.asoc.2024.112343
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As research and practice in Artificial Intelligence (AI) applications rapidly expand, the support for AI deployment is also increasing. While the abundance of data allows for sophisticated feature engineering techniques that can enhance accuracy, it is crucial to highlight both the computational costs and the efficiency with which these models operate. This paper compares the processing time and accuracy of individual and hybrid Machine Learning (ML) models in predicting customer loyalty within financial contexts. Frameworks that incorporate feature engineering and green AI principles are used separately in both individual and hybrid approaches. The individual models are the commonly used regressor-based algorithms applied to business problems. The hybrid models first use k-Means to cluster customers, followed by the application of individual regressor-based models (e.g., decision trees, gradient boosting, and LightGBM). The present results show that using fewer features results in only a marginally lower accuracy compared to models with more features (a difference of approximate to 0.01 in MAE when comparing the use of 18 versus 85 features). Additionally, this article clearly demonstrate the trade-off between higher accuracy and longer computational time in hybrid ML models versus lower accuracy and shorter computational time in individual models when predicting customer loyalty. Hybrid models exhibit a lower MSE ( approximate to 0 . 88 ) compared to individual models (approximate to 0.91). These findings provide managers with insights on selecting the most appropriate model based on their organization's specific needs.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Combining machine learning and process engineering physics towards enhanced accuracy and explainability of data-driven models
    Bikmukhametov, Timur
    Jaschke, Johannes
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 138
  • [32] Feature Engineering and Machine Learning Predictive Quality Models for Friction Stir Welding Defect Prediction in Aerospace Applications
    Camps, Marta
    Etxegarai, Maddi
    Bonada, Francesc
    Lacheny, William
    Pauleau, Sylvain
    Domingo, Xavier
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 151 - 154
  • [33] Impact of Feature Selection Techniques on the Performance of Machine Learning Models for Depression Detection Using EEG Data
    Hassan, Marwa
    Kaabouch, Naima
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [34] Improving the Accuracy of Ensemble Machine Learning Classification Models Using a Novel Bit-Fusion Algorithm for Healthcare AI Systems
    Mishra, Sashikala
    Shaw, Kailash
    Mishra, Debahuti
    Patil, Shruti
    Kotecha, Ketan
    Kumar, Satish
    Bajaj, Simi
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [35] Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning
    Piles, Miriam
    Bergsma, Rob
    Gianola, Daniel
    Gilbert, Helene
    Tusell, Llibertat
    FRONTIERS IN GENETICS, 2021, 12
  • [36] EXTES: An Execution-Time Estimation Scheme for Efficient Computational Science and Engineering Simulation via Machine Learning
    Kim, Seounghyeon
    Suh, Young-Kyoon
    Kim, Jeeyoung
    IEEE ACCESS, 2019, 7 : 98993 - 99002
  • [37] Exploring the variable space of shallow machine learning models for reversed-phase retention time prediction
    Yeung, Darien
    Spicer, Victor
    Zahedi, Rene P.
    Krokhin, Oleg
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 2446 - 2453
  • [38] Introducing Technical Indicators to Electricity Price Forecasting: A Feature Engineering Study for Linear, Ensemble, and Deep Machine Learning Models
    Demir, Sumeyra
    Mincev, Krystof
    Kok, Koen
    Paterakis, Nikolaos G.
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [39] Quantitative Physiologic MRI Combined with Feature Engineering for Developing Machine Learning-Based Prediction Models to Distinguish Glioblastomas from Single Brain Metastases
    Hosseini, Seyyed Ali
    Servaes, Stijn
    Hall, Brandon
    Bhaduri, Sourav
    Rajan, Archith
    Rosa-Neto, Pedro
    Brem, Steven
    Loevner, Laurie A.
    Mohan, Suyash
    Chawla, Sanjeev
    DIAGNOSTICS, 2025, 15 (01)
  • [40] Performance Evaluation of Online Machine Learning Models Based on Cyclic Dynamic and Feature-Adaptive Time Series
    Al-Khaleefa, Ahmed Salih
    Hassan, Rosilah
    Ahmad, Mohd Riduan
    Qamar, Faizan
    Wen, Zheng
    Aman, Azana Hafizah Mohd
    Yu, Keping
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (08) : 1172 - 1184