Normalization Influence on ANN-Based Models Performance: A New Proposal for Features' Contribution Analysis

被引:12
|
作者
Nino-Adan, Iratxe [1 ,2 ]
Portillo, Eva [2 ]
Landa-Torres, Itziar [3 ]
Manjarres, Diana [1 ]
机构
[1] Basque Res & Technol Alliance BRTA, Tecnalia Res & Innovat, Derio 48160, Spain
[2] Univ Basque Country, UPV EHU, Fac Engn, Dept Automat Control & Syst Engn, Bilbao 48013, Spain
[3] Petronor Innovac SL, Muskiz 48550, Spain
关键词
Mathematical model; Dispersion; Neurons; Modeling; Feature extraction; Training; Proposals; Artificial neural networks; explainability; feature contribution; feature normalization; FEATURE-SELECTION; NEURAL-NETWORK; BLACK-BOX; OPTIMIZATION;
D O I
10.1109/ACCESS.2021.3110647
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial Neural Networks (ANNs) are weighted directed graphs of interconnected neurons widely employed to model complex problems. However, the selection of the optimal ANN architecture and its training parameters is not enough to obtain reliable models. The data preprocessing stage is fundamental to improve the model's performance. Specifically, Feature Normalisation (FN) is commonly utilised to remove the features' magnitude aiming at equalising the features' contribution to the model training. Nevertheless, this work demonstrates that the FN method selection affects the model performance. Also, it is well-known that ANNs are commonly considered a "black box" due to their lack of interpretability. In this sense, several works aim to analyse the features' contribution to the network for estimating the output. However, these methods, specifically those based on network's weights, like Garson's or Yoon's methods, do not consider preprocessing factors, such as dispersion factors, previously employed to transform the input data. This work proposes a new features' relevance analysis method that includes the dispersion factors into the weight matrix analysis methods to infer each feature's actual contribution to the network output more precisely. Besides, in this work, the Proportional Dispersion Weights (PWD) are proposed as explanatory factors of similarity between models' performance results. The conclusions from this work improve the understanding of the features' contribution to the model that enhances the feature selection strategy, which is fundamental for reliably modelling a given problem.
引用
收藏
页码:125462 / 125477
页数:16
相关论文
共 50 条
  • [41] Improving photovoltaic water pumping system performance with ANN-based direct torque control using real-time simulation
    Saady, Ikram
    Majout, Btissam
    El Kafazi, Ismail
    Karim, Mohammed
    Bossoufi, Badre
    El Ouanjli, Najib
    Mahfoud, Said
    Althobaiti, Ahmed
    Alghamdi, Thamer A. H.
    Alenezi, Mohammed
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [42] Predicting the adsorption capacity of organic dye using zirconium-based metal-organic framework: a comparative analysis of RSM and ANN-based models
    Dahlan, Irvan
    Obi, Christopher Chiedozie
    Poopathi, Veshmen
    BIOMASS CONVERSION AND BIOREFINERY, 2025, 15 (01) : 285 - 295
  • [44] Development of an ANN-based air pollution forecasting system with explicit knowledge through sensitivity analysis
    Elangasinghe, Madhavi Anushka
    Singhal, Naresh
    Dirks, Kim N.
    Salmond, Jennifer A.
    ATMOSPHERIC POLLUTION RESEARCH, 2014, 5 (04) : 696 - 708
  • [45] Comparison of SARIMAX, SARIMA, Modified SARIMA and ANN-based Models for Short-Term PV Generation Forecasting
    Vagropoulos, Stylianos I.
    Chouliaras, G. I.
    Kardakos, E. G.
    Simoglou, C. K.
    Bakirtzis, A. G.
    2016 IEEE INTERNATIONAL ENERGY CONFERENCE (ENERGYCON), 2016,
  • [46] Designing the input vector to ANN-based models for short-term load forecast in electricity distribution systems
    Santos, P. J.
    Martins, A. G.
    Pires, A. J.
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2007, 29 (04) : 338 - 347
  • [47] ANN-Based Assessment of the Influence of Natural and Anthropogenic Forcing on Surface Air Temperature Variability Over the Indian Subcontinent
    Padmavathi, B.
    Tiwari, R. K.
    Tiwari, V. M.
    PURE AND APPLIED GEOPHYSICS, 2021, 178 (05) : 1911 - 1926
  • [48] Prediction of flow curves and forming limit curves of Mg alloy thin sheets using ANN-based models
    Forcellese, A.
    Gabrielli, F.
    Simoncini, M.
    COMPUTATIONAL MATERIALS SCIENCE, 2011, 50 (11) : 3184 - 3197
  • [49] Analytical and ANN-based models for assessment of hunchback retaining walls: Investigating lateral earth pressure in unsaturated backfill
    Thottoth, Sivani Remash
    Khatri, Vishwas
    GEOMECHANICS AND ENGINEERING, 2024, 38 (03) : 285 - 305
  • [50] ANN-Based Wear Performance Prediction for Plasma Nitrided Ti6Al4V Alloy
    Kahraman, Fatih
    Karadeniz, Suleyman
    Durmus, Hulya
    MATERIALS TESTING, 2012, 54 (01) : 30 - 35