Normalization Influence on ANN-Based Models Performance: A New Proposal for Features' Contribution Analysis

被引:12
|
作者
Nino-Adan, Iratxe [1 ,2 ]
Portillo, Eva [2 ]
Landa-Torres, Itziar [3 ]
Manjarres, Diana [1 ]
机构
[1] Basque Res & Technol Alliance BRTA, Tecnalia Res & Innovat, Derio 48160, Spain
[2] Univ Basque Country, UPV EHU, Fac Engn, Dept Automat Control & Syst Engn, Bilbao 48013, Spain
[3] Petronor Innovac SL, Muskiz 48550, Spain
关键词
Mathematical model; Dispersion; Neurons; Modeling; Feature extraction; Training; Proposals; Artificial neural networks; explainability; feature contribution; feature normalization; FEATURE-SELECTION; NEURAL-NETWORK; BLACK-BOX; OPTIMIZATION;
D O I
10.1109/ACCESS.2021.3110647
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial Neural Networks (ANNs) are weighted directed graphs of interconnected neurons widely employed to model complex problems. However, the selection of the optimal ANN architecture and its training parameters is not enough to obtain reliable models. The data preprocessing stage is fundamental to improve the model's performance. Specifically, Feature Normalisation (FN) is commonly utilised to remove the features' magnitude aiming at equalising the features' contribution to the model training. Nevertheless, this work demonstrates that the FN method selection affects the model performance. Also, it is well-known that ANNs are commonly considered a "black box" due to their lack of interpretability. In this sense, several works aim to analyse the features' contribution to the network for estimating the output. However, these methods, specifically those based on network's weights, like Garson's or Yoon's methods, do not consider preprocessing factors, such as dispersion factors, previously employed to transform the input data. This work proposes a new features' relevance analysis method that includes the dispersion factors into the weight matrix analysis methods to infer each feature's actual contribution to the network output more precisely. Besides, in this work, the Proportional Dispersion Weights (PWD) are proposed as explanatory factors of similarity between models' performance results. The conclusions from this work improve the understanding of the features' contribution to the model that enhances the feature selection strategy, which is fundamental for reliably modelling a given problem.
引用
收藏
页码:125462 / 125477
页数:16
相关论文
共 50 条
  • [21] Interpretation of ANN-based QSAR models for prediction of antioxidant activity of flavonoids
    Zuvela, Petar
    David, Jonathan
    Wong, Ming Wah
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2018, 39 (16) : 953 - 963
  • [22] ANN-based sediment yield river basin models for Vamsadhara (India)
    Agarwal, A
    Singh, RD
    Mishra, SK
    Bhunya, PK
    WATER SA, 2005, 31 (01) : 95 - 100
  • [23] ANN-based sediment yield models for Vamsadhara river basin (India)
    Agarwal, Avinash
    Singh, R.D.
    Mishra, S.K.
    Bhunya, P.K.
    2005, South African Water Research Commission (31)
  • [24] A new ANN-based detection algorithm of the masses in digital mammograms
    Xu, Weidong
    Li, Lihua
    Xu, Ping
    2007 IEEE INTERNATIONAL CONFERENCE ON INTEGRATION TECHNOLOGY, PROCEEDINGS, 2007, : 26 - +
  • [25] A New ANN-Based Cleaning Approach for Photovoltaic Solar Panels
    Mokhtar, Mohamed
    Shaaban, Mostafa F.
    2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 260 - 263
  • [26] Feasibility Analysis and ANN-Based Tuning of a Capacitorless Bandpass Biquad
    Moonngam, Montira
    Chaisricharoen, Roungsan
    Chipipop, Boonruk
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2009), 2009, : 393 - +
  • [27] Performance analysis of HMM- and ANN-based spectrum vacancy predictor behaviour for cognitive radios
    Karia, Deepak C.
    Lande, B. K.
    Daruwala, R. D.
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2012, 11 (04) : 206 - 213
  • [28] KNN and ANN-based Recognition of Handwritten Pashto Letters using Zoning Features
    Khan, Sulaiman
    Ali, Hazrat
    Ullah, Zahid
    Minallah, Nasru
    Maqsood, Shahid
    Hafeez, Abdul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (10) : 570 - 577
  • [29] Which Features Most Impact: Prediction of ANN-Based Lightpath Quality of Transmission?
    Li, Longfei
    Shen, Gangxiang
    Fu, Xiaodong
    Jie, Shuiping
    Bose, Sanjay Kumar
    2018 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2018,
  • [30] An ANN-based clustering analysis algorithm with Dynamic Data Window
    Tang, TH
    Wang, TZ
    2005 International Conference on Control and Automation (ICCA), Vols 1 and 2, 2005, : 581 - 586