Leveraging Neural Networks and Calibration Measures for Confident Feature Selection

被引:0
作者
Gharoun, Hassan [1 ]
Yazdanjue, Navid [1 ]
Khorshidi, Mohammad Sadegh [1 ]
Chen, Fang [1 ]
Gandomi, Amir H. [1 ,2 ]
机构
[1] Univ Technol Sydney, Fac Engn & IT, Ultimo, NSW 2007, Australia
[2] Univ Res & Innovat Ctr EKIK, Obuda Univ, H-1034 Budapest, Hungary
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025年 / 9卷 / 03期
关键词
Uncertainty; Feature extraction; Accuracy; Predictive models; Entropy; Measurement uncertainty; Prediction algorithms; Mutual information; Filtering theory; Classification tree analysis; Neural networks; measurement uncertainty; feature selection; boruta; transfer learning; perturbation analysis; feature importance; UNCERTAINTY MEASURES; CLASSIFICATION;
D O I
10.1109/TETCI.2025.3535659
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the surge in data generation, both vertically (i.e., volume of data) and horizontally (i.e., dimensionality) the burden of the curse of dimensionality has become increasingly palpable. Feature selection, a key facet of dimensionality reduction techniques, has advanced considerably to address this challenge. One such advancement is the Boruta feature selection algorithm, which successfully discerns meaningful features by contrasting them to their permutated counterparts known as shadow features. Building on this, this paper introduces NeuroBoruta, that extends the traditional Boruta approach by integrating neural networks and calibration metrics to improve prediction accuracy and reduce model uncertainty. By augmenting shadow features with noise and utilizing neural network-based perturbation for importance evaluation, and further incorporating calibration metrics alongside accuracy this evolved version of the Boruta method is presented. Experimental results demonstrate that NeuroBoruta significantly enhances the predictive performance and reliability of classification models across various datasets, including medical imaging and standard UCI datasets. This study underscores the importance of considering both feature relevance and model uncertainty in the feature selection process, particularly in domains requiring high accuracy and reliability.
引用
收藏
页码:2179 / 2193
页数:15
相关论文
共 60 条
[1]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[2]   FracAtlas: A Dataset for Fracture Classification, Localization and Segmentation of Musculoskeletal Radiographs [J].
Abedeen, Iftekharul ;
Rahman, Md. Ashiqur ;
Prottyasha, Fatema Zohra ;
Ahmed, Tasnim ;
Chowdhury, Tareque Mohmud ;
Shatabda, Swakkhar .
SCIENTIFIC DATA, 2023, 10 (01)
[3]   Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection [J].
Ahmadpour, Hamed ;
Bazrafshan, Ommolbanin ;
Rafiei-Sardooi, Elham ;
Zamani, Hossein ;
Panagopoulos, Thomas .
SUSTAINABILITY, 2021, 13 (18)
[4]  
Aldrich C., 2013, UNSUPERVISED PROCESS, V16
[5]  
[Anonymous], 1959, IEEE Trans. Autom. Control, DOI 10.1109/TAC.1959.1104847
[6]  
[Anonymous], 2017, J. Eng. Sci. Technol. Rev.
[7]   Symmetric uncertainty class-feature association map for feature selection in microarray dataset [J].
Bakhshandeh, Soodeh ;
Azmi, Reza ;
Teshnehlab, Mohammad .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (01) :15-32
[8]   Transfer Learning VGG16 Model for Classification of Tomato Plant Leaf Diseases: A Novel Approach for Multi-Level Dimensional Reduction [J].
Borugadda, Premkumar ;
Lakshmi, Ramasami ;
Sahoo, Satyasangram .
PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2023, 31 (02) :813-841
[9]   Uncertainty aware training to improve deep learning model calibration for classification of cardiac MR images [J].
Dawood, Tareen ;
Chen, Chen ;
Sidhu, Baldeep S. ;
Ruijsink, Bram ;
Gould, Justin ;
Porter, Bradley ;
Elliott, Mark K. ;
Mehta, Vishal ;
Rinaldi, Christopher A. ;
Puyol-Anton, Esther ;
Razavi, Reza ;
King, Andrew P. .
MEDICAL IMAGE ANALYSIS, 2023, 88
[10]   Radiomics model to classify mammary masses using breast DCE-MRI compared to the BI-RADS classification performance [J].
Debbi, Kawtar ;
Habert, Paul ;
Grob, Anais ;
Loundou, Anderson ;
Siles, Pascale ;
Bartoli, Axel ;
Jacquier, Alexis .
INSIGHTS INTO IMAGING, 2023, 14 (01)