Distribution-dependent feature selection for deep neural networks

被引:1
作者
Zhao, Xuebin [1 ]
Li, Weifu [1 ]
Chen, Hong [1 ]
Wang, Yingjie [2 ]
Chen, Yanhong [3 ]
John, Vijay [4 ]
机构
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430062, Peoples R China
[2] Huazhong Agr Univ, Coll Informat, Wuhan 430062, Peoples R China
[3] Chinese Acad Sci, Natl Space Sci Ctr, Beijing 100190, Peoples R China
[4] Toyota Technol Inst, Res Ctr Smart Vehicles, Tempaku Ku, 2-12-1 Hisakata, Nagoya, Aichi 4688511, Japan
基金
中国国家自然科学基金;
关键词
Feature selection; Coronal mass ejections; Deep neural networks; Interpretability; Hypothesis-testing; FALSE DISCOVERY RATE; REGRESSION; FILTER;
D O I
10.1007/s10489-021-02663-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While deep neural networks (DNNs) have achieved impressive performance on a wide variety of tasks, the black-box nature hinders their applicability to high-risk, decision-making fields. In such fields, besides accurate prediction, it is also desired to provide interpretable insights into DNNs, e.g., screening important features based on their contributions to predictive accuracy. To improve the interpretability of DNNs, this paper originally proposes a new feature selection algorithm for DNNs by integrating the knockoff technique and the distribution information of irrelevant features. With the help of knockoff features and central limit theorem, we state that the irrelevant feature's statistic follows a known Gaussian distribution under mild conditions. This information is applied in hypothesis testing to discover key features associated with the DNNs. Empirical evaluations on simulated data demonstrate that the proposed method can select more true informative features with higher F-1 scores. Meanwhile, the Friedman test and the post-hoc Nemenyi test are employed to validate the superiority of the proposed method. Then we apply our method to Coronal Mass Ejections (CME) data and uncover the key features which contribute to the DNN-based CME arrival time.
引用
收藏
页码:4432 / 4442
页数:11
相关论文
共 50 条
[31]   Toward the scalability of neural networks through feature selection [J].
Peteiro-Barral, D. ;
Bolon-Canedo, V. ;
Alonso-Betanzos, A. ;
Guijarro-Berdinas, B. ;
Sanchez-Marono, N. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (08) :2807-2816
[32]   Robust Test Selection for Deep Neural Networks [J].
Sun, Weifeng ;
Yan, Meng ;
Liu, Zhongxin ;
Lo, David .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (12) :5250-5278
[33]   Cluster-Dependent Feature Selection for the RBF Networks [J].
Czarnowski, Ireneusz ;
Jedrzejowicz, Piotr .
COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 :223-232
[34]   Hybrid intelligent phishing website prediction using deep neural networks with genetic algorithm-based feature selection and weighting [J].
Ali, Waleed ;
Ahmed, Adel A. .
IET INFORMATION SECURITY, 2019, 13 (06) :659-669
[35]   Visualizing complex feature interactions and feature sharing in genomic deep neural networks [J].
Ge Liu ;
Haoyang Zeng ;
David K. Gifford .
BMC Bioinformatics, 20
[36]   Visualizing complex feature interactions and feature sharing in genomic deep neural networks [J].
Liu, Ge ;
Zeng, Haoyang ;
Gifford, David K. .
BMC BIOINFORMATICS, 2019, 20 (1)
[37]   Feature selection accelerated convolutional neural networks for visual tracking [J].
Zhiyan Cui ;
Na Lu .
Applied Intelligence, 2021, 51 :8230-8244
[38]   Forecasting of Wind Speed Using Feature Selection and Neural Networks [J].
Kumar, Senthil P. ;
Lopez, Daphne .
INTERNATIONAL JOURNAL OF RENEWABLE ENERGY RESEARCH, 2016, 6 (03) :833-837
[39]   Feature selection accelerated convolutional neural networks for visual tracking [J].
Cui, Zhiyan ;
Lu, Na .
APPLIED INTELLIGENCE, 2021, 51 (11) :8230-8244
[40]   Fast, Accurate, and Stable Feature Selection Using Neural Networks [J].
James Deraeve ;
William H. Alexander .
Neuroinformatics, 2018, 16 :253-268