Mixture of Deep Neural Networks for Instancewise Feature Selection

被引:0
作者
Xiao, Qi [1 ]
Wang, Zhengdao [1 ]
机构
[1] Iowa State Univ, ECpE Dept, Ames, IA 50011 USA
来源
2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON) | 2019年
关键词
Instancewise feature selection; deep learning; mixture of models;
D O I
10.1109/allerton.2019.8919657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning relevant features is important for interpreting data in a machine learning model. Comparing with selecting a relevant feature subset for the entire data, instance-wise feature selection is more flexible for model interpretation. However current instancewise feature selection approaches are complex and suffer from high computational cost. We consider instancewise feature selection under supervised learning framework. We design a compact and interpretable neural network to approach the problem. To reduce the computational cost and gain better interpretability, we group relevant features and construct a mixture of neural networks. Using softmax as activation function for sub-model selection, the model membership can be learned accurately through gradient descent. To the best of our knowledge, our model is the first interpretable deep neural network model for instancewise feature selection using end-to-end training.
引用
收藏
页码:917 / 921
页数:5
相关论文
共 10 条
  • [1] Abadi M., 2015, P 12 USENIX S OPERAT
  • [2] [Anonymous], 2013, CORR
  • [3] How Do the Open Source Communities Address Usability and UX Issues? An Exploratory Study
    Cheng, Jinghui
    Guo, Jin L. C.
    [J]. CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
  • [4] Gu S., 2016, ARXIV
  • [5] Klambauer G., 2017, Advances in Neural Information Processing Systems, V30, P972
  • [6] "Why Should I Trust You?" Explaining the Predictions of Any Classifier
    Ribeiro, Marco Tulio
    Singh, Sameer
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1135 - 1144
  • [7] Shazeer Noam, 2017, P INT C LEARNING REP
  • [8] Shrikumar A, 2017, INT C MACH LEARN, V70, P3145
  • [9] Simonyan K., 2013, WORKSH INT C LEARN R, DOI DOI 10.48550/ARXIV.1312.6034
  • [10] Yoon Jinsung, 2019, INT C LEARN REPR