Deep Neural Networks for High Dimension, Low Sample Size Data

被引:0
作者
Liu, Bo [1 ]
Wei, Ying [1 ]
Zhang, Yu [1 ]
Yang, Qiang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2017年
基金
中国国家自然科学基金;
关键词
FEATURE-SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNN) have achieved breakthroughs in applications with large sample size. However, when facing high dimension, low sample size (HDLSS) data, such as the phenotype prediction problem using genetic data in bioinformatics, DNN suffers from overfitting and high-variance gradients. In this paper, we propose a DNN model tailored for the HDLSS data, named Deep Neural Pursuit (DNP). DNP selects a subset of high dimensional features for the alleviation of overfitting and takes the average over multiple dropouts to calculate gradients with low variance. As the first DNN method applied on the HDLSS data, DNP enjoys the advantages of the high nonlinearity, the robustness to high dimensionality, the capability of learning from a small number of samples, the stability in feature selection, and the end-to-end training. We demonstrate these advantages of DNP via empirical results on both synthetic and real-world biological datasets.
引用
收藏
页码:2287 / 2293
页数:7
相关论文
共 50 条
  • [41] Deep Fuzzy Neural Networks for Biomarker Selection for Accurate Cancer Detection
    Bamunu Mudiyanselage, Thosini K.
    Xiao, Xueli
    Zhang, Yanqing
    Pan, Yi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (12) : 3219 - 3228
  • [42] Growing random forest on deep convolutional neural networks for scene categorization
    Bai, Shuang
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 : 279 - 287
  • [43] Transfer Learning for Molecular Cancer Classification Using Deep Neural Networks
    Sevakula, Rahul K.
    Singh, Vikas
    Verma, Nishchal K.
    Kumar, Chandan
    Cui, Yan
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (06) : 2089 - 2100
  • [44] Personalized PageRank Based Feature Selection for High-dimension Data
    Zhu, Zhibo
    Peng, Qinke
    Guan, Xinyu
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 197 - 202
  • [45] Various dimension reduction techniques for high dimensional data analysis: a review
    Ray, Papia
    Reddy, S. Surender
    Banerjee, Tuhina
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3473 - 3515
  • [46] Various dimension reduction techniques for high dimensional data analysis: a review
    Papia Ray
    S. Surender Reddy
    Tuhina Banerjee
    Artificial Intelligence Review, 2021, 54 : 3473 - 3515
  • [47] A Hybrid Feature Selection Optimization Model for High Dimension Data Classification
    Qaraad, Mohammed
    Amjad, Souad
    Manhrawy, Ibrahim I. M.
    Fathi, Hanaa
    Hassan, Bayoumi Ali
    El Kafrawy, Passent
    IEEE ACCESS, 2021, 9 : 42884 - 42895
  • [48] Deep neural networks with L1 and L2 regularization for high dimensional corporate credit risk prediction
    Yang, Mei
    Lim, Ming K.
    Qu, Yingchi
    Li, Xingzhi
    Ni, Du
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [49] Feature Selection Solution with High Dimensionality and Low-Sample Size for Land Cover Classification in Object-Based Image Analysis
    Huang, Yaohuan
    Zhao, Chuanpeng
    Yang, Haijun
    Song, Xiaoyang
    Chen, Jie
    Li, Zhonghua
    REMOTE SENSING, 2017, 9 (09):
  • [50] TRAINING DATA REDUCTION IN DEEP NEURAL NETWORKS WITH PARTIAL MUTUAL INFORMATION BASED FEATURE SELECTION AND CORRELATION MATCHING BASED ACTIVE LEARNING
    Zheng, Jian
    Yang, Wei
    Li, Xiaohua
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2362 - 2366