EDLT: Enabling Deep Learning for Generic Data Classification

Cited by: 2
Authors
Han, Huimei [1, 2]
Zhu, Xingquan [2 ]
Li, Ying [1 ]
Affiliations
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
[2] Florida Atlantic Univ, Dept Comp & Elec Engn & Comp Sci, Boca Raton, FL 33431 USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2018
Funding
National Natural Science Foundation of China; U.S. National Science Foundation
Keywords
Deep learning; feature learning; convolutional neural networks; classification
DOI
10.1109/ICDM.2018.00030
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes to enable deep learning for generic machine learning tasks. Our goal is to allow deep learning to be applied to data that are already represented in instance-feature tabular format, for better classification accuracy. Because deep learning relies on spatial/temporal correlation to learn new feature representations, our approach is to convert each instance of the original dataset into a synthetic matrix format to take full advantage of the feature learning power of deep learning methods. To maximize the correlation of the matrix, we use 0/1 optimization to reorder features such that those with strong correlations are adjacent to each other. By using a two-dimensional feature reordering, we are able to create a synthetic matrix, as an image, to represent each instance. Because the synthetic image preserves the original feature values and data correlation, existing deep learning algorithms, such as convolutional neural networks (CNN), can be applied to learn effective features for classification. Our experiments on 20 generic datasets, using CNN as the deep learning classifier, confirm that enabling deep learning on generic datasets yields a clear performance gain compared to generic machine learning methods. In addition, the proposed method consistently outperforms simple baselines that apply CNN to generic datasets. As a result, our research allows deep learning to be broadly applied to generic datasets for learning and classification. (Algorithm source code is available at http://github.com/hhmzwc/EDLT.)
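The idea of placing correlated features next to each other and folding each instance into a matrix can be illustrated with a small sketch. This is not the paper's 0/1 optimization or its two-dimensional reordering: it swaps in a simple greedy one-dimensional ordering followed by a row-major fold, so only horizontal neighbours are guaranteed to be related. The function names (`pearson`, `greedy_feature_order`, `to_synthetic_image`), the zero padding, and the toy table are all assumptions made for illustration.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(va * vb) if va > 0 and vb > 0 else 0.0

def greedy_feature_order(rows):
    """Greedy stand-in for the reordering step (NOT the paper's 0/1 optimization):
    each next feature is the unused one most correlated with the previous one."""
    cols = list(zip(*rows))  # column-wise view of the instance-feature table
    n = len(cols)
    if n < 2:
        return list(range(n))
    corr = [[abs(pearson(cols[i], cols[j])) for j in range(n)] for i in range(n)]
    # Start from one endpoint of the most strongly correlated pair.
    start = max(range(n), key=lambda i: max(corr[i][j] for j in range(n) if j != i))
    order, remaining = [start], set(range(n)) - {start}
    while remaining:
        last = order[-1]
        nxt = max(sorted(remaining), key=lambda j: corr[last][j])
        order.append(nxt)
        remaining.remove(nxt)
    return order

def to_synthetic_image(row, order, side):
    """Reorder one instance, zero-pad it, and fold it row-major into a side x side matrix."""
    if side * side < len(order):
        raise ValueError("side * side must be at least the number of features")
    flat = [row[j] for j in order] + [0.0] * (side * side - len(order))
    return [flat[r * side:(r + 1) * side] for r in range(side)]

# Hypothetical toy table: features 0 and 2 move together, as do features 1 and 3.
rows = [
    [1.0, 5.0, 2.0, 4.0],
    [2.0, 3.0, 4.1, 2.0],
    [3.0, 8.0, 6.0, 7.1],
    [4.0, 1.0, 8.2, 0.0],
    [5.0, 6.0, 9.9, 5.0],
]
order = greedy_feature_order(rows)
image = to_synthetic_image(rows[0], order, side=2)
```

In this toy case each correlated pair ends up sharing a row of the 2 x 2 matrix, which is the kind of local structure a convolutional filter could pick up. The original feature values are kept unchanged; only their positions move.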
Pages: 147-156
Number of pages: 10
Related papers
50 in total
  • [41] Skin Lesion Classification on Imbalanced Data Using Deep Learning with Soft Attention
    Viet Dung Nguyen
    Ngoc Dung Bui
    Hoang Khoi Do
    SENSORS, 2022, 22 (19)
  • [42] Synthetic data augmentation for surface defect detection and classification using deep learning
    Jain, Saksham
    Seth, Gautam
    Paruthi, Arpit
    Soni, Umang
    Kumar, Girish
    JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (04) : 1007 - 1020
  • [44] Experiments on Fine Tuning Deep Learning Models With News Data For Tweet Classification
    Hallac, Ibrahim R.
    Ay, Betul
    Aydin, Galip
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [45] Cancer Classification Based on Microarray Gene Expression Data Using Deep Learning
    Guillen, Pablo
    Ebalunode, Jerry
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1403 - 1405
  • [46] Learning of Generic Vision Features using Deep CNN
    Nithin, Kanishka D.
    Sivakumar, Bagavathi P.
    2015 FIFTH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC), 2015, : 54 - 57
  • [47] Deep Learning-Based Classification of Hyperspectral Data
    Chen, Yushi
    Lin, Zhouhan
    Zhao, Xing
    Wang, Gang
    Gu, Yanfeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (06) : 2094 - 2107
  • [48] Employing deep learning and sparse representation for data classification
    Fard, Seyed Mehdi Hazrati
    Hashemi, Sattar
    2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 289 - 293
  • [49] Deep Learning for Proteomics Data for Feature Selection and Classification
    Iravani, Sahar
    Conrad, Tim O. F.
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2019, 2019, 11713 : 301 - 316
  • [50] A Review on Deep Learning Techniques for 3D Sensed Data Classification
    Griffiths, David
    Boehm, Jan
    REMOTE SENSING, 2019, 11 (12)