Simplicity of Kmeans versus Deepness of Deep Learning: A Case of Unsupervised Feature Learning with Limited Data

被引:24
|
作者
Dundar, Murat [1 ]
Kou, Qiang [2 ]
Zhang, Baichuan [1 ]
He, Yicheng [1 ]
Rajwa, Bartek [3 ]
机构
[1] Indiana Univ Purdue Univ, Dept Comp & Informat Sci, Indianapolis, IN 46202 USA
[2] Indiana Univ Purdue Univ, Dept Biohlth Informat, Indianapolis, IN 46202 USA
[3] Purdue Univ, Bindley Biosci Ctr, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICMLA.2015.78
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We study a bio-detection application as a case study to demonstrate that Kmeans-based unsupervised feature learning can be a simple yet effective alternative to deep learning techniques for small data sets with limited intra-as well as inter-class diversity. We investigate the effect on the classifier performance of data augmentation as well as feature extraction with multiple patch sizes and at different image scales. Our data set includes 1833 images from four different classes of bacteria, each bacterial culture captured at three different wavelengths and overall data collected during a three-day period. The limited number and diversity of images present, potential random effects across multiple days, and the multi-mode nature of class distributions pose a challenging setting for representation learning. Using images collected on the first day for training, on the second day for validation, and on the third day for testing Kmeans-based representation learning achieves 97% classification accuracy on the test data. This compares very favorably to 56% accuracy achieved by deep learning and 74% accuracy achieved by handcrafted features. Our results suggest that data augmentation or dropping connections between units offers little help for deep-learning algorithms, whereas significant boost can be achieved by Kmeans-based representation learning by augmenting data and by concatenating features obtained at multiple patch sizes or image scales.
引用
收藏
页码:883 / 888
页数:6
相关论文
共 50 条
  • [1] Deep Computation Model for Unsupervised Feature Learning on Big Data
    Zhang, Qingchen
    Yang, Laurence T.
    Chen, Zhikui
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (01) : 161 - 171
  • [2] Deep Learning With Unsupervised Feature in Echocardiographic Imaging
    Krittanawong, Chayakrit
    Tunhasiriwet, Anusith
    Zhang, HongJu
    Aydar, Mehmet
    Kitai, Takeshi
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2017, 69 (16) : 2100 - 2101
  • [3] Unsupervised Feature Learning For Bootleg Detection Using Deep Learning Architectures
    Buccoli, Michele
    Bestagini, Paolo
    Zanoni, Massimiliano
    Sarti, Augusto
    Tubaro, Stefano
    2014 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS'14), 2014, : 131 - 136
  • [4] REPLY: Deep Learning With Unsupervised Feature in Echocardiographic Imaging
    Narula, Sukrit
    Shameer, Khader
    Omar, Alaa Mabrouk Salem
    Dudley, Joel T.
    Sengupta, Partho P.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2017, 69 (16) : 2101 - 2102
  • [5] Transfer Learning in Deep Learning Models for Building Load Forecasting: Case of Limited Data
    Nawar, Menna
    Shomer, Moustafa
    Faddel, Samy
    Gong, Huangjie
    SOUTHEASTCON 2023, 2023, : 532 - 538
  • [6] Unsupervised feature learning and automatic modulation classification using deep learning model
    Ali, Afan
    Fan Yangyu
    PHYSICAL COMMUNICATION, 2017, 25 : 75 - 84
  • [7] A review of unsupervised feature learning and deep learning for time-series modeling
    Langkvist, Martin
    Karlsson, Lars
    Loutfi, Amy
    PATTERN RECOGNITION LETTERS, 2014, 42 : 11 - 24
  • [8] Unsupervised Geometrical Feature Learning from Hyperspectral Data
    Ahmad, Muhammad
    Khan, Adil Mehmood
    Hussain, Rasheed
    Protasov, Stanislav
    Chow, Francis
    Khattak, Asad Masood
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [9] Unsupervised Feature-Learning for Hyperspectral Data with Autoencoders
    Windrim, Lloyd
    Ramakrishnan, Rishi
    Melkumyan, Arman
    Murphy, Richard J.
    Chlingaryan, Anna
    REMOTE SENSING, 2019, 11 (07)
  • [10] Unsupervised Deep Feature Learning for Remote Sensing Image Retrieval
    Tang, Xu
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    REMOTE SENSING, 2018, 10 (08)