Deep Active Transfer Learning for Image Recognition

被引:13
作者
Singh, Ankita [1 ]
Chakraborty, Shayok [1 ]
机构
[1] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
active learning; transfer learning; deep learning; image recognition;
D O I
10.1109/ijcnn48605.2020.9207391
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning has revolutionized the field of computer vision and has achieved state-of-the-art performance in a variety of applications. However, training a robust deep neural network necessitates a large amount of hand-labeled training data, which is time-consuming and labor-intensive to acquire. Active learning and transfer learning are two popular methodologies to address the problem of learning with limited labeled data. Active learning attempts to select the salient and exemplar instances from large amounts of unlabeled data; transfer learning leverages knowledge from a labeled source domain to develop a model for a (related) target domain, where labeled data is scarce. In this paper, we propose a novel active transfer learning algorithm with the objective of learning informative feature representations from a given dataset using a deep convolutional neural network, under the constraint of weak supervision. We formulate a loss function relevant to the research task and exploit the gradient descent algorithm to optimize the loss and train the deep network. To the best of our knowledge, this is the first research effort to propose a task-specific loss function integrating active and transfer learning, with the goal of learning informative feature representations using a deep neural network, under weak human supervision. Our extensive empirical studies on a variety of challenging, real-world applications depict the merit of our framework over competing baselines.
引用
收藏
页数:9
相关论文
共 21 条
  • [1] [Anonymous], INT C MACH LEARN ICM
  • [2] Chan YeeSeng., 2007, ASS COMPUTATIONAL LI
  • [3] Chattopadhyay Rita, 2013, P MACHINE LEARNING R, P253
  • [4] Ganin Y, 2016, J MACH LEARN RES, V17
  • [5] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [6] Kale D., 2015, SIAM DAT MIN C SDM
  • [7] Kale D., 2013, IEEE INT C DAT MIN I
  • [8] King DB, 2015, ACS SYM SER, V1214, P1
  • [9] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [10] Long MS, 2015, PR MACH LEARN RES, V37, P97