Deep Active Transfer Learning for Image Recognition

被引：16

作者：

Singh, Ankita ^{[1
]}

Chakraborty, Shayok ^{[1
]}

机构：

[1] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA

来源：

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年

关键词：

active learning; transfer learning; deep learning; image recognition;

D O I：

10.1109/ijcnn48605.2020.9207391

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, deep learning has revolutionized the field of computer vision and has achieved state-of-the-art performance in a variety of applications. However, training a robust deep neural network necessitates a large amount of hand-labeled training data, which is time-consuming and labor-intensive to acquire. Active learning and transfer learning are two popular methodologies to address the problem of learning with limited labeled data. Active learning attempts to select the salient and exemplar instances from large amounts of unlabeled data; transfer learning leverages knowledge from a labeled source domain to develop a model for a (related) target domain, where labeled data is scarce. In this paper, we propose a novel active transfer learning algorithm with the objective of learning informative feature representations from a given dataset using a deep convolutional neural network, under the constraint of weak supervision. We formulate a loss function relevant to the research task and exploit the gradient descent algorithm to optimize the loss and train the deep network. To the best of our knowledge, this is the first research effort to propose a task-specific loss function integrating active and transfer learning, with the goal of learning informative feature representations using a deep neural network, under weak human supervision. Our extensive empirical studies on a variety of challenging, real-world applications depict the merit of our framework over competing baselines.

引用

页数：9

共 21 条

[1]

[Anonymous], INT C MACH LEARN ICM

[2]

[Anonymous], 2015, ACS SYM SER

[3]

Chan YeeSeng., 2007, ASS COMPUTATIONAL LI

[4]

Chattopadhyay R., 2013, P 30 INT C MACH LEAR, P253

[5]

Ganin Y, 2016, J MACH LEARN RES, V17

[6]

Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672

[7]

Kale D., 2015, SIAM DAT MIN C SDM

[8]

Kale D., 2013, IEEE INT C DAT MIN I

[9] Gradient-based learning applied to document recognition [J].

Lecun, Y ;

Bottou, L ;

Bengio, Y ;

Haffner, P .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324

[10]

Long MS, 2015, PR MACH LEARN RES, V37, P97

← 1 2 3 →