Adaptive RGB Image Recognition by Visual-Depth Embedding

Cited by: 15

Authors:

Cai, Ziyun [1]

Long, Yang [2]

Shao, Ling [3,4]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing, Jiangsu, Peoples R China

[2] Newcastle Univ, Sch Comp, Open Lab, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England

[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates

[4] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2018 / Vol. 27 / Issue 05

Keywords:

RGB-D data; domain adaptation; visual categorization; NONNEGATIVE MATRIX FACTORIZATION; KERNEL;

DOI:

10.1109/TIP.2018.2806839

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recognizing RGB images from RGB-D data is a promising application, which significantly reduces the cost while still retaining high recognition rates. However, existing methods still suffer from the domain shift problem because conventional surveillance cameras and depth sensors use different mechanisms. In this paper, we aim to simultaneously solve the following two challenges: 1) how to take advantage of the additional depth information in the source domain, and 2) how to reduce the data distribution mismatch between the source and target domains. We propose a novel method called adaptive visual-depth embedding (aVDE), which first learns a compact shared latent space between the two representations of the labeled RGB and depth modalities in the source domain. The shared latent space then helps transfer the depth information to the unlabeled target dataset. Finally, aVDE models two separate learning strategies for domain adaptation (feature matching and instance reweighting) in a unified optimization problem, which matches features and reweights instances jointly across the shared latent space and the projected target domain for an adaptive classifier. We test our method on five pairs of data sets for object recognition and scene classification, and the results demonstrate the effectiveness of the proposed method.
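The abstract's first step, learning a latent space shared by the RGB and depth representations, can be illustrated with a joint nonnegative matrix factorization, which the record's keywords suggest is related. The sketch below is only an assumption-laden illustration and not the aVDE algorithm itself: the function name `joint_nmf`, its parameters, and the plain squared-error objective with Lee-Seung multiplicative updates are all choices made for this example, and the feature matching and instance reweighting terms are left out entirely.

```python
import numpy as np


def joint_nmf(x_rgb, x_depth, k, n_iter=200, eps=1e-9, seed=0):
    """Factor x_rgb ~ u_rgb @ v and x_depth ~ u_depth @ v with one shared v.

    x_rgb is (d_rgb, n) and x_depth is (d_depth, n); both must be nonnegative
    and describe the same n samples. Hypothetical helper, not from the paper.
    """
    rng = np.random.default_rng(seed)
    n = x_rgb.shape[1]
    u_rgb = rng.random((x_rgb.shape[0], k))
    u_depth = rng.random((x_depth.shape[0], k))
    v = rng.random((k, n))  # shared latent codes, one column per sample
    history = []
    for _ in range(n_iter):
        # Multiplicative updates keep every factor nonnegative.
        u_rgb *= (x_rgb @ v.T) / (u_rgb @ v @ v.T + eps)
        u_depth *= (x_depth @ v.T) / (u_depth @ v @ v.T + eps)
        numer = u_rgb.T @ x_rgb + u_depth.T @ x_depth
        denom = (u_rgb.T @ u_rgb + u_depth.T @ u_depth) @ v + eps
        v *= numer / denom
        # Sum of squared Frobenius reconstruction errors for both modalities.
        loss = (np.linalg.norm(x_rgb - u_rgb @ v) ** 2
                + np.linalg.norm(x_depth - u_depth @ v) ** 2)
        history.append(loss)
    return u_rgb, u_depth, v, history
```

Calling `joint_nmf(x_rgb, x_depth, k=3)` on two nonnegative feature matrices with matching sample counts returns the two modality-specific bases, the shared codes `v`, and the per-iteration reconstruction loss, which can be inspected to check that the factorization is converging.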

Pages: 2471-2483

Page count: 13
