Adaptive RGB Image Recognition by Visual-Depth Embedding

被引:14
作者
Cai, Ziyun [1 ]
Long, Yang [2 ]
Shao, Ling [3 ,4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing, Jiangsu, Peoples R China
[2] Newcastle Univ, Sch Comp, Open Lab, Newcastle Upon Tyne NE4 5TG, Tyne & Wear, England
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
关键词
RGB-D data; domain adaptation; visual categorization; NONNEGATIVE MATRIX FACTORIZATION; KERNEL;
D O I
10.1109/TIP.2018.2806839
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing RGB images from RGB-D data is a promising application, which significantly reduces the cost while can still retain high recognition rates. However, existing methods still suffer from the domain shifting problem due to conventional surveillance cameras and depth sensors are using different mechanisms. In this paper, we aim to simultaneously solve the above two challenges: 1) how to take advantage of the additional depth information in the source domain? 2) how to reduce the data distribution mismatch between the source and target domains? We propose a novel method called adaptive visual-depth embedding (aVDE), which learns the compact shared latent space between two representations of labeled RGB and depth modalities in the source domain first. Then the shared latent space can help the transfer of the depth information to the unlabeled target dataset. At last, aVDE models two separate learning strategies for domain adaptation (feature matching and instance reweighting) in a unified optimization problem, which matches features and reweights instances jointly across the shared latent space and the projected target domain for an adaptive classifier. We test our method on five pairs of data sets for object recognition and scene classification, the results of which demonstrates the effectiveness of our proposed method.
引用
收藏
页码:2471 / 2483
页数:13
相关论文
共 51 条
  • [1] [Anonymous], TECH REP
  • [2] [Anonymous], 2013, INT C MACH LEARN PML
  • [3] [Anonymous], 2012, 2012 AS C COMP VIS B, DOI DOI 10.1007/978-3-642-37410-412
  • [4] [Anonymous], PROC CVPR IEEE
  • [5] [Anonymous], 2007, PROC IEEE INT C COMP
  • [6] Unsupervised Domain Adaptation by Domain Invariant Projection
    Baktashmotlagh, Mahsa
    Harandi, Mehrtash T.
    Lovell, Brian C.
    Salzmann, Mathieu
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 769 - 776
  • [7] Bo LF, 2011, IEEE INT C INT ROBOT, P821, DOI 10.1109/IROS.2011.6048717
  • [8] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560
  • [9] Non-negative Matrix Factorization on Manifold
    Cai, Deng
    He, Xiaofei
    Wu, Xiaoyun
    Han, Jiawei
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 63 - +
  • [10] RGB-D datasets using microsoft kinect or similar sensors: a survey
    Cai, Ziyun
    Han, Jungong
    Liu, Li
    Shao, Ling
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) : 4313 - 4355