RGB-D datasets using Microsoft Kinect or similar sensors: a survey

Cited by: 96
Authors
Cai, Ziyun [1]
Han, Jungong [2]
Liu, Li [2]
Shao, Ling [2]
Affiliations
[1] Univ Sheffield, Dept Elect & Elect Engn, Mappin St, Sheffield S1 3JD, S Yorkshire, England
[2] Northumbria Univ, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
Keywords
Microsoft Kinect sensor or similar devices; RGB-D dataset; Computer vision; Survey; Database; ACTION RECOGNITION; ACTIONLET ENSEMBLE; DEPTH INFORMATION; FALL DETECTION; SEGMENTATION; FRAMEWORK; DATABASE; FUSION; CAMERA; MAPS
DOI
10.1007/s11042-016-3374-6
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
RGB-D data has turned out to be a very useful representation of an indoor scene for solving fundamental computer vision problems. It combines the advantages of the color image, which provides appearance information about an object, with those of the depth image, which is immune to variations in color, illumination, rotation angle and scale. With the invention of the low-cost Microsoft Kinect sensor, which was initially used for gaming and later became a popular device for computer vision, high-quality RGB-D data can be acquired easily. In recent years, more and more RGB-D image/video datasets dedicated to various applications have become available, and these are of great importance for benchmarking the state of the art. In this paper, we systematically survey popular RGB-D datasets for different applications, including object recognition, scene classification, hand gesture recognition, 3D simultaneous localization and mapping, and pose estimation. We provide insights into the characteristics of each important dataset and compare the popularity and difficulty of those datasets. Overall, the main goal of this survey is to give a comprehensive description of the available RGB-D datasets and thus to guide researchers in selecting suitable datasets for evaluating their algorithms.
Pages: 4313-4355
Page count: 43