Collecting public RGB-D datasets for human daily activity recognition

被引:7
|
作者
Wu, Hanbo [1 ]
Ma, Xin [1 ]
Zhang, Zhimeng [1 ]
Wang, Haibo [1 ]
Li, Yibin [1 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, 17923 Jingshi Rd, Jinan, Shandong, Peoples R China
来源
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2017年 / 14卷 / 04期
关键词
Human daily activity recognition; public RGB-D data sets merging; large-scale RGB-D activity data set; depth motion maps; depth cuboid similarity feature; curvature space scale; OBJECT RECOGNITION; FUSION; MODEL;
D O I
10.1177/1729881417709079
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Human daily activity recognition has been a hot spot in the field of computer vision for many decades. Despite best efforts, activity recognition in naturally uncontrolled settings remains a challenging problem. Recently, by being able to perceive depth and visual cues simultaneously, RGB-D cameras greatly boost the performance of activity recognition. However, due to some practical difficulties, the publicly available RGB-D data sets are not sufficiently large for benchmarking when considering the diversity of their activities, subjects, and background. This severely affects the applicability of complicated learning-based recognition approaches. To address the issue, this article provides a large-scale RGB-D activity data set by merging five public RGB-D data sets that differ from each other on many aspects such as length of actions, nationality of subjects, or camera angles. This data set comprises 4528 samples depicting 7 action categories (up to 46 subcategories) performed by 74 subjects. To verify the challengeness of the data set, three feature representation methods are evaluated, which are depth motion maps, spatiotemporal depth cuboid similarity feature, and curvature space scale. Results show that the merged large-scale data set is more realistic and challenging and therefore more suitable for benchmarking.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] Human body reshaping and its application using multiple RGB-D sensors
    Xu, Wanxin
    Su, Po-chang
    Cheung, Sen-ching Samson
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 79 : 71 - 81
  • [42] A nearest neighbor approach for fruit recognition in RGB-D images based on detection of convex surfaces
    Nyarko, Emmanuel Karlo
    Vidovic, Ivan
    Radocaj, Kristijan
    Cupec, Robert
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 454 - 466
  • [43] A Unified Multimodal De- and Re-Coupling Framework for RGB-D Motion Recognition
    Zhou, Benjia
    Wang, Pichao
    Wan, Jun
    Liang, Yanyan
    Wang, Fan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11428 - 11442
  • [44] Uniform and Variational Deep Learning for RGB-D Object Recognition and Person Re-Identification
    Ren, Liangliang
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4970 - 4983
  • [45] Multi-Model Convolutional Extreme Learning Machine with Kernel for RGB-D Object Recognition
    Yin, Yunhua
    Li, Huifang
    Wen, Xinling
    LIDAR IMAGING DETECTION AND TARGET RECOGNITION 2017, 2017, 10605
  • [46] Object Recognition and Augmentation for Wearable-Assistive System Using Egocentric RGB-D Sensor
    Gao, Ge
    Qian, Kun
    Ma, Xudong
    Xia, Jing
    Yu, Hai
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 775 - 780
  • [47] ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition
    Wan, Jun
    Lin, Chi
    Wen, Longyin
    Li, Yunan
    Miao, Qiguang
    Escalera, Sergio
    Anbarjafari, Gholamreza
    Guyon, Isabelle
    Guo, Guodong
    Li, Stan Z.
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3422 - 3433
  • [48] Multi-view CSPMPR-ELM feature learning and classifying for RGB-D object recognition
    Yunhua Yin
    Huifang Li
    Cluster Computing, 2019, 22 : 8181 - 8191
  • [49] Multi-view CSPMPR-ELM feature learning and classifying for RGB-D object recognition
    Yin, Yunhua
    Li, Huifang
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S8181 - S8191
  • [50] Efficient Recognition and 6D Pose Tracking of Markerless Objects with RGB-D and Motion Sensors on Mobile Devices
    Huang, Sheng-Chu
    Huang, Wei-Lun
    Lu, Yi-Cheng
    Tsai, Ming-Han
    Lin, I-Chen
    Lau, Yo-Chung
    Liu, Hsu-Hang
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (GRAPP), VOL 1, 2019, : 375 - 382