A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance

被引:1
|
作者
Cheng, Shyi-Chyi [1 ]
Hsiao, Kuei-Fang [2 ]
Yang, Chen-Kuei [2 ]
Hsiao, Po-Fu [1 ]
Yu, Wan-Hsuan [1 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, 2 Pei Ning Rd, Keelung 202, Taiwan
[2] Ming Chuan Univ, Dept Informat Management, 5 De Ming Rd, Taoyuan 333, Taiwan
关键词
Object skeleton modeling and detection; Moment-based symmetry feature detection; RGB-D images; Part merging; Unsupervised feature learning; SYMMETRY DETECTION; EXTRACTION; TIME;
D O I
10.1007/s11042-018-6292-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present a novel moment-based skeleton detection for representing human objects in RGB-D videos with animated 3D skeletons. An object often consists of several parts, where each of them can be concisely represented with a skeleton. However, it remains as a challenge to detect the skeletons of individual objects in an image since it requires an effective part detector and a part merging algorithm to group parts into objects. In this paper, we present a novel fully unsupervised learning framework to detect the skeletons of human objects in a RGB-D video. The skeleton modeling algorithm uses a pipeline architecture which consists of a series of cascaded operations, i.e., symmetry patch detection, linear time search of symmetry patch pairs, part and symmetry detection, symmetry graph partitioning, and object segmentation. The properties of geometric moment-based functions for embedding symmetry features into centers of symmetry patches are also investigated in detail. As compared with the state-of-the-art deep learning approaches for skeleton detection, the proposed approach does not require tedious human labeling work on training images to locate the skeleton pixels and their associated scale information. Although our algorithm can detect parts and objects simultaneously, a pre-learned convolution neural network (CNN) can be used to locate the human object from each frame of the input video RGB-D video in order to achieve the goal of constructing real-time applications. This much reduces the complexity to detect the skeleton structure of individual human objects with our proposed method. Using the segmented human object skeleton model, a video surveillance application is constructed to verify the effectiveness of the approach. Experimental results show that the proposed method gives good performance in terms of detection and recognition using publicly available datasets.
引用
收藏
页码:15829 / 15857
页数:29
相关论文
共 50 条
  • [1] A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance
    Shyi-Chyi Cheng
    Kuei-Fang Hsiao
    Chen-Kuei Yang
    Po-Fu Hsiao
    Wan-Hsuan Yu
    Multimedia Tools and Applications, 2020, 79 : 15829 - 15857
  • [2] Unsupervised Segmentation of RGB-D Images
    Deng, Zhuo
    Latecki, Longin Jan
    COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 423 - 435
  • [3] 3D Hand Pose Detection in Egocentric RGB-D Images
    Rogez, Gregory
    Khademi, Maryam
    Supancic, J. S., III
    Montiel, J. M. M.
    Ramanan, Deva
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 356 - 371
  • [4] 3D Texture Recognition for RGB-D Images
    Zhong, Guoqiang
    Mao, Xin
    Shi, Yaxin
    Dong, Junyu
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528
  • [5] 2D-Driven 3D Object Detection in RGB-D Images
    Lahoud, Jean
    Ghanem, Bernard
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4632 - 4640
  • [6] Expandable YOLO: 3D Object Detection from RGB-D Images
    Takahashi, Masahiro
    Ji, Yonghoon
    Umeda, Kazunori
    Moro, Alessandro
    2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
  • [7] Unsupervised Human Activity Detection with Skeleton Data From RGB-D Sensor
    Ong, Wee-Hong
    Koseki, Takafumi
    Palafox, Leon
    2013 FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN), 2013, : 30 - 35
  • [8] An automatic 2D to 3D video conversion approach based on RGB-D images
    Pan, Baiyu
    Zhang, Liming
    Yin, Hanxiong
    Lan, Jun
    Cao, Feilong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (13) : 19179 - 19201
  • [9] An automatic 2D to 3D video conversion approach based on RGB-D images
    Baiyu Pan
    Liming Zhang
    Hanxiong Yin
    Jun Lan
    Feilong Cao
    Multimedia Tools and Applications, 2021, 80 : 19179 - 19201
  • [10] Basic 3D Solid Recognition in RGB-D Images
    Kornuta, Tomasz
    Stefanczyk, Maciej
    Kasprzak, Wlodzimierz
    RECENT ADVANCES IN AUTOMATION, ROBOTICS AND MEASURING TECHNIQUES, 2014, 267 : 421 - 430