A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance

被引：1

作者：

Cheng, Shyi-Chyi ^{[1
]}

Hsiao, Kuei-Fang ^{[2
]}

Yang, Chen-Kuei ^{[2
]}

Hsiao, Po-Fu ^{[1
]}

Yu, Wan-Hsuan ^{[1
]}

机构：

[1] Natl Taiwan Ocean Univ, Dept Comp Sci & Engn, 2 Pei Ning Rd, Keelung 202, Taiwan

[2] Ming Chuan Univ, Dept Informat Management, 5 De Ming Rd, Taoyuan 333, Taiwan

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2020年 / 79卷 / 23-24期

关键词：

Object skeleton modeling and detection; Moment-based symmetry feature detection; RGB-D images; Part merging; Unsupervised feature learning; SYMMETRY DETECTION; EXTRACTION; TIME;

D O I：

10.1007/s11042-018-6292-y

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we present a novel moment-based skeleton detection for representing human objects in RGB-D videos with animated 3D skeletons. An object often consists of several parts, where each of them can be concisely represented with a skeleton. However, it remains as a challenge to detect the skeletons of individual objects in an image since it requires an effective part detector and a part merging algorithm to group parts into objects. In this paper, we present a novel fully unsupervised learning framework to detect the skeletons of human objects in a RGB-D video. The skeleton modeling algorithm uses a pipeline architecture which consists of a series of cascaded operations, i.e., symmetry patch detection, linear time search of symmetry patch pairs, part and symmetry detection, symmetry graph partitioning, and object segmentation. The properties of geometric moment-based functions for embedding symmetry features into centers of symmetry patches are also investigated in detail. As compared with the state-of-the-art deep learning approaches for skeleton detection, the proposed approach does not require tedious human labeling work on training images to locate the skeleton pixels and their associated scale information. Although our algorithm can detect parts and objects simultaneously, a pre-learned convolution neural network (CNN) can be used to locate the human object from each frame of the input video RGB-D video in order to achieve the goal of constructing real-time applications. This much reduces the complexity to detect the skeleton structure of individual human objects with our proposed method. Using the segmented human object skeleton model, a video surveillance application is constructed to verify the effectiveness of the approach. Experimental results show that the proposed method gives good performance in terms of detection and recognition using publicly available datasets.

引用

页码：15829 / 15857

页数：29

共 50 条

[1] A novel unsupervised 3D skeleton detection in RGB-D images for video surveillance
Shyi-Chyi Cheng
Kuei-Fang Hsiao
Chen-Kuei Yang
Po-Fu Hsiao
Wan-Hsuan Yu
Multimedia Tools and Applications, 2020, 79 : 15829 - 15857
[2] Unsupervised Segmentation of RGB-D Images
Deng, Zhuo
Latecki, Longin Jan
COMPUTER VISION - ACCV 2014, PT III, 2015, 9005 : 423 - 435
[3] 3D Hand Pose Detection in Egocentric RGB-D Images
Rogez, Gregory
Khademi, Maryam
Supancic, J. S., III
Montiel, J. M. M.
Ramanan, Deva
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 356 - 371
[4] 3D Texture Recognition for RGB-D Images
Zhong, Guoqiang
Mao, Xin
Shi, Yaxin
Dong, Junyu
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528
[5] 2D-Driven 3D Object Detection in RGB-D Images
Lahoud, Jean
Ghanem, Bernard
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4632 - 4640
[6] Expandable YOLO: 3D Object Detection from RGB-D Images
Takahashi, Masahiro
Ji, Yonghoon
Umeda, Kazunori
Moro, Alessandro
2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
[7] Unsupervised Human Activity Detection with Skeleton Data From RGB-D Sensor
Ong, Wee-Hong
Koseki, Takafumi
Palafox, Leon
2013 FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN), 2013, : 30 - 35
[8] An automatic 2D to 3D video conversion approach based on RGB-D images
Pan, Baiyu
Zhang, Liming
Yin, Hanxiong
Lan, Jun
Cao, Feilong
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (13) : 19179 - 19201
[9] An automatic 2D to 3D video conversion approach based on RGB-D images
Baiyu Pan
Liming Zhang
Hanxiong Yin
Jun Lan
Feilong Cao
Multimedia Tools and Applications, 2021, 80 : 19179 - 19201
[10] Basic 3D Solid Recognition in RGB-D Images
Kornuta, Tomasz
Stefanczyk, Maciej
Kasprzak, Wlodzimierz
RECENT ADVANCES IN AUTOMATION, ROBOTICS AND MEASURING TECHNIQUES, 2014, 267 : 421 - 430

← 1 2 3 4 5 →