Recognition of Human Continuous Action with 3D CNN

被引：2

作者：

Yu, Gang ^{[1
]}

Li, Ting ^{[1
]}

机构：

[1] Harbin Inst Technol, Shenzhen Grad Sch, Shenzhen 518055, Guangdong, Peoples R China

来源：

COMPUTER VISION SYSTEMS, ICVS 2017 | 2017年 / 10528卷

关键词：

Human continuous action recognition; 3D CNN; KNN; Improved L-K optical flow; Gabor filter;

D O I：

10.1007/978-3-319-68345-4_28

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Under the boom of the service robot, the human continuous action recognition becomes an indispensable research. In this paper, we propose a continuous action recognition method based on multi-channel 3D CNN for extracting multiple features, which are classified with KNN. First, we use fragmentary action as training samples which can be identified in the process of action. Then the training samples are processed through the gray scale, improved L-K optical flow and Gabor filter, to extract the characteristics of diversification using a priori knowledge. Then the 3D CNN is constructed to process multi-channel features that are formed into 128-dimension feature maps. Finally, we use KNN to classify those samples. We find that the fragmentary action in continuous action of the identification showed a good robustness. And the proposed method is verified in HMDB-51 and UCF-101 to be more accurate than Gaussian Bayes or the single 3D CNN in action recognition.

引用

页码：314 / 322

页数：9

共 46 条

[21] ENHANCED 3D TREE MODEL SIMPLIFICATION AND PERCEPTUAL ANALYSIS [J].

Lee, Jessy ;

Kuo, May-Chen ;

Kuo, C-C Jay .

ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, :1250-+

[22] Diagnostic system for 3D ultrasonography based on gabor filter [J].

Chen W.-M. ;

Chang W.-L. ;

Chang C.-C. .

Journal of Multimedia, 2010, 5 (06) :613-621

[23] A visual cognizance based multi-resolution descriptor for human action recognition using key pose [J].

Vishwakarma, Dinesh Kumar ;

Singh, Tej .

AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2019, 107 :157-169

[24] An Automatic 3D Facial Landmarking Algorithm Using 2D Gabor Wavelets [J].

de Jong, Markus A. ;

Wollstein, Andreas ;

Ruff, Clifford ;

Dunaway, David ;

Hysi, Pirro ;

Spector, Tim ;

Liu, Fan ;

Niessen, Wiro ;

Koudstaal, Maarten J. ;

Kayser, Manfred ;

Wolvius, Eppo B. ;

Bohringer, Stefan .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) :580-588

[25] ROBUST SEGMENTATION OF CORONARY ARTERIES IN CINE ANGIOGRAPHY FOR 3D MODELING [J].

Zwettler, Gerald A. ;

Swoboda, Roland ;

Backfrieder, Werner ;

Steinwender, Clemens ;

Leisch, Franz ;

Gabriel, Christian .

INTERNATIONAL MEDITERRANEAN MODELLING MULTICONFERENCE 2006, 2006, :675-+

[26] 3D local circular difference patterns for biomedical image retrieval [J].

Mohite, Nilima ;

Waghmare, Laxman ;

Gonde, Anil ;

Vipparthi, Santoshkumar .

INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2019, 8 (02) :115-125

[27] A Genetic Algorithm-Based 3D Feature Selection for Lip Reading [J].

Morade, Sunil Sudam ;

Patnaik, Suprava .

2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,

[28] A 3D Obstacle Classification Method in Point Clouds Using K-NN [J].

Tian, Yifei ;

Song, Wei ;

Fong, Simon ;

Zou, Shuanghui ;

Lee, Euy Soo ;

Jongtae, Rhee .

BDIOT 2018: PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS, 2018, :76-79

[29] Automatic co-registration of 3D multi-sensor point clouds [J].

Persad, Ravi Ancil ;

Armenakis, Costas .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :162-186

[30] Human Activity Recognition Based on Smart Phone's 3-Axis Acceleration Sensor [J].

Cai, Shubin ;

Shan, Zhiguang ;

Zeng, Tian ;

Yin, Jianfei ;

Ming, Zhong .

SMART COMPUTING AND COMMUNICATION, SMARTCOM 2016, 2017, 10135 :163-172

← 1 2 3 4 5 →