A combined multiple action recognition and summarization for surveillance video sequences

Cited by: 71

Authors:

Elharrouss, Omar [1]

Almaadeed, Noor [1]

Al-Maadeed, Somaya [1]

Bouridane, Ahmed [2]

Beghdadi, Azeddine [3]

Affiliations:

[1] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar

[2] Northumbria Univ Newcastle, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England

[3] Galilee Inst Sorbonne Paris Nord Univ France, Paris, France

Source:

APPLIED INTELLIGENCE | 2021 / Vol. 51 / Issue 02

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

Video summarization; Human action recognition; CNN; HOG; TDMap;

DOI:

10.1007/s10489-020-01823-z

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Human action recognition and video summarization are challenging tasks in several computer vision applications, including video surveillance, criminal investigations, and sports applications. In long videos, it is difficult to search for a specific action and/or person. Human action recognition approaches in the literature usually deal with videos that contain only a single person and recognize that person's action. This paper proposes an effective approach to multiple human action detection, recognition, and summarization. The multiple action detection step extracts the silhouettes of the human bodies, then generates a separate sequence for each of them using a motion detection and tracking method. Each extracted sequence is then divided into shots that represent homogeneous actions, using the similarity between each pair of frames. Using the histogram of oriented gradients (HOG) of the Temporal Difference Map (TDMap) of the frames of each shot, the action is recognized by comparing the generated HOG with the HOGs computed in the training phase, which represent many actions from a set of training videos. The action is also recognized from the TDMap images using a proposed CNN model. Action summarization is performed for each detected person. The efficiency of the proposed approach is shown through the obtained results, mainly for multi-action detection and recognition.
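The HOG-of-TDMap matching step described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the abstract does not define the TDMap, so `temporal_difference_map` uses a normalized sum of absolute differences between consecutive frames; `hog_descriptor` is a simplified HOG (per-cell histograms, no block normalization); and `recognize` uses a nearest-neighbor Euclidean comparison as a stand-in for the comparison against training HOGs. All function and parameter names are hypothetical.

```python
import numpy as np


def temporal_difference_map(frames):
    """Accumulate absolute differences between consecutive grayscale frames.

    Assumption: a normalized sum of |frame[t+1] - frame[t]| stands in for
    the paper's TDMap, whose exact definition is not given in the abstract.
    """
    frames = np.asarray(frames, dtype=np.float64)
    tdmap = np.abs(np.diff(frames, axis=0)).sum(axis=0)
    peak = tdmap.max()
    return tdmap / peak if peak > 0 else tdmap


def hog_descriptor(image, cell=8, bins=9):
    """Simplified HOG: magnitude-weighted histograms of unsigned gradient
    orientation per cell, L2-normalized over the whole image."""
    gy, gx = np.gradient(np.asarray(image, dtype=np.float64))
    magnitude = np.hypot(gx, gy)
    orientation = np.rad2deg(np.arctan2(gy, gx)) % 180.0
    # Clamp guards against an orientation that rounds to exactly 180 degrees.
    bin_index = np.minimum((orientation / (180.0 / bins)).astype(int), bins - 1)
    rows, cols = image.shape[0] // cell, image.shape[1] // cell
    hist = np.zeros((rows, cols, bins))
    for r in range(rows):
        for c in range(cols):
            window = (slice(r * cell, (r + 1) * cell),
                      slice(c * cell, (c + 1) * cell))
            hist[r, c] = np.bincount(bin_index[window].ravel(),
                                     weights=magnitude[window].ravel(),
                                     minlength=bins)
    vec = hist.ravel()
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec


def recognize(shot_frames, training_hogs):
    """Return the label whose stored HOG is closest to the shot's HOG.

    training_hogs: dict mapping an action label to a descriptor computed
    with hog_descriptor on frames of the same size (hypothetical layout).
    """
    query = hog_descriptor(temporal_difference_map(shot_frames))
    return min(training_hogs,
               key=lambda label: np.linalg.norm(training_hogs[label] - query))
```

In this sketch a shot is a stack of equally sized grayscale frames with shape `(T, H, W)`; the training dictionary would be built by running the same two functions over labeled training shots.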


Pages: 690-712

Page count: 23

