Application of Video Scene Semantic Recognition Technology in Smart Video

Cited by: 11

Authors:

Qin, Lele [1]

Kang, Lihua [2]

Affiliations:

[1] Hebei Univ Sci & Technol, Sch Econ Management, 70 East Yuhua Rd, Shijiazhuang 050018, Hebei, Peoples R China

[2] Hebei Univ Sci & Technol, Sch Civil Engn, 70 East Yuhua Rd, Shijiazhuang 050018, Hebei, Peoples R China

Source:

TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2018 / Vol. 25 / Issue 05

Keywords:

Convolutional Neural Network (CNN); deep learning; keyframe; semantic recognition; smart video; SIMULATION; TRACKING; MODEL;

DOI:

10.17559/TV-20180620082101

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

Video behaviour recognition and semantic understanding are important components of intelligent video analytics. Traditional human behaviour recognition suffers from low efficiency and poor accuracy. For example, most existing behaviour recognition methods take as input video frames obtained by even segmentation and fixed-interval sampling, which may lose important information between sampling points, fail to identify the key frames of the video segments, and fail to use contextual semantics to understand the current behaviour. To improve the capacity and efficiency of semantic understanding of video segments, this paper adopts a three-layer semantic recognition approach based on key frame extraction. At the bottom layer, the method segments the video, extracts the key frames of each segment, and obtains a preliminary understanding of the basic semantics: the identities of the persons, their behaviours, and the environment. This preliminary information is then passed to the middle layer for semantic integration, where a loss function is used to learn the latent relationships between the semantics of different modalities, improving both the integration capacity and the robustness of the character semantic integration. Finally, overall fine-tuning adjusts all the parameters of the network to complete the semantic recognition task for the video scene. The method achieves higher recognition accuracy on certain datasets and can effectively recognize the semantics of characters and behaviours in videos. In practical testing, combining key frame extraction with video scene semantic recognition improved the accuracy and effectiveness of recognizing character semantics in video.
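
The abstract contrasts fixed-interval sampling with key frame extraction but does not say how the key frames are chosen. The following is a minimal sketch of the contrast only, assuming a simple frame-difference heuristic that is not taken from the paper; the `Frame` type, the function names, and the threshold value are all hypothetical.

```python
from typing import List, Sequence

# Hypothetical representation: a frame flattened to grayscale intensities.
Frame = Sequence[float]


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Mean absolute per-pixel difference between two equally sized frames."""
    if len(a) != len(b):
        raise ValueError("frames must have the same number of pixels")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def uniform_sample(num_frames: int, step: int) -> List[int]:
    """Baseline: frame indices chosen by fixed-interval sampling."""
    return list(range(0, num_frames, step))


def select_key_frames(frames: Sequence[Frame], threshold: float) -> List[int]:
    """Indices of frames that differ from the last key frame by more than threshold.

    Assumed heuristic for illustration, not the authors' extraction method.
    """
    if not frames:
        return []
    keys = [0]  # the first frame always opens a segment
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[i], frames[keys[-1]]) > threshold:
            keys.append(i)
    return keys


# Synthetic clip: a static scene, a one-frame change at index 5, then static again.
static = [0.0] * 16
event = [1.0] * 16
clip = [static] * 5 + [event] + [static] * 6  # 12 frames in total

print(uniform_sample(len(clip), step=4))       # [0, 4, 8]
print(select_key_frames(clip, threshold=0.5))  # [0, 5, 6]
```

In this toy clip, fixed sampling with a step of 4 never lands on frame 5, so the brief event is lost between sampling points, while the difference-based selection keeps both the frame where the change begins and the frame where the scene returns to static.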

Citation

Pages: 1429-1436

Page count: 8

References: 36 in total

