Text query based summarized event searching interface system using deep learning over cloud

Cited by: 43
Authors
Kumar, Krishan [1]
Affiliations
[1] Natl Inst Technol, Dept Comp Sci & Engn, Srinagar, Uttarakhand, India
Keywords
Deep learning; DNA sequence; Searching; Event summarization; Local alignment; Text query; Video
DOI
10.1007/s11042-020-10157-4
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In the digital era, multimedia data is growing at a rapid pace, which demands summarization techniques that are both effective and efficient. Such techniques are required so that users can quickly access video content recorded by multiple cameras over a given period. At present, it is very challenging to manage and search large amounts of multi-view video data, which contain inter-view dependencies, significant illumination changes, and many low-activity frames. This work presents an efficient technique that summarizes the events in such multi-view videos and then searches them over the cloud through a text query. A deep learning framework is employed to extract the features of moving objects in the frames. The inter-view dependencies among the multiple views of the video are captured via local alignment. Parallel virtual machines (VMs) in the cloud environment are used to process multiple video clips independently at the same time. Object tracking is applied to filter out the low-activity frames. Experimental results indicate that the model successfully reduces the video content while preserving the important information in the form of events. A computational analysis also indicates that it meets the requirements of real-time applications.
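The abstract states that inter-view dependencies are captured via local alignment, and the keywords mention DNA sequences. As an illustration only, the following is a minimal sketch of Smith-Waterman-style local alignment scoring. It assumes that each camera view has already been reduced to a sequence of discrete symbols (for example, one label per frame); that encoding, the function name `local_alignment`, and the scoring values are hypothetical and are not taken from the paper.

```python
def local_alignment(a, b, match=2, mismatch=-1, gap=-1):
    """Score the best local alignment between two symbol sequences.

    Returns (best_score, (end_i, end_j)), where end_i and end_j are the
    1-based positions in a and b at which the best-scoring local
    alignment ends. The scoring values are illustrative assumptions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # H[i][j] holds the best score of a local alignment ending at a[i-1], b[j-1].
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = H[i - 1][j] + gap      # gap in b
            left = H[i][j - 1] + gap    # gap in a
            # Clamping at 0 lets an alignment restart anywhere (local, not global).
            H[i][j] = max(0, diag, up, left)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)
    return best, best_pos


if __name__ == "__main__":
    # Hypothetical per-frame labels for two camera views.
    view_1 = ["walk", "run", "fall"]
    view_2 = ["sit", "walk", "run", "fall", "sit"]
    print(local_alignment(view_1, view_2))
```

A high score with end positions in both sequences suggests that the two views share a common run of activity ending at those frames. How the paper actually encodes frames and uses the alignment result is not specified in this record.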
Pages: 11079-11094
Page count: 16
Related papers
35 in total
[1]   Online video summarization on compressed domain [J].
Almeida, Jurandy ;
Leite, Neucimar J. ;
Torres, Ricardo da S. .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (06) :729-738
[2]   [Anonymous], 2015, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
[3]   Ayguade E, 2007, SMITH WATERMAN ALGOR
[4]   Adaptive Learning for Target Tracking and True Linking Discovering Across Multiple Non-Overlapping Cameras [J].
Chen, Kuan-Wen ;
Lai, Chih-Chuan ;
Lee, Pei-Jyun ;
Chen, Chu-Song ;
Hung, Yi-Ping .
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) :625-638
[5]   Rushes video parsing using video sequence alignment [J].
Dumont, Emilie ;
Merialdo, Bernard .
CBMI: 2009 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2009, :44-49
[6]   Multi-View Video Summarization [J].
Fu, Yanwei ;
Guo, Yanwen ;
Zhu, Yanshu ;
Liu, Feng ;
Song, Chuanming ;
Zhou, Zhi-Hua .
IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (07) :717-729
[7]   Unsupervised t-Distributed Video Hashing and Its Deep Hashing Extension [J].
Hao, Yanbin ;
Mu, Tingting ;
Goulermas, John Y. ;
Jiang, Jianguo ;
Hong, Richang ;
Wang, Meng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) :5531-5544
[8]   Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud [J].
Hong, Richang ;
Li, Lei ;
Cai, Junjie ;
Tao, Dapeng ;
Wang, Meng ;
Tian, Qi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (09) :4128-4138
[9]   Saliency detection based on directional patches extraction and principal local color contrast [J].
Jian, Muwei ;
Zhang, Wenyin ;
Yu, Hui ;
Cui, Chaoran ;
Nie, Xiushan ;
Zhang, Huaxiang ;
Yin, Yilong .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 57 :1-11
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90