Detection of Event of Interest for Satellite Video Understanding

被引:29
作者
Gu, Yanfeng [1 ]
Wang, Tengfei [1 ]
Jin, Xudong [1 ]
Gao, Guoming [1 ]
机构
[1] Harbin Inst Technol, Dept Informat Engn, Harbin 150001, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2020年 / 58卷 / 11期
关键词
Event detection; event of interest (EOI); satellite video; two-stream framework; video understanding; SEGMENTATION; RECOGNITION;
D O I
10.1109/TGRS.2020.2984656
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Satellite videos provide rich dynamic information of observed scenes at a large spatial and temporal scale and will play an important role in the future space information network. This work devotes to revealing events of interest (EOI) from satellite video scenes by using a two-stream method. In satellite videos, individual frames reflect the static information like the basic scenes where the event was happening, while a sequence of frames determines the motion information. Considering these facts, a novel two-stream EOI detection framework is proposed, where one stream extracts static spatial information of satellite videos by AlexNet, whereas the other stream extracts the motion information using a local trajectories analysis method. First, the whole video scene is segmented into small spatial-temporal patches, where labeling EOI and non-EOI is completed. Next, the trajectories are extracted from 3-D satellite video cubes that are generated from event scene patches. Finally, this trajectory classification process is treated as a weak supervision learning problem and solved by sparse dictionary learning. The experimental results demonstrate that the proposed two-stream method is effective for EOI detection and has a huge potential for satellite video scenes analysis and understanding. The proposed method also outperforms the existing competitive models for video analysis.
引用
收藏
页码:7860 / 7871
页数:12
相关论文
共 42 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]   Moving vehicle detection, tracking and traffic parameter estimation from a satellite video: a perspective on a smarter city [J].
Ahmadi, Seyed Ali ;
Ghorbanian, Arsalan ;
Mohammadzadeh, Ali .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2019, 40 (22) :8379-8394
[3]  
Ahmadi SA, 2017, JOINT URB REMOTE SEN
[4]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[5]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[6]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[7]   Sparse Coding Guided Spatiotemporal Feature Learning for Abnormal Event Detection in Large Videos [J].
Chu, Wenqing ;
Xue, Hongyang ;
Yao, Chengwei ;
Cai, Deng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) :246-255
[8]  
Direkoglu C, 2017, 2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS)
[9]   Object Tracking in Satellite Videos by Fusing the Kernel Correlation Filter and the Three-Frame-Difference Algorithm [J].
Du, Bo ;
Sun, Yujia ;
Cai, Shihan ;
Wu, Chen ;
Du, Qian .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (02) :168-172
[10]   Convolutional Two-Stream Network Fusion for Video Action Recognition [J].
Feichtenhofer, Christoph ;
Pinz, Axel ;
Zisserman, Andrew .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1933-1941