Cross-frame keypoint-based and spatial motion information-guided networks for moving vehicle detection and tracking in satellite videos

被引:48
|
作者
Feng, Jie [1 ]
Zeng, Dening [1 ]
Jia, Xiuping [2 ]
Zhang, Xiangrong [1 ]
Li, Jie [3 ]
Liang, Yuping [1 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Shaanxi, Peoples R China
[2] Univ New South Wales, Sch Engn & Informat Technol, Canberra, ACT 2612, Australia
[3] Shanghai Aerosp Elect Technol Inst, Space Platform Business Div, Shanghai 201109, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Keypoint-based detection; Moving vehicle detection; Multi-object tracking; Satellite videos; IMAGES;
D O I
10.1016/j.isprsjprs.2021.05.005
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Deep learning methods have achieved the state-of-the-art performance of object detection and tracking in natural images, such as keypoint-based detectors and appearance/motion-based trackers. However, for small and blurry moving vehicles in satellite videos, keypoint-based detectors cause the missed detection of keypoints and incorrect keypoint matching. In terms of multi-object tracking, it is difficult to track the crowded similar vehicles stably only by using the appearance or motion information. To address these problems, a novel deep learning framework is proposed for moving vehicle detection and tracking in the satellite videos. It is comprised of the cross-frame keypoint-based detection network (CKDNet) and spatial motion information-guided tracking network (SMTNet). In CKDNet, a customized cross-frame module is designed to assist the detection of keypoints by exploiting complementary information between frames. Furthermore, CKDNet improves keypoint matching by incorporating size prediction around the keypoints and defining the soft mismatch suppression for out-of-size keypoint pairs. Based on high-quality detection, SMTNet can track the densely-packed vehicles effectively by constructing two-branch long short-term memories. It extracts not only spatial information of the same frame by considering the relative spatial relationship of neighboring vehicles, but also motion information among consecutive frames by calculating the movement velocity. Especially, it regresses virtual positions for missed or occluded vehicles and keeps on tracking these vehicles while they reappear. Experimental results on Jilin-1 and SkySat satellite videos demonstrate the effectiveness of the proposed detection and tracking methods.
引用
收藏
页码:116 / 130
页数:15
相关论文
共 1 条
  • [1] 3D-MGLNET: MOVING VEHICLE DETECTION IN SATELLITE VIDEOS WITH 3D MOTION-GUIDED LIGHTWEIGHT NETWORK
    Zhu, Xiaoyu
    Li, Jie
    Feng, Jie
    Jiang, Quanpeng
    Zhang, Xiangrong
    Jiao, Licheng
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6478 - 6481