A Weakly-Supervised Cross-Domain Query Framework for Video Camouflage Object Detection

被引:1
作者
Lu, Zelin [1 ]
Xie, Liang [1 ]
Zhao, Xing [1 ]
Xu, Binwei [2 ]
Liang, Haoran [1 ]
Liang, Ronghua [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Ningbo Univ, Fac Informat Sci & Engn, Ningbo 315211, Peoples R China
基金
中国国家自然科学基金;
关键词
Annotations; Object detection; Optical flow; Accuracy; Motion segmentation; Feature extraction; Memory management; Circuits and systems; Visualization; Computer vision; Video camouflaged object detection; memory network; weakly supervised; SEGMENTATION; NET;
D O I
10.1109/TCSVT.2024.3470801
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
VCOD (Video Camouflage Object Detection) is a crucial security technology that identifies camouflaged objects in videos, bolstering security measures across diverse applications. On one hand, appearance-based VCOD methods face challenges because camouflaged appearances cause objects to blend into their surroundings, and current VCOD methods typically utilize optical flow to represent motion information. However, over-reliance on accurate estimation renders the model overly fragile. On the other hand, there is a shortage of effectively annotated camouflaged video datasets, coupled with the time-consuming and labor-intensive annotation process, severely constraining the development of this field. To address this, we propose a novel weakly-supervised framework for VCOD based on cross-domain querying of preceding and succeeding frames. Specifically, we propose a time-efficient and labor-saving manual annotation approach based on large visual models to rapidly generate pseudo-labels. Furthermore, we design a network based on Spatio-Temporal Memory (STM) that performs cross-modal feature querying with the current frame against preceding and succeeding frames to acquire useful information, thereby enhancing the focus on temporal information. Extensive experiments conducted on two common VCOD datasets have proven the effectiveness of our method, achieving state-of-the-art performance on the challenging camouflaged video data.
引用
收藏
页码:1506 / 1518
页数:13
相关论文
共 87 条
[1]   MoA-Net: Self-supervised Motion Segmentation [J].
Bideau, Pia ;
Menon, Rakesh R. ;
Learned-Miller, Erik .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI, 2019, 11134 :715-730
[2]   It's Moving! A Probabilistic Model for Causal Motion Segmentation in Moving Camera Videos [J].
Bideau, Pia ;
Learned-Miller, Erik .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :433-449
[3]   The robust estimation of multiple motions: Parametric and piecewise-smooth flow fields [J].
Black, MJ ;
Anandan, P .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1996, 63 (01) :75-104
[4]  
Brox T., 2010, P EUR C COMP VIS ECC, P9
[5]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[6]   Exploring Rich and Efficient Spatial Temporal Interactions for Real-Time Video Salient Object Detection [J].
Chen, Chenglizhao ;
Wang, Guotao ;
Peng, Chong ;
Fang, Yuming ;
Zhang, Dingwen ;
Qin, Hong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3995-4007
[7]   SAM-Adapter: Adapting Segment Anything in Underperformed Scenes [J].
Chen, Tianrun ;
Zhu, Lanyun ;
Ding, Chaotao ;
Cao, Runlong ;
Wang, Yan ;
Zhang, Shangzhan ;
Li, Zejian ;
Sun, Lingyun ;
Zang, Ying ;
Mao, Papa .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, :3359-3367
[8]  
Cheng H. K., 2022, PROC EUR C COMPUT VI
[9]   Implicit Motion Handling for Video Camouflaged Object Detection [J].
Cheng, Xuelian ;
Xiong, Huan ;
Fan, Deng-Ping ;
Zhong, Yiran ;
Harandi, Mehrtash ;
Drummond, Tom ;
Ge, Zongyuan .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :13854-13863
[10]   Applications for deep learning in ecology [J].
Christin, Sylvain ;
Hervet, Eric ;
Lecomte, Nicolas .
METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (10) :1632-1644