A Weakly-Supervised Cross-Domain Query Framework for Video Camouflage Object Detection

Cited by: 0
Authors
Lu, Zelin [1 ]
Xie, Liang [1 ]
Zhao, Xing [1 ]
Xu, Binwei [2 ]
Liang, Haoran [1 ]
Liang, Ronghua [1 ]
Affiliations
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Ningbo Univ, Fac Informat Sci & Engn, Ningbo 315211, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Annotations; Object detection; Optical flow; Accuracy; Motion segmentation; Feature extraction; Memory management; Circuits and systems; Visualization; Computer vision; Video camouflaged object detection; memory network; weakly supervised; SEGMENTATION; NET;
DOI
10.1109/TCSVT.2024.3470801
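Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract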
Video Camouflage Object Detection (VCOD) is a security technology that identifies camouflaged objects in videos and supports security measures across diverse applications. On the one hand, appearance-based VCOD methods struggle because camouflage causes objects to blend into their surroundings, so current VCOD methods typically use optical flow to represent motion information. However, over-reliance on accurate optical-flow estimation makes these models fragile. On the other hand, effectively annotated camouflaged video datasets are scarce, and annotation is time-consuming and labor-intensive, which severely constrains the development of the field. To address both problems, we propose a novel weakly-supervised framework for VCOD based on cross-domain querying of preceding and succeeding frames. Specifically, we propose a time-efficient and labor-saving manual annotation approach based on large visual models to rapidly generate pseudo-labels. Furthermore, we design a network based on Spatio-Temporal Memory (STM) that performs cross-modal feature querying with the current frame against preceding and succeeding frames to acquire useful information, thereby enhancing the focus on temporal information. Extensive experiments on two common VCOD datasets demonstrate the effectiveness of our method, which achieves state-of-the-art performance on challenging camouflaged video data.
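The abstract does not spell out how the current frame queries its neighbouring frames. As a rough illustration only, the sketch below shows the generic soft-attention memory read commonly used in space-time memory style networks: features of the current frame act as queries, and features of the preceding and succeeding frames act as a key/value memory. The function name `memory_read`, the vector shapes, and the toy data are all assumptions made for this example and are not taken from the paper.

```python
import math


def memory_read(query_keys, memory_keys, memory_values):
    """Generic soft-attention memory read (illustrative, not the paper's network).

    query_keys:    list of Nq key vectors (length d) from the current frame
    memory_keys:   list of Nm key vectors (length d) from neighbouring frames
    memory_values: list of Nm value vectors (length c) from neighbouring frames
    returns:       list of Nq read-out vectors (length c)
    """
    readouts = []
    dim = len(memory_values[0])
    for q in query_keys:
        # Dot-product similarity between this query location and every memory location.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in memory_keys]
        # Numerically stable softmax over the memory locations.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # The read-out is the attention-weighted sum of the memory values.
        readouts.append(
            [sum(w * v[j] for w, v in zip(weights, memory_values)) for j in range(dim)]
        )
    return readouts


# Toy data (hypothetical): one query location in the current frame and two
# memory locations, e.g. one from the preceding and one from the succeeding frame.
query = [[1.0, 0.0]]
mem_keys = [[10.0, 0.0], [0.0, 10.0]]  # the first memory key aligns with the query
mem_vals = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
out = memory_read(query, mem_keys, mem_vals)
```

In this toy case the read-out is dominated by the first memory value, because its key matches the query. A real model would apply the same idea to dense feature maps from learned encoders rather than hand-written vectors.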
Pages: 1506-1518
Number of pages: 13
Related papers
50 in total
  • [1] Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation
    Inoue, Naoto
    Furuta, Ryosuke
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5001 - 5009
  • [2] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
    Zhu, Fan
    Shao, Ling
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) : 42 - 59
  • [3] Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition
    Fan Zhu
    Ling Shao
    International Journal of Computer Vision, 2014, 109 : 42 - 59
  • [4] Weakly-Supervised Cross-Domain Adaptation for Endoscopic Lesions Segmentation
    Dong, Jiahua
    Cong, Yang
    Sun, Gan
    Yang, Yunsheng
    Xu, Xiaowei
    Ding, Zhengming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 2020 - 2033
  • [5] DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection
    Tang, Zongheng
    Sun, Yifan
    Liu, Si
    Yang, Yi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11422 - 11432
  • [6] Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection
    Hou, Luwei
    Zhang, Yu
    Fu, Kui
    Li, Jia
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9924 - 9933
  • [7] Weakly-Supervised RGBD Video Object Segmentation
    Yang, Jinyu
    Gao, Mingqi
    Zheng, Feng
    Zhen, Xiantong
    Ji, Rongrong
    Shao, Ling
    Leonardis, Ales
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2158 - 2170
  • [8] Query-Memory Re-Aggregation for Weakly-supervised Video Object Segmentation
    Lin, Fanchao
    Xie, Hongtao
    Li, Yan
    Zhang, Yongdong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2038 - 2046
  • [9] Weakly-Supervised Cross-Domain Segmentation of Electron Microscopy With Sparse Point Annotation
    Qiu, Dafei
    Xiong, Shan
    Yi, Jiajin
    Peng, Jialin
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (02) : 359 - 371
  • [10] Weakly-Supervised Domain Adaptation With Adversarial Entropy for Building Segmentation in Cross-Domain Aerial Imagery
    Yao, Xuedong
    Wang, Yandong
    Wu, Yanlan
    Liang, Zeyu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8407 - 8418