A Weakly-Supervised Cross-Domain Query Framework for Video Camouflage Object Detection

Cited by: 0

Authors:

Lu, Zelin [1]

Xie, Liang [1]

Zhao, Xing [1]

Xu, Binwei [2]

Liang, Haoran [1]

Liang, Ronghua [1]

Affiliations:

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China

[2] Ningbo Univ, Fac Informat Sci & Engn, Ningbo 315211, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2025 / Vol. 35 / No. 02

Funding:

National Natural Science Foundation of China;

Keywords:

Annotations; Object detection; Optical flow; Accuracy; Motion segmentation; Feature extraction; Memory management; Circuits and systems; Visualization; Computer vision; Video camouflaged object detection; memory network; weakly supervised; SEGMENTATION; NET;

DOI:

10.1109/TCSVT.2024.3470801

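Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808; 0809;

Abstract: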

Video Camouflage Object Detection (VCOD) is a crucial security technology that identifies camouflaged objects in videos, bolstering security measures across diverse applications. On the one hand, appearance-based VCOD methods struggle because camouflage causes objects to blend into their surroundings, and current VCOD methods typically use optical flow to represent motion information. However, over-reliance on accurate optical flow estimation makes these models fragile. On the other hand, effectively annotated camouflaged video datasets are scarce, and annotation is time-consuming and labor-intensive, which severely constrains the development of the field. To address these issues, we propose a novel weakly-supervised framework for VCOD based on cross-domain querying of preceding and succeeding frames. Specifically, we propose a time-efficient and labor-saving manual annotation approach based on large visual models to rapidly generate pseudo-labels. Furthermore, we design a network based on Spatio-Temporal Memory (STM) that performs cross-modal feature querying of the current frame against preceding and succeeding frames to acquire useful information, thereby enhancing the focus on temporal information. Extensive experiments on two common VCOD datasets demonstrate the effectiveness of our method, which achieves state-of-the-art performance on challenging camouflaged video data.
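
The abstract does not give architectural details, so the following is only a minimal sketch of a generic memory read in the style of spatio-temporal memory networks, not the authors' network. It assumes each frame has already been encoded into per-location key and value vectors; the current frame's keys act as queries against a memory built from the preceding and succeeding frames. All names (`memory_read`, `prev_keys`, `next_vals`, and so on) and the toy feature values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def memory_read(query_keys, memory_keys, memory_values):
    """For each query location, return the attention-weighted sum of memory values.

    Attention weights are a softmax over dot-product similarity between the
    query key and every memory key (assumed scheme, for illustration only).
    """
    value_dim = len(memory_values[0])
    readout = []
    for q in query_keys:
        weights = softmax([dot(q, k) for k in memory_keys])
        readout.append([
            sum(w * v[d] for w, v in zip(weights, memory_values))
            for d in range(value_dim)
        ])
    return readout


# Hypothetical toy features: two spatial locations per frame, 2-D keys and values.
prev_keys = [[1.0, 0.0], [0.0, 1.0]]   # preceding frame
prev_vals = [[1.0, 0.0], [0.0, 0.0]]   # location 0 carries "object" evidence
next_keys = [[1.0, 0.0], [0.0, 1.0]]   # succeeding frame
next_vals = [[1.0, 0.0], [0.0, 0.0]]
cur_keys = [[5.0, 0.0], [0.0, 5.0]]    # current frame, used as queries

# The memory concatenates locations from both neighbouring frames.
memory_keys = prev_keys + next_keys
memory_values = prev_vals + next_vals

readout = memory_read(cur_keys, memory_keys, memory_values)
```

In this toy setup, the first query location matches the memory locations that carry object evidence, so its readout is dominated by their values, while the second location retrieves almost none of it. A real model would operate on learned convolutional or transformer features rather than hand-written vectors.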


Pages: 1506-1518

Page count: 13
