Learning From Box Annotations for Referring Image Segmentation

被引:5
作者
Feng, Guang [1 ]
Zhang, Lihe [1 ]
Hu, Zhiwei [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Proposals; Annotations; Image segmentation; Visualization; Semantics; Training; Noise measurement; Adversarial boundary loss; bounding box (BB) annotation; co-training (Co-T) strategy; weakly supervised referring image segmentation (RIS);
D O I
10.1109/TNNLS.2022.3201372
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Referring image segmentation (RIS) has obtained an impressive achievement by fully convolutional networks (FCNs). However, previous RIS methods require a large number of pixel-level annotations. In this article, we present a weakly supervised RIS method by using bounding box (BB) annotations. In the first stage, we introduce an adversarial boundary loss to extract the object contour from the BB, which is then used to select appropriate region proposals for pseudoground-truth (PGT) generation. In the second stage, we design a co-training (Co-T) strategy to purify the pseudolabels. Specifically, we train two networks and interactively guide them to pick clean labels for each other's networks, which can weaken the effect of noisy labels on model training. Experiment results on four benchmark datasets demonstrate that the proposed method can produce high-quality masks with a speed of 63 frames/s.
引用
收藏
页码:3927 / 3937
页数:11
相关论文
共 50 条
[31]   Multi-Task Deep Learning for Image Segmentation Using Recursive Approximation Tasks [J].
Ke, Rihuan ;
Bugeau, Aurelie ;
Papadakis, Nicolas ;
Kirkland, Mark ;
Schuetz, Peter ;
Schonlieb, Carola-Bibiane .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3555-3567
[32]   Instance-Specific Feature Propagation for Referring Segmentation [J].
Liu, Chang ;
Jiang, Xudong ;
Ding, Henghui .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :3657-3667
[33]   Cross-Modal Progressive Comprehension for Referring Segmentation [J].
Liu, Si ;
Hui, Tianrui ;
Huang, Shaofei ;
Wei, Yunchao ;
Li, Bo ;
Li, Guanbin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :4761-4775
[34]   Unambiguous Scene Text Segmentation With Referring Expression Comprehension [J].
Rong, Xuejian ;
Yi, Chucai ;
Tian, Yingli .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) :591-601
[35]   Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning [J].
Li, Hui ;
Sun, Mingjie ;
Xiao, Jimin ;
Lim, Eng Gee ;
Zhao, Yao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) :5999-6012
[36]   Global Selection and Local Attention Network for Referring Image Segmentation [J].
Ding, Haixin ;
Zhang, Shengchuan ;
Cao, Liujuan .
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VII, 2024, 14431 :284-295
[37]   DSAL: Deeply Supervised Active Learning From Strong and Weak Labelers for Biomedical Image Segmentation [J].
Zhao, Ziyuan ;
Zeng, Zeng ;
Xu, Kaixin ;
Chen, Cen ;
Guan, Cuntai .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (10) :3744-3751
[38]   Self-Supervised Learning for Seismic Image Segmentation From Few-Labeled Samples [J].
Monteiro, Bruno A. A. ;
Oliveira, Hugo ;
dos Santos, Jefersson A. .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[39]   CADFormer: Fine-Grained Cross-Modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation [J].
Liu, Maofu ;
Jiang, Xin ;
Zhang, Xiaokang .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 :14557-14569
[40]   Unsupervised Learning of Image Segmentation Based on Differentiable Feature Clustering [J].
Kim, Wonjik ;
Kanezaki, Asako ;
Tanaka, Masayuki .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :8055-8068