Rectifying Noisy Labels with Sequential Prior: Multi-scale Temporal Feature Affinity Learning for Robust Video Segmentation

被引:1
|
作者
Cui, Beilei [1 ]
Zhang, Minqing [2 ]
Xu, Mengya [3 ]
Wang, An [1 ]
Yuan, Wu [2 ]
Ren, Hongliang [1 ,3 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Biomed Engn, Hong Kong, Peoples R China
[3] Natl Univ Singapore, Dept Biomed Engn, Singapore, Singapore
关键词
Noisy label learning; Feature affinity; Semantic segmentation; MEDICAL IMAGE SEGMENTATION;
D O I
10.1007/978-3-031-43996-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Noisy label problems are inevitably in existence within medical image segmentation causing severe performance degradation. Previous segmentation methods for noisy label problems only utilize a single image while the potential of leveraging the correlation between images has been overlooked. Especially for video segmentation, adjacent frames contain rich contextual information beneficial in cognizing noisy labels. Based on two insights, we propose a Multi-Scale Temporal Feature Affinity Learning (MS-TFAL) framework to resolve noisy-labeled medical video segmentation issues. First, we argue the sequential prior of videos is an effective reference, i.e., pixel-level features from adjacent frames are close in distance for the same class and far in distance otherwise. Therefore, Temporal Feature Affinity Learning (TFAL) is devised to indicate possible noisy labels by evaluating the affinity between pixels in two adjacent frames. We also notice that the noise distribution exhibits considerable variations across video, image, and pixel levels. In this way, we introduce Multi-Scale Supervision (MSS) to supervise the network from three different perspectives by re-weighting and refining the samples. This design enables the network to concentrate on clean samples in a coarse-to-fine manner. Experiments with both synthetic and real-world label noise demonstrate that our method outperforms recent state-of-the-art robust segmentation approaches. Code is available at https://github. com/BeileiCui/MS-TFAL.
引用
收藏
页码:90 / 100
页数:11
相关论文
共 50 条
  • [1] Rectifying Noisy Labels with Sequential Prior: Multi-scale Temporal Feature Affinity Learning for Robust Video Segmentation
    Cui, Beilei
    Zhang, Minqing
    Xu, Mengya
    Wang, An
    Yuan, Wu
    Ren, Hongliang
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14228 LNCS : 90 - 100
  • [2] Multi-scale Spatial-Temporal Feature Aggregating for Video Salient Object Segmentation
    Mu, Changhong
    Yuan, Zebin
    Ouyang, Xiuqin
    Wang, Bo
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 224 - 229
  • [3] Scale-teaching: Robust Multi-scale Training for Time Series Classification with Noisy Labels
    Liu, Zhen
    Ma, Peitian
    Chen, Dongliang
    Pei, Wenbin
    Ma, Qianli
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Multi-scale Deep Feature Transfer for Automatic Video Object Segmentation
    Zhen Yang
    Qingxuan Shi
    Yichuan Fang
    Neural Processing Letters, 2023, 55 : 11701 - 11719
  • [5] Multi-scale Deep Feature Transfer for Automatic Video Object Segmentation
    Yang, Zhen
    Shi, Qingxuan
    Fang, Yichuan
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11701 - 11719
  • [6] Video anomaly detection with multi-scale feature and temporal information fusion
    Cai, Yiheng
    Liu, Jiaqi
    Guo, Yajun
    Hu, Shaobin
    Lang, Shinan
    NEUROCOMPUTING, 2021, 423 : 264 - 273
  • [7] Multi-view Robust Discriminative Feature Learning for Remote Sensing Image with Noisy Labels
    Jinyong Chen
    Guisheng Yin
    Kang Sun
    Yuxin Dong
    Mobile Networks and Applications, 2022, 27 : 2487 - 2505
  • [8] Multi-view Robust Discriminative Feature Learning for Remote Sensing Image with Noisy Labels
    Chen, Jinyong
    Yin, Guisheng
    Sun, Kang
    Dong, Yuxin
    MOBILE NETWORKS & APPLICATIONS, 2022, 27 (06): : 2487 - 2505
  • [9] Temporal Multi-Scale Complementary Feature for Video Person Re-Identification
    Hou R.-B.
    Chang H.
    Ma B.-P.
    Huang R.
    Shan S.-G.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (01): : 31 - 50
  • [10] SegR-Net: A deep learning framework with multi-scale feature fusion for robust retinal vessel segmentation
    Ryu, Jihyoung
    Rehman, Mobeen Ur
    Nizami, Imran Fareed
    Chong, Kil To
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163