Accurate and Robust Video Saliency Detection via Self-Paced Diffusion

被引:43
|
作者
Li, Yunxiao [1 ]
Li, Shuai [1 ]
Chen, Chenglizhao [1 ,2 ]
Hao, Aimin [1 ]
Qin, Hong [3 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Qingdao Univ, Qingdao 266071, Peoples R China
[3] SUNY Stony Brook, Stony Brook, NY 11794 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Saliency detection; Proposals; Video sequences; Spatial coherence; Computational modeling; Optical imaging; Optical sensors; Video saliency detection; long-term saliency revealing; key frame strategy; self-paced saliency diffusion; OBJECT DETECTION; SEGMENTATION; OPTIMIZATION; FUSION;
D O I
10.1109/TMM.2019.2940851
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional video saliency detection methods frequently follow the common bottom-up thread to estimate video saliency within the short-term fashion. As a result, such methods can not avoid the obstinate accumulation of errors when the collected low-level clues are constantly ill-detected. Also, being noticed that a portion of video frames, which are not nearby the current video frame over the time axis, may potentially benefit the saliency detection in the current video frame. Thus, we propose to solve the aforementioned problem using our newly-designed key frame strategy (KFS), whose core rationale is to utilize both the spatial-temporal coherency of the salient foregrounds and the objectness prior (i.e., how likely it is for an object proposal to contain an object of any class) to reveal the valuable long-term information. We could utilize all this newly-revealed long-term information to guide our subsequent "self-paced" saliency diffusion, which enables each key frame itself to determine its diffusion range and diffusion strength to correct those ill-detected video frames. At the algorithmic level, we first divide a video sequence into short-term frame batches, and the object proposals are obtained in a frame-wise manner. Then, for each object proposal, we utilize a pre-trained deep saliency model to obtain high-dimensional features in order to represent the spatial contrast. Since the contrast computation within multiple neighbored video frames (i.e., the non-local manner) is relatively insensitive to the appearance variation, those object proposals with high-quality low-level saliency estimation frequently exhibit strong similarity over the temporal scale. Next, the long-term common consistency (e.g., appearance models/movement patterns) of the salient foregrounds could be explicitly revealed via similarity analysis accordingly. We further boost the detection accuracy via long-term information guided saliency diffusion in a self-paced manner. We have conducted extensive experiments to compare our method with 16 state-of-the-art methods over 4 largest public available benchmarks, and all results demonstrate the superiority of our method in terms of both accuracy and robustness.
引用
收藏
页码:1153 / 1167
页数:15
相关论文
共 50 条
  • [31] Robust Saliency Detection via Regularized Random Walks Ranking
    Li, Changyang
    Yuan, Yuchen
    Cai, Weidong
    Xia, Yong
    Feng, David Dagan
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2710 - 2717
  • [32] Saliency Detection via Manifold Ranking Based on Robust Foreground
    Ma, Wei-Ping
    Li, Wen-Xin
    Sun, Jin-Chuan
    Cao, Peng-Xia
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (01) : 73 - 84
  • [33] Saliency Detection via Manifold Ranking Based on Robust Foreground
    Wei-Ping Ma
    Wen-Xin Li
    Jin-Chuan Sun
    Peng-Xia Cao
    International Journal of Automation and Computing, 2021, 18 : 73 - 84
  • [34] SCENE CLASSIFICATION OF HIGH RESOLUTION REMOTE SENSING IMAGES VIA SELF-PACED DEEP LEARNING
    Yao, Xiwen
    Yang, Liuqing
    Cheng, Gong
    Han, Junwei
    Guo, Lei
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 521 - 524
  • [35] Superpixel-based video saliency detection via the fusion of spatiotemporal saliency and temporal coherency
    Li, Yandi
    Xu, Xiping
    Zhang, Ning
    Du, Enyu
    OPTICAL ENGINEERING, 2019, 58 (08)
  • [36] Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement
    Zhou, Xiaofei
    Liu, Zhi
    Gong, Chen
    Liu, Wei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (11) : 2993 - 3007
  • [37] Video saliency detection via combining temporal difference and pixel gradient
    Xiangwei Lu
    Muwei Jian
    Rui Wang
    Xiangyu Liu
    Peiguang Lin
    Hui Yu
    Multimedia Tools and Applications, 2024, 83 : 37589 - 37602
  • [38] Robust online tracking via adaptive samples selection with saliency detection
    Jia Yan
    Xi Chen
    Qiu Ping Zhu
    EURASIP Journal on Advances in Signal Processing, 2013
  • [39] A Training Strategy of Flying Bird Object Detection Model Based on Improved Self-Paced Learning Algorithm
    Sun, Ziwei
    Hua, Zexi
    Li, Hengchao
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 444 - 450
  • [40] Robust online tracking via adaptive samples selection with saliency detection
    Yan, Jia
    Chen, Xi
    Zhu, QiuPing
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,