Accurate and Robust Video Saliency Detection via Self-Paced Diffusion

被引：43

作者：

Li, Yunxiao ^{[1
]}

Li, Shuai ^{[1
]}

Chen, Chenglizhao ^{[1
,2
]}

Hao, Aimin ^{[1
]}

Qin, Hong ^{[3
]}

机构：

[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Qingdao Univ, Qingdao 266071, Peoples R China

[3] SUNY Stony Brook, Stony Brook, NY 11794 USA

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2020年 / 22卷 / 05期

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Saliency detection; Proposals; Video sequences; Spatial coherence; Computational modeling; Optical imaging; Optical sensors; Video saliency detection; long-term saliency revealing; key frame strategy; self-paced saliency diffusion; OBJECT DETECTION; SEGMENTATION; OPTIMIZATION; FUSION;

D O I：

10.1109/TMM.2019.2940851

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Conventional video saliency detection methods frequently follow the common bottom-up thread to estimate video saliency within the short-term fashion. As a result, such methods can not avoid the obstinate accumulation of errors when the collected low-level clues are constantly ill-detected. Also, being noticed that a portion of video frames, which are not nearby the current video frame over the time axis, may potentially benefit the saliency detection in the current video frame. Thus, we propose to solve the aforementioned problem using our newly-designed key frame strategy (KFS), whose core rationale is to utilize both the spatial-temporal coherency of the salient foregrounds and the objectness prior (i.e., how likely it is for an object proposal to contain an object of any class) to reveal the valuable long-term information. We could utilize all this newly-revealed long-term information to guide our subsequent "self-paced" saliency diffusion, which enables each key frame itself to determine its diffusion range and diffusion strength to correct those ill-detected video frames. At the algorithmic level, we first divide a video sequence into short-term frame batches, and the object proposals are obtained in a frame-wise manner. Then, for each object proposal, we utilize a pre-trained deep saliency model to obtain high-dimensional features in order to represent the spatial contrast. Since the contrast computation within multiple neighbored video frames (i.e., the non-local manner) is relatively insensitive to the appearance variation, those object proposals with high-quality low-level saliency estimation frequently exhibit strong similarity over the temporal scale. Next, the long-term common consistency (e.g., appearance models/movement patterns) of the salient foregrounds could be explicitly revealed via similarity analysis accordingly. We further boost the detection accuracy via long-term information guided saliency diffusion in a self-paced manner. We have conducted extensive experiments to compare our method with 16 state-of-the-art methods over 4 largest public available benchmarks, and all results demonstrate the superiority of our method in terms of both accuracy and robustness.

引用

页码：1153 / 1167

页数：15

共 50 条

[41] Self-Adaptively Weighted Co-Saliency Detection via Rank Constraint
Cao, Xiaochun
Tao, Zhiqiang
Zhang, Bao
Fu, Huazhu
Feng, Wei
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 4175 - 4186
[42] Video saliency detection via bagging-based prediction and spatiotemporal propagation
Zhou, Xiaofei
Liu, Zhi
Li, Kai
Sun, Guangling
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 51 : 131 - 143
[43] Sea Ice Change Detection from Synthetic Aperture Radar Images Based on Self-Paced Boosting Learning
Wang, Qun
Gao, Feng
Dong, Junyu
Wang, Shengke
2018 FIFTH INTERNATIONAL WORKSHOP ON EARTH OBSERVATION AND REMOTE SENSING APPLICATIONS (EORSA), 2018, : 24 - 27
[44] A robust visual tracking method via local feature extraction and saliency detection
Wang, Yong
Wei, Xian
Ding, Lu
Tang, Xiaoliang
Zhang, Huanlong
VISUAL COMPUTER, 2020, 36 (04) : 683 - 700
[45] A robust visual tracking method via local feature extraction and saliency detection
Yong Wang
Xian Wei
Lu Ding
Xiaoliang Tang
Huanlong Zhang
The Visual Computer, 2020, 36 : 683 - 700
[46] Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking
Deng, Cheng
Yang, Xu
Nie, Feiping
Tao, Dapeng
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (04) : 885 - 896
[47] Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework
Dingwen Zhang
Junwei Han
Long Zhao
Deyu Meng
International Journal of Computer Vision, 2019, 127 : 363 - 380
[48] Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework
Zhang, Dingwen
Han, Junwei
Zhao, Long
Meng, Deyu
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (04) : 363 - 380
[49] Saliency detection via coarse-to-fine diffusion-based compactness with weighted learning affinity matrix
Wang, Fan
Peng, Guohua
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 78
[50] Multi-scale Self-searching Saliency Detection Combined with Rectangular Diffusion
Song, Tengfei
Liu, Zhengyi
PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 137 - 142

← 1 2 3 4 5 →