Video saliency detection via combining temporal difference and pixel gradient

被引：1

作者：

Lu, Xiangwei ^{[1
]}

Jian, Muwei ^{[1
,2
]}

Wang, Rui ^{[1
]}

Liu, Xiangyu ^{[1
]}

Lin, Peiguang ^{[1
]}

Yu, Hui ^{[3
]}

机构：

[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan, Peoples R China

[2] Linyi Univ, Sch Informat Sci & Engn, Linyi, Peoples R China

[3] Univ Portsmouth, Sch Creat Technol, Portsmouth, England

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 13期

基金：

中国国家自然科学基金;

关键词：

Video saliency detection; Temporal difference; Pixels gradient; Edge refinement; Co-Attention; OPTIMIZATION;

D O I：

10.1007/s11042-023-17128-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Even though temporal information matters for the quality of video saliency detection, many problems still arise/emerge in present network frameworks, such as bad performance in time-space coherence and edge continuity. In order to solve these problems, this paper proposes a full convolutional neural network, which integrates temporal differential and pixel gradient to fine tune the edges of salient targets. Considering the features of neighboring frames are highly relevant because of their proximity in location, a co-attention mechanism is used to put pixel-wise weight on the saliency probability map after features extraction with multi-scale pooling so that attention can be paid on both the edge and central of images. And the changes of pixel gradients of original images are used to recursively improve the continuity of target edges and details of central areas. In addition, residual networks are utilized to integrate information between modules, ensuring stable connections between the backbone network and modules and propagation of pixel gradient changes. In addition, a self-adjustment strategy for loss functions is presented to solve the problem of overfitting in experiments. The method presented in the paper has been tested with three available public datasets and its effectiveness has been proved after comparing with 6 other typically stat-of-the-art methods.

引用

页码：37589 / 37602

页数：14

共 50 条

[41] Saliency detection via background and foregrond null space learning
Zhang, Ying Ying
Zhang, Shuo
Zhang, Ping
Zhang, XinGang
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 70 : 271 - 281
[42] Robust saliency detection via corner information and an energy function
Zhang, Hanling
Xia, Chenxing
Gao, Xiuju
IET COMPUTER VISION, 2017, 11 (06) : 379 - 388
[43] Multi-Features Fusion Based on Boolean Map for Video Saliency Detection
Wei, Longsheng
Wang, Min
Liu, Wei
Wang, Xinmei
Sun, Jiale
Yin, Xu
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 7589 - 7594
[44] Compressed domain video saliency detection using global and local spatiotemporal features
Lee, Se-Ho
Kang, Je-Won
Kim, Chang-Su
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 35 : 169 - 183
[45] A spatial-frequency-temporal domain based saliency model for low contrast video sequences
Mu, Nan
Xu, Xin
Zhang, Xiaolong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 79 - 88
[46] Video Watermarking Approach Based on Temporal Difference And Discrete Wavelet Transform
Pu, Dongbing
Lu, Yinghua
Dai, Jiangyan
PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 346 - 350
[47] Salient object detection via reliable boundary seeds and saliency refinement
Wu, Xiyin
Ma, Xiaodi
Zhang, Jinxia
Jin, Zhong
IET COMPUTER VISION, 2019, 13 (03) : 302 - 311
[48] Saliency Detection via Multi-view Synchronized Manifold Ranking
Guan, Yuanyuan
Jiang, Bo
Zhang, Yuan
Zheng, Aihua
Sun, Dengdi
Luo, Bin
ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 473 - 483
[49] Small Object Detection via Pixel Level Balancing With Applications to Blood Cell Detection
Hu, Bin
Liu, Yang
Chu, Pengzhi
Tong, Minglei
Kong, Qingjie
FRONTIERS IN PHYSIOLOGY, 2022, 13
[50] Saliency detection via a multi-layer graph based diffusion model
Jiang, Bo
He, Zhouqin
Ding, Chris
Luo, Bin
NEUROCOMPUTING, 2018, 314 : 215 - 223

← 1 2 3 4 5 →