Video saliency detection via combining temporal difference and pixel gradient

被引：0

作者：

Lu, Xiangwei ^{[1
]}

Jian, Muwei ^{[1
,2
]}

Wang, Rui ^{[1
]}

Liu, Xiangyu ^{[1
]}

Lin, Peiguang ^{[1
]}

Yu, Hui ^{[3
]}

机构：

[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan, Peoples R China

[2] Linyi Univ, Sch Informat Sci & Engn, Linyi, Peoples R China

[3] Univ Portsmouth, Sch Creat Technol, Portsmouth, England

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 13期

基金：

中国国家自然科学基金;

关键词：

Video saliency detection; Temporal difference; Pixels gradient; Edge refinement; Co-Attention; OPTIMIZATION;

D O I：

10.1007/s11042-023-17128-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Even though temporal information matters for the quality of video saliency detection, many problems still arise/emerge in present network frameworks, such as bad performance in time-space coherence and edge continuity. In order to solve these problems, this paper proposes a full convolutional neural network, which integrates temporal differential and pixel gradient to fine tune the edges of salient targets. Considering the features of neighboring frames are highly relevant because of their proximity in location, a co-attention mechanism is used to put pixel-wise weight on the saliency probability map after features extraction with multi-scale pooling so that attention can be paid on both the edge and central of images. And the changes of pixel gradients of original images are used to recursively improve the continuity of target edges and details of central areas. In addition, residual networks are utilized to integrate information between modules, ensuring stable connections between the backbone network and modules and propagation of pixel gradient changes. In addition, a self-adjustment strategy for loss functions is presented to solve the problem of overfitting in experiments. The method presented in the paper has been tested with three available public datasets and its effectiveness has been proved after comparing with 6 other typically stat-of-the-art methods.

引用

页码：37589 / 37602

页数：14

共 50 条

[31] A Unified Video Semantics Extraction and Noise Object Suppression Network for Video Saliency Detection
Tan, Zhenshan
Gu, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 337 - 348
[32] Saliency Detection via Bidirectional Absorbing Markov Chain
Jiang, Fengling
Kong, Bin
Adeel, Ahsan
Xiao, Yun
Hussain, Amir
ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 495 - 505
[33] MTDAN: A Lightweight Multi-Scale Temporal Difference Attention Networks for Automated Video Depression Detection
Zhang, Shiqing
Zhang, Xingnan
Zhao, Xiaoming
Fang, Jiangxiong
Niu, Mingyue
Zhao, Ziping
Yu, Jun
Tian, Qi
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1078 - 1089
[34] Video saliency detection using 3D shearlet transform
Bao, Lei
Zhang, Xiongwei
Zheng, Yunfei
Li, Yang
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (13) : 7761 - 7778
[35] Video saliency detection using 3D shearlet transform
Lei Bao
Xiongwei Zhang
Yunfei Zheng
Yang Li
Multimedia Tools and Applications, 2016, 75 : 7761 - 7778
[36] FBR-CNN: A FEEDBACK RECURRENT NETWORK FOR VIDEO SALIENCY DETECTION
Ding, Guanqun
Imamoglu, Nevrez
Caglayan, Ali
Murakawa, Masahiro
Nakamura, Ryosuke
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[37] TDSNet: A temporal difference based network for video semantic segmentation
Yuan, Haochen
Peng, Junjie
Cai, Zesu
INFORMATION SCIENCES, 2025, 686
[38] Modified temporal difference method for change detection
Chang, CC
Chia, TL
Yang, CK
OPTICAL ENGINEERING, 2005, 44 (02) : 1 - 10
[39] Moving object detection via segmentation and saliency constrained RPCA
Li, Yang
Liu, Guangcan
Liu, Qingshan
Sun, Yubao
Chen, Shengyong
NEUROCOMPUTING, 2019, 323 : 352 - 362
[40] Robust saliency detection via corner information and an energy function
Zhang, Hanling
Xia, Chenxing
Gao, Xiuju
IET COMPUTER VISION, 2017, 11 (06) : 379 - 388

← 1 2 3 4 5 →