Video saliency detection via combining temporal difference and pixel gradient

被引:0
作者
Lu, Xiangwei [1 ]
Jian, Muwei [1 ,2 ]
Wang, Rui [1 ]
Liu, Xiangyu [1 ]
Lin, Peiguang [1 ]
Yu, Hui [3 ]
机构
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan, Peoples R China
[2] Linyi Univ, Sch Informat Sci & Engn, Linyi, Peoples R China
[3] Univ Portsmouth, Sch Creat Technol, Portsmouth, England
基金
中国国家自然科学基金;
关键词
Video saliency detection; Temporal difference; Pixels gradient; Edge refinement; Co-Attention; OPTIMIZATION;
D O I
10.1007/s11042-023-17128-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Even though temporal information matters for the quality of video saliency detection, many problems still arise/emerge in present network frameworks, such as bad performance in time-space coherence and edge continuity. In order to solve these problems, this paper proposes a full convolutional neural network, which integrates temporal differential and pixel gradient to fine tune the edges of salient targets. Considering the features of neighboring frames are highly relevant because of their proximity in location, a co-attention mechanism is used to put pixel-wise weight on the saliency probability map after features extraction with multi-scale pooling so that attention can be paid on both the edge and central of images. And the changes of pixel gradients of original images are used to recursively improve the continuity of target edges and details of central areas. In addition, residual networks are utilized to integrate information between modules, ensuring stable connections between the backbone network and modules and propagation of pixel gradient changes. In addition, a self-adjustment strategy for loss functions is presented to solve the problem of overfitting in experiments. The method presented in the paper has been tested with three available public datasets and its effectiveness has been proved after comparing with 6 other typically stat-of-the-art methods.
引用
收藏
页码:37589 / 37602
页数:14
相关论文
共 50 条
  • [31] A Unified Video Semantics Extraction and Noise Object Suppression Network for Video Saliency Detection
    Tan, Zhenshan
    Gu, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 337 - 348
  • [32] Saliency Detection via Bidirectional Absorbing Markov Chain
    Jiang, Fengling
    Kong, Bin
    Adeel, Ahsan
    Xiao, Yun
    Hussain, Amir
    ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 495 - 505
  • [33] MTDAN: A Lightweight Multi-Scale Temporal Difference Attention Networks for Automated Video Depression Detection
    Zhang, Shiqing
    Zhang, Xingnan
    Zhao, Xiaoming
    Fang, Jiangxiong
    Niu, Mingyue
    Zhao, Ziping
    Yu, Jun
    Tian, Qi
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1078 - 1089
  • [34] Video saliency detection using 3D shearlet transform
    Bao, Lei
    Zhang, Xiongwei
    Zheng, Yunfei
    Li, Yang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (13) : 7761 - 7778
  • [35] Video saliency detection using 3D shearlet transform
    Lei Bao
    Xiongwei Zhang
    Yunfei Zheng
    Yang Li
    Multimedia Tools and Applications, 2016, 75 : 7761 - 7778
  • [36] FBR-CNN: A FEEDBACK RECURRENT NETWORK FOR VIDEO SALIENCY DETECTION
    Ding, Guanqun
    Imamoglu, Nevrez
    Caglayan, Ali
    Murakawa, Masahiro
    Nakamura, Ryosuke
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [37] TDSNet: A temporal difference based network for video semantic segmentation
    Yuan, Haochen
    Peng, Junjie
    Cai, Zesu
    INFORMATION SCIENCES, 2025, 686
  • [38] Modified temporal difference method for change detection
    Chang, CC
    Chia, TL
    Yang, CK
    OPTICAL ENGINEERING, 2005, 44 (02) : 1 - 10
  • [39] Moving object detection via segmentation and saliency constrained RPCA
    Li, Yang
    Liu, Guangcan
    Liu, Qingshan
    Sun, Yubao
    Chen, Shengyong
    NEUROCOMPUTING, 2019, 323 : 352 - 362
  • [40] Robust saliency detection via corner information and an energy function
    Zhang, Hanling
    Xia, Chenxing
    Gao, Xiuju
    IET COMPUTER VISION, 2017, 11 (06) : 379 - 388