Video saliency detection via combining temporal difference and pixel gradient

被引:1
作者
Lu, Xiangwei [1 ]
Jian, Muwei [1 ,2 ]
Wang, Rui [1 ]
Liu, Xiangyu [1 ]
Lin, Peiguang [1 ]
Yu, Hui [3 ]
机构
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan, Peoples R China
[2] Linyi Univ, Sch Informat Sci & Engn, Linyi, Peoples R China
[3] Univ Portsmouth, Sch Creat Technol, Portsmouth, England
基金
中国国家自然科学基金;
关键词
Video saliency detection; Temporal difference; Pixels gradient; Edge refinement; Co-Attention; OPTIMIZATION;
D O I
10.1007/s11042-023-17128-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Even though temporal information matters for the quality of video saliency detection, many problems still arise/emerge in present network frameworks, such as bad performance in time-space coherence and edge continuity. In order to solve these problems, this paper proposes a full convolutional neural network, which integrates temporal differential and pixel gradient to fine tune the edges of salient targets. Considering the features of neighboring frames are highly relevant because of their proximity in location, a co-attention mechanism is used to put pixel-wise weight on the saliency probability map after features extraction with multi-scale pooling so that attention can be paid on both the edge and central of images. And the changes of pixel gradients of original images are used to recursively improve the continuity of target edges and details of central areas. In addition, residual networks are utilized to integrate information between modules, ensuring stable connections between the backbone network and modules and propagation of pixel gradient changes. In addition, a self-adjustment strategy for loss functions is presented to solve the problem of overfitting in experiments. The method presented in the paper has been tested with three available public datasets and its effectiveness has been proved after comparing with 6 other typically stat-of-the-art methods.
引用
收藏
页码:37589 / 37602
页数:14
相关论文
共 50 条
  • [41] Saliency detection via background and foregrond null space learning
    Zhang, Ying Ying
    Zhang, Shuo
    Zhang, Ping
    Zhang, XinGang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 70 : 271 - 281
  • [42] Robust saliency detection via corner information and an energy function
    Zhang, Hanling
    Xia, Chenxing
    Gao, Xiuju
    IET COMPUTER VISION, 2017, 11 (06) : 379 - 388
  • [43] Multi-Features Fusion Based on Boolean Map for Video Saliency Detection
    Wei, Longsheng
    Wang, Min
    Liu, Wei
    Wang, Xinmei
    Sun, Jiale
    Yin, Xu
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 7589 - 7594
  • [44] Compressed domain video saliency detection using global and local spatiotemporal features
    Lee, Se-Ho
    Kang, Je-Won
    Kim, Chang-Su
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 35 : 169 - 183
  • [45] A spatial-frequency-temporal domain based saliency model for low contrast video sequences
    Mu, Nan
    Xu, Xin
    Zhang, Xiaolong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 79 - 88
  • [46] Video Watermarking Approach Based on Temporal Difference And Discrete Wavelet Transform
    Pu, Dongbing
    Lu, Yinghua
    Dai, Jiangyan
    PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 346 - 350
  • [47] Salient object detection via reliable boundary seeds and saliency refinement
    Wu, Xiyin
    Ma, Xiaodi
    Zhang, Jinxia
    Jin, Zhong
    IET COMPUTER VISION, 2019, 13 (03) : 302 - 311
  • [48] Saliency Detection via Multi-view Synchronized Manifold Ranking
    Guan, Yuanyuan
    Jiang, Bo
    Zhang, Yuan
    Zheng, Aihua
    Sun, Dengdi
    Luo, Bin
    ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 473 - 483
  • [49] Small Object Detection via Pixel Level Balancing With Applications to Blood Cell Detection
    Hu, Bin
    Liu, Yang
    Chu, Pengzhi
    Tong, Minglei
    Kong, Qingjie
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [50] Saliency detection via a multi-layer graph based diffusion model
    Jiang, Bo
    He, Zhouqin
    Ding, Chris
    Luo, Bin
    NEUROCOMPUTING, 2018, 314 : 215 - 223