A fast and efficient saliency detection model in video compressed-domain for human fixations prediction

Cited by: 11
Authors
Li, Yongjun [1 ,2 ,3 ]
Li, Yunsong [1 ,2 ]
Affiliations
[1] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, 2 South Taibai St, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Telecommun Engn, Joint Lab High Speed Multisource Image Coding & P, 2 South Taibai St, Xian 710071, Peoples R China
[3] Henan Univ, Sch Phys & Elect, 1 Jinming St, Kaifeng 475004, Henan, Peoples R China
Keywords
Compressed domain; Human fixations detection; Visual saliency; BOTTOM-UP; VISUAL-ATTENTION; TOP-DOWN; VISION; SEARCH;
DOI
10.1007/s11042-016-4118-3
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Research on and applications of human fixation detection in the video compressed domain have gained increasing attention in recent years. However, both prediction accuracy and computational complexity remain challenging. This paper addresses compressed-domain prediction of human fixations in video based on saliency detection, and presents a fast and efficient algorithm built on two features: the Residual DCT Coefficients Norm (RDCN) and the Operational Block Description Length (OBDL). Both features are extracted directly from the compressed bit-stream with only partial decoding, and are then normalized. After spatial and temporal filtering, the normalized saliency maps are fused using dynamic fusion coefficients that vary with the quantization parameter. The fused saliency map is then processed by a Gaussian model whose center is determined by the feature values. The proposed saliency detection model for human fixation prediction combines the accuracy of pixel-domain saliency detection with the computational efficiency of its compressed-domain counterparts. Validation and comparison are carried out with several accuracy metrics on two ground-truth datasets. Experimental results show that the proposed model outperforms several state-of-the-art compressed-domain and pixel-domain algorithms on the evaluation metrics. Computationally, the algorithm achieves a speed-up of over 10 times compared with similar algorithms, which makes it suitable for in-camera saliency estimation.
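The fusion and Gaussian steps summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the linear QP-to-weight mapping (`fusion_weight`), the choice of the saliency centroid as the Gaussian center, the `sigma` value, and all function names are assumptions made purely for illustration. The inputs `rdcn` and `obdl` stand for per-block feature maps that are assumed to have been extracted and filtered already.

```python
import numpy as np

def normalize(m):
    """Min-max scale a map to [0, 1]; a constant map becomes all zeros."""
    m = np.asarray(m, dtype=float)
    span = m.max() - m.min()
    return (m - m.min()) / span if span > 0 else np.zeros_like(m)

def fusion_weight(qp, qp_min=0, qp_max=51):
    """Hypothetical QP-dependent weight in [0, 1] (assumed linear in QP)."""
    return (np.clip(qp, qp_min, qp_max) - qp_min) / (qp_max - qp_min)

def fuse(rdcn, obdl, qp, sigma=0.25):
    """Fuse two feature maps, then apply a Gaussian centered on the
    saliency centroid (an assumed way to let feature values set the center)."""
    a = fusion_weight(qp)
    fused = a * normalize(rdcn) + (1 - a) * normalize(obdl)
    h, w = fused.shape
    ys, xs = np.mgrid[0:h, 0:w]
    total = fused.sum()
    if total > 0:
        cy = (ys * fused).sum() / total
        cx = (xs * fused).sum() / total
    else:
        # No salient response at all: fall back to the frame center.
        cy, cx = (h - 1) / 2, (w - 1) / 2
    # Squared distance to the center, scaled by the frame dimensions.
    d2 = ((ys - cy) / h) ** 2 + ((xs - cx) / w) ** 2
    return normalize(fused * np.exp(-d2 / (2 * sigma ** 2)))

# Usage: two toy 8x8 block-level maps with a single shared peak.
rdcn = np.zeros((8, 8)); rdcn[2, 3] = 5.0
obdl = np.zeros((8, 8)); obdl[2, 3] = 9.0
saliency = fuse(rdcn, obdl, qp=26)
```

The weight here simply shifts emphasis from one feature to the other as QP grows; which feature should dominate at which QP, and how the coefficients are actually computed, is specified in the paper rather than in the abstract.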
Pages: 26273 - 26295
Number of pages: 23
Related papers
50 in total
  • [31] Leveraging Human Fixations in Sparse Coding: Learning a Discriminative Dictionary for Saliency Prediction
    Jiang, Ming
    Song, Mingli
    Zhao, Qi
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2126 - 2133
  • [32] Graph convolutional network for fast video summarization in compressed domain
    Yeh, Chia-Hung
    Lien, Chih-Ming
    Zhan, Zhi-Xiang
    Tsai, Feng-Hsu
    Chen, Mei-Juan
    NEUROCOMPUTING, 2025, 617
  • [33] A fast and effective method for static video summarization on compressed domain
    Hernandez, A. C.
    Hernandez, M. C.
    Ugalde, F. G.
    Miyatake, M. N.
    Meana, H. P.
    IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (11) : 4554 - 4559
  • [34] Compressed-domain Video Synopsis via 3D Graph Cut and Blank Frame Deletion
    Liao, Wenjuan
    Tu, Zhigang
    Wang, Shizheng
    Li, Yongzhou
    Zhong, Rui
    Zhong, Huicai
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 253 - 261
  • [35] Ant Colony Optimization Inspired Saliency Detection Using Compressed Video Information
    Li, Cuiwei
    Tu, Qin
    Xu, Jun
    Gao, Ran
    Wang, Qiang
    Chang, Yongyu
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [36] Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction
    Bellitto, G.
    Proietto Salanitri, F.
    Palazzo, S.
    Rundo, F.
    Giordano, D.
    Spampinato, C.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (12) : 3216 - 3232
  • [37] Fast Moving-Object Detection in H.264/AVC Compressed Domain for Video Surveillance
    Tom, Manu
    Babu, R. Venkatesh
    2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [38] Compressed-Domain Shot Boundary Detection for H.264/AVC Using Intra Partitioning Maps
    De Bruyne, Sarah
    De Cock, Jan
    Poppe, Chris
    Hollemeersch, Charles-Frederik
    Lambert, Peter
    Van de Walle, Rik
    ADVANCES IN MULTIMEDIA MODELING, PT I, 2011, 6523 : 29 - 39
  • [39] Automatic video caption detection and extraction in the DCT compressed domain
    Tsao, CF
    Chen, YH
    Kuo, JH
    Lin, CW
    Wu, JL
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 : 895 - 907
  • [40] Compressed Domain Motion Analysis for Video Semantic Events Detection
    Tao, Kun
    Lin, Shouxun
    Zhang, Yongdong
    2009 WASE INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING, ICIE 2009, VOL I, 2009, : 201 - +