A fast and efficient saliency detection model in video compressed-domain for human fixations prediction

被引:11
作者
Li, Yongjun [1 ,2 ,3 ]
Li, Yunsong [1 ,2 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, 2 South Taibai St, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Telecommun Engn, Joint Lab High Speed Multisource Image Coding & P, 2 South Taibai St, Xian 710071, Peoples R China
[3] Henan Univ, Sch Phys & Elect, 1 Jinming St, Kaifeng 475004, Henan, Peoples R China
关键词
Compressed domain; Human fixations detection; Visual saliency; BOTTOM-UP; VISUAL-ATTENTION; TOP-DOWN; VISION; SEARCH;
D O I
10.1007/s11042-016-4118-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research and application of human fixations detection in video compressed-domain have gained an increasing attention in the latest years. However, both prediction accuracy and computational complexity still remain a challenge. This paper addresses the problem of compressed-domain video human fixations prediction based on saliency detection, and presents a fast and efficient algorithm based on Residual DCT Coefficients Norm (RDCN feature) and Operational Block Description Length (OBDL feature). These two features are directly extracted from the compressed bit-stream with partial decoding, and are normalized. After spatial and temporal filtering, the normalized salient maps are fused by the dynamic fusion coefficients with variation of quantization parameters. Then the fused salient map is worked by Gaussian model whose center is determined by the feature values. The proposed saliency detection model for human fixations prediction combines the accuracy of the pixel-domain saliency detections with the computational efficiency of their compressed-domain counterparts. The validation and comparison are made by several accuracy metrics on two ground truth datasets. Experimental results show that the proposed saliency detection model for human fixations prediction obtains superior performances over several state-of-the-art compressed-domain and pixel-domain algorithms on evaluation metrics. Computationally, our algorithm achieves a speed-up of over 10 times as compared to similar algorithms, which illustrates it appropriate for in-camera saliency estimation.
引用
收藏
页码:26273 / 26295
页数:23
相关论文
共 50 条
  • [41] Fast Salient Object Detection in Non-stationary Video Sequences Based on Spatial Saliency Maps
    Favorskaya, Margarita
    Buryachenko, Vladimir
    INTELLIGENT INTERACTIVE MULTIMEDIA SYSTEMS AND SERVICES 2016, 2016, 55 : 121 - 132
  • [42] Compressed-Domain Ship Detection on Spaceborne Optical Image Using Deep Neural Network and Extreme Learning Machine
    Tang, Jiexiong
    Deng, Chenwei
    Huang, Guang-Bin
    Zhao, Baojun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (03): : 1174 - 1185
  • [43] Video saliency detection via bagging-based prediction and spatiotemporal propagation
    Zhou, Xiaofei
    Liu, Zhi
    Li, Kai
    Sun, Guangling
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 51 : 131 - 143
  • [44] An Effective Video Saliency Detection Model Based on Human Visual Acuity and Spatiotemporal Cues in Cloud Systems
    Fang, Zhijun
    Zhang, Juan
    Wan, Wanggen
    Fang, Yuming
    JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (05): : 835 - 840
  • [45] Compressed-domain-based no-reference video quality assessment model considering fast motion and scene change
    Zhang, Hong
    Li, Fan
    Li, Na
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (07) : 9485 - 9502
  • [46] Compressed-domain-based no-reference video quality assessment model considering fast motion and scene change
    Hong Zhang
    Fan Li
    Na Li
    Multimedia Tools and Applications, 2017, 76 : 9485 - 9502
  • [47] An HEVC Compressed Domain Content-Based Video Signature For Copy Detection and Video Retrieval
    Tahboub, Khalid
    Gadgil, Neeraj J.
    Comer, Mary L.
    Delp, Edward J.
    IMAGING AND MULTIMEDIA ANALYTICS IN A WEB AND MOBILE WORLD 2014, 2014, 9027
  • [48] A Fast and Efficient Compressed Domain JPEG2000 Image Retrieval Method
    Zargari, Farzad
    Mosleh, Ali
    Ghanbari, Mohammad
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (04) : 1886 - 1893
  • [49] An Efficient Saliency Detection Model Based on Wavelet Generalized Lifting
    Zhong, Xin
    Shih, Frank Y.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (02)
  • [50] 3DSAL: An Efficient 3D-CNN Architecture for Video Saliency Prediction
    Djilali, Yasser Abdelaziz Dahou
    Sayah, Mohamed
    McGuinness, Kevin
    O'Connor, Noel E.
    VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, : 27 - 36