A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization

Cited by: 3
Authors
Wang, Jinqiao [1]
Xu, Min [2]
He, Xiangjian [2]
Lu, Hanqing [1]
Hoang, Doan [2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China
[2] Univ Technol Sydney, Sch Comp & Commun, Sydney, NSW 2007, Australia
Funding
National Natural Science Foundation of China;
Keywords
Video retargeting; Visual attention; Visual concept; Spatial-temporal importance; 3D grid optimization;
DOI
10.1016/j.sigpro.2013.06.007
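Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Subject classification codes
0808; 0809;
Abstract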
Ubiquitous video access is now in high demand for online video applications. One major challenge is that video services must adapt to different device capabilities. Pervasive multimedia devices require video retargeting that is both accurate and comfortable for users, because how accurately users can see their preferred content directly affects their viewing comfort. User preferences for video content also differ across video domains. In this paper, we present a hybrid video retargeting framework with domain-enhanced spatial-temporal grid optimization. First, we parse videos from low-level features up to high-level visual concepts and combine these with visual attention to obtain an accurate importance description. Second, we build a semantic importance map that represents spatial importance and temporal continuity, and incorporate it into a 3D rectilinear grid scaleplate that maps frames to the target display, thereby preserving the aspect ratio of semantically salient objects as well as perceptual coherency. Extensive evaluations are conducted on five typical video genres: sports, advertisements, lectures, news, and surveillance. Comparisons with state-of-the-art approaches on both images and videos demonstrate the advantages of the proposed approach. (C) 2013 Elsevier B.V. All rights reserved.
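The abstract gives no equations, so the following is only a rough illustration of the general idea behind importance-driven rectilinear grid retargeting, not the paper's formulation. It is a one-dimensional simplification under stated assumptions: each grid column starts with unit width, has a single importance weight, and the new widths minimise an importance-weighted deviation from the original scale while summing to the target width. The function name `retarget_column_widths`, the `min_scale` floor, and the quadratic cost are all hypothetical choices made for this sketch.

```python
def retarget_column_widths(importance, target_width, min_scale=0.1):
    """Return new widths for grid columns that each start with width 1.

    Assumed cost (not taken from the paper): minimise
        sum_i w_i * (s_i - 1)^2
    subject to sum_i s_i == target_width and s_i >= min_scale,
    where w_i is the importance of column i and s_i its new width.
    """
    n = len(importance)
    if not (n * min_scale <= target_width <= n):
        raise ValueError("target_width must lie in [n * min_scale, n]")

    # Floor the weights so that 1 / w_i is always defined.
    weights = [max(w, 1e-6) for w in importance]
    pinned = set()  # columns clamped to min_scale

    while True:
        free = [i for i in range(n) if i not in pinned]
        # Total shrinkage that the free columns still have to absorb.
        deficit = len(free) - (target_width - min_scale * len(pinned))
        inv_sum = sum(1.0 / weights[i] for i in free)
        lam = deficit / inv_sum  # Lagrange multiplier of the sum constraint

        scales = [min_scale] * n
        for i in free:
            # Closed-form optimum: low-importance columns shrink the most.
            scales[i] = 1.0 - lam / weights[i]

        # Clamp any column that fell below the floor, then re-solve the rest.
        too_small = [i for i in free if scales[i] < min_scale - 1e-12]
        if not too_small:
            return scales
        pinned.update(too_small)


if __name__ == "__main__":
    # Two salient middle columns, two unimportant border columns.
    widths = retarget_column_widths([0.1, 1.0, 1.0, 0.1], target_width=3.0)
    print([round(w, 3) for w in widths])
```

In this toy setting the salient middle columns keep a width close to 1 while the border columns absorb most of the shrinkage. A full system along the lines the abstract describes would optimise rows, columns, and the temporal axis jointly.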
Pages: 33 - 47
Number of pages: 15