A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization

被引:3
|
作者
Wang, Jinqiao [1 ]
Xu, Min [2 ]
He, Xiangjian [2 ]
Lu, Hanqing [1 ]
Hoang, Doan [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China
[2] Univ Technol Sydney, Sch Comp & Commun, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金;
关键词
Video retargeting; Visual attention; Visual concept; Spatial-temporal importance; 3D grid optimization;
D O I
10.1016/j.sigpro.2013.06.007
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, a ubiquitous video access is highly demanded for online video applications. One big challenge is that video service needs to adapt different device capabilities. Pervasive multimedia devices require an accurate and user comfort video retargeting. Letting users see their preferred content accurately directly affects their comforts. User preferences on video contents are different in various video domains. In this paper, we present a hybrid framework of video retargeting with a domain enhanced spatial-temporal grid optimization. First, we parse videos from low-level features to high-level visual concepts, combining with visual attention for an accurate importance description. Second, a semantic importance map is built up representing the spatial importance and temporal continuity, which is incorporated with a 3D rectilinear grid scaleplate to map frames to a target display, thereby keeping the aspect ratio of semantically salient objects as well as the perceptual coherency. Extensive evaluations are made on five typical video genres, i.e. sports, advertisements, lecture, news and surveillance. The comparison with the state-of-the-art approaches on both images and videos have demonstrated the advantages of the proposed approach. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:33 / 47
页数:15
相关论文
共 50 条
  • [41] Dynamical Facial Expression Recognition by Integrating 3D Spatial-Temporal Network and Static Network
    Liu, Wenlong
    Han, Shoudong
    Chen, Yang
    PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION SYSTEMS (ICCIS 2017), 2015, : 304 - 308
  • [42] Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation
    Sun, Jiadai
    Dai, Yuchao
    Zhang, Xianjing
    Xu, Jintao
    Ai, Rui
    Gu, Weihao
    Chen, Xieyuanli
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 11456 - 11463
  • [43] Fast spatial-temporal stereo matching for 3D face reconstruction under speckle pattern projection
    Fu, Keren
    Xie, Yijiang
    Jing, Hailong
    Zhu, Jiangping
    IMAGE AND VISION COMPUTING, 2019, 85 : 36 - 45
  • [44] Real-time Spatial-temporal Context Approach for 3D Object Detection using LiDAR
    Kumar, K. S. Chidanand
    Al-Stouhi, Samir
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS), 2020, : 432 - 439
  • [45] Semantic quantization of 3D human motion capture data through spatial-temporal feature extraction
    Jin, Yohan
    Prabhakaran, B.
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2008, 4903 : 318 - 328
  • [46] A Hybrid Method Using Temporal and Spatial Information for 3D Lidar Data Segmentation
    Tuncer, Mehmet Ali Cagri
    Schulz, Dirk
    ICINCO: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS - VOL 2, 2017, : 162 - 171
  • [47] Spatial-temporal 3D dependency matching with self-supervised deep learning for monocular visual sensing
    Song, Chengqun
    Niu, Maolong
    Liu, Zhaopeng
    Cheng, Jun
    Wang, Peng
    Li, Hongjian
    Hao, Luoying
    NEUROCOMPUTING, 2022, 481 : 11 - 21
  • [48] Toward volumetric urbanism: Analysing the spatial-temporal dynamics of 3D floor space use in the built environment
    Hsu, Yi-Ya
    Han, Hoon
    ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2024,
  • [49] A 3D Human Skeletonization Algorithm for a Single Monocular Camera Based on Spatial-Temporal Discrete Shadow Integration
    Hou, Jie
    Guo, Baolong
    He, Wangpeng
    Wu, Jinfu
    APPLIED SCIENCES-BASEL, 2017, 7 (07):
  • [50] Traffic Police 3D Gesture Recognition Based on Spatial-Temporal Fully Adaptive Graph Convolutional Network
    Fu, Zheng
    Chen, Junjie
    Jiang, Kun
    Wang, Sijia
    Wen, Junze
    Yang, Mengmeng
    Yang, Diange
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 9518 - 9531