A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization

被引：3

作者：

Wang, Jinqiao ^{[1
]}

Xu, Min ^{[2
]}

He, Xiangjian ^{[2
]}

Lu, Hanqing ^{[1
]}

Hoang, Doan ^{[2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China

[2] Univ Technol Sydney, Sch Comp & Commun, Sydney, NSW 2007, Australia

来源：

SIGNAL PROCESSING | 2014年 / 94卷

基金：

中国国家自然科学基金;

关键词：

Video retargeting; Visual attention; Visual concept; Spatial-temporal importance; 3D grid optimization;

D O I：

10.1016/j.sigpro.2013.06.007

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Recently, a ubiquitous video access is highly demanded for online video applications. One big challenge is that video service needs to adapt different device capabilities. Pervasive multimedia devices require an accurate and user comfort video retargeting. Letting users see their preferred content accurately directly affects their comforts. User preferences on video contents are different in various video domains. In this paper, we present a hybrid framework of video retargeting with a domain enhanced spatial-temporal grid optimization. First, we parse videos from low-level features to high-level visual concepts, combining with visual attention for an accurate importance description. Second, a semantic importance map is built up representing the spatial importance and temporal continuity, which is incorporated with a 3D rectilinear grid scaleplate to map frames to a target display, thereby keeping the aspect ratio of semantically salient objects as well as the perceptual coherency. Extensive evaluations are made on five typical video genres, i.e. sports, advertisements, lecture, news and surveillance. The comparison with the state-of-the-art approaches on both images and videos have demonstrated the advantages of the proposed approach. (C) 2013 Elsevier B.V. All rights reserved.

引用

页码：33 / 47

页数：15

共 50 条

[31] Human action recognition based on 3D body mask and depth spatial-temporal maps
Xing Li
Zhenjie Hou
Jiuzhen Liang
Chen Chen
Multimedia Tools and Applications, 2020, 79 : 35761 - 35778
[32] Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks
Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2272 - 2281
[33] U-shape Spatial-Temporal Prediction Network based on 3D Convolution and BDLSTM
Peng, Ge
Shi, Chunchao
Zhong, Yujing
Ai, Xinyu
2024 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE, SEAI 2024, 2024, : 257 - 261
[34] Human action recognition based on 3D body mask and depth spatial-temporal maps
Li, Xing
Hou, Zhenjie
Liang, Jiuzhen
Chen, Chen
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35761 - 35778
[35] A multiple frequency bands parallel spatial-temporal 3D deep residual learning framework for EEG-based emotion recognition
Miao, Minmin
Zheng, Longxin
Xu, Baoguo
Yang, Zhong
Hu, Wenjun
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
[36] Spatial-temporal correlations in the speckle pattern for the characterization of cellular motion within a 3D object
Weil, Yonni
Shafran, Yana
Sobolev, Maria
Afrimzon, Lena
Zurgil, Naomi
Deutsch, Motti
Schiffer, Zeev
BIOMEDICAL OPTICS EXPRESS, 2023, 14 (05) : 1974 - 1991
[37] Hierarchical Spatial-Temporal Adaptive Graph Fusion for Monocular 3D Human Pose Estimation
Zhang, Lijun
Lu, Feng
Zhou, Kangkang
Zhou, Xiang-Dong
Shi, Yu
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 61 - 65
[38] SSRL: Self-Supervised Spatial-Temporal Representation Learning for 3D Action Recognition
Jin, Zhihao
Wang, Yifan
Wang, Qicong
Shen, Yehu
Meng, Hongying
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 274 - 285
[39] U-shaped spatial-temporal transformer network for 3D human pose estimation
Yang, Honghong
Guo, Longfei
Zhang, Yumei
Wu, Xiaojun
MACHINE VISION AND APPLICATIONS, 2022, 33 (06)
[40] 3D face imaging with the spatial-temporal correlation method using a rotary speckle projector
Zhou, Pei
Zhu, Jiangping
Xiong, Wei
Zhang, Jianwei
APPLIED OPTICS, 2021, 60 (20) : 5925 - 5935

← 1 2 3 4 5 →