Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network

Cited by: 8

Authors:

Fan, Xiaoting [1]

Lei, Jianjun [1]

Liang, Jie [2]

Fang, Yuming [3]

Ling, Nam [4]

Huang, Qingming [5,6]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC V5A 1S6, Canada

[3] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Jiangxi, Peoples R China

[4] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA

[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China

[6] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2021 / Vol. 31 / No. 12

Funding:

National Natural Science Foundation of China;

Keywords:

Stereo image processing; Three-dimensional displays; Two dimensional displays; Feature extraction; Distortion; Visualization; Shape; Stereoscopic image; image retargeting; cross-attention; disparity consistency; VIDEO;

DOI:

10.1109/TCSVT.2021.3054062

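Chinese Library Classification (CLC) code:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code: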

0808 ; 0809 ;

Abstract:

Stereoscopic image retargeting aims at adaptively converting stereoscopic images to a target resolution. Unlike 2D image retargeting, stereoscopic image retargeting needs to preserve both the shape structure of salient objects and the depth consistency of 3D scenes. In this paper, we present a stereoscopic image retargeting method based on a deep convolutional neural network to obtain high-quality retargeted images that preserve both object shape and scene depth. First, a cross-attention extraction mechanism is constructed to generate an attention map, which contains the valuable attention features of the left and right images and the common attention features between them. Second, since the disparity map can provide accurate depth information about objects in 3D scenes, a disparity-assisted 3D significance map generation module is utilized to further preserve the valuable depth information of stereoscopic images. Finally, in order to predict the retargeted stereoscopic images accurately, an image consistency loss is developed to preserve the geometric structure of salient objects, and a disparity consistency loss is introduced to eliminate depth distortions. Experimental results demonstrate that the proposed deep convolutional neural network can provide favorable stereoscopic image retargeting results.

References

Pages: 4759 - 4770

Page count: 12

55 entries in total

[1] Seam carving for content-aware image resizing
Avidan, Shai
Shamir, Ariel
[J]. ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03):
[2] Bare B, 2015, IEEE INT CON MULTI
[3] Chang CH, 2012, PROC CVPR IEEE, P1075, DOI 10.1109/CVPR.2012.6247786
[4] Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images
Chang, Che-Han
Liang, Chia-Kai
Chuang, Yung-Yu
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 589 - 601
[5] Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
Cho, Donghyeon
Park, Jinsun
Oh, Tae-Hyun
Tai, Yu-Wing
Kweon, In So
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4568 - 4577
[6] Sparse Seam-Carving for Structure Preserving Image Retargeting
Choi, Jiwon
Kim, Changick
[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 85 (02) : 275 - 283
[7] Review of Visual Saliency Detection With Comprehensive Information
Cong, Runmin
Lei, Jianjun
Fu, Huazhu
Cheng, Ming-Ming
Lin, Weisi
Huang, Qingming
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2941 - 2959
[8] Stereo Seam Carving a Geometrically Consistent Approach
Dekel, Tali
Moses, Yael
Avidan, Shai
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (10) : 2513 - 2525
[9] FlowNet: Learning Optical Flow with Convolutional Networks
Dosovitskiy, Alexey
Fischer, Philipp
Ilg, Eddy
Haeusser, Philip
Hazirbas, Caner
Golkov, Vladimir
van der Smagt, Patrick
Cremers, Daniel
Brox, Thomas
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
[10] Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending
Fan, Xiaoting
Lei, Jianjun
Fang, Yuming
Huang, Qingming
Ling, Nam
Hou, Chunping
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (03) : 655 - 665
