Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network

被引:8
作者
Fan, Xiaoting [1 ]
Lei, Jianjun [1 ]
Liang, Jie [2 ]
Fang, Yuming [3 ]
Ling, Nam [4 ]
Huang, Qingming [5 ,6 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC V5A 1S6, Canada
[3] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330032, Jiangxi, Peoples R China
[4] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA
[5] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[6] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Stereo image processing; Three-dimensional displays; Two dimensional displays; Feature extraction; Distortion; Visualization; Shape; Stereoscopic image; image retargeting; cross-attention; disparity consistency; VIDEO;
D O I
10.1109/TCSVT.2021.3054062
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Stereoscopic image retargeting aims at converting stereoscopic images to the target resolution adaptively. Different from 2D image retargeting, stereoscopic image retargeting needs to preserve both the shape structure of salient objects and depth consistency of 3D scenes. In this paper, we present a stereoscopic image retargeting method based on deep convolutional neural network to obtain high-quality retargeted images with both object shape preservation and scene depth preservation. First, a cross-attention extraction mechanism is constructed to generate attention map, which contains the valuable attention features of the left and right images and the common attention features between them. Second, since the disparity map can provide accurate depth information of objects in 3D scenes, a disparity-assisted 3D significance map generation module is utilized to further preserve the valuable depth information of stereoscopic images. Finally, in order to predict the retargeted stereoscopic images accurately, an image consistency loss is developed to preserve the geometric structure of salient objects, and a disparity consistency loss is introduced to eliminate depth distortions. Experimental results demonstrate that the proposed deep convolutional neural network can provide favorable stereoscopic image retargeting results.
引用
收藏
页码:4759 / 4770
页数:12
相关论文
共 55 条
  • [1] Seam carving for content-aware image resizing
    Avidan, Shai
    Shamir, Ariel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03):
  • [2] Bare B, 2015, IEEE INT CON MULTI
  • [3] Chang CH, 2012, PROC CVPR IEEE, P1075, DOI 10.1109/CVPR.2012.6247786
  • [4] Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images
    Chang, Che-Han
    Liang, Chia-Kai
    Chuang, Yung-Yu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 589 - 601
  • [5] Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
    Cho, Donghyeon
    Park, Jinsun
    Oh, Tae-Hyun
    Tai, Yu-Wing
    Kweon, In So
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4568 - 4577
  • [6] Sparse Seam-Carving for Structure Preserving Image Retargeting
    Choi, Jiwon
    Kim, Changick
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 85 (02): : 275 - 283
  • [7] Review of Visual Saliency Detection With Comprehensive Information
    Cong, Runmin
    Lei, Jianjun
    Fu, Huazhu
    Cheng, Ming-Ming
    Lin, Weisi
    Huang, Qingming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2941 - 2959
  • [8] Stereo Seam Carving a Geometrically Consistent Approach
    Dekel , Tali
    Moses, Yael
    Avidan, Shai
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (10) : 2513 - 2525
  • [9] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [10] Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending
    Fan, Xiaoting
    Lei, Jianjun
    Fang, Yuming
    Huang, Qingming
    Ling, Nam
    Hou, Chunping
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (03) : 655 - 665