Deep Convolutional Inverse Graphics Network

被引:0
|
作者
Kulkarni, Tejas D. [1 ]
Whitney, William F. [1 ]
Kohli, Pushmeet [2 ]
Tenenbaum, Joshua B. [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Microsoft Res, Cambridge, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that aims to learn an interpretable representation of images, disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g. pose or light). Given a single input image, our model can generate new images of the same object with variations in pose and lighting. We present qualitative and quantitative tests of the model's efficacy at learning a 3D rendering engine for varied object classes including faces and chairs.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Deep Convolutional Generative Adversarial Network and Convolutional Neural Network for Smoke Detection
    Yin, Hang
    Wei, Yurong
    Liu, Hedan
    Liu, Shuangyin
    Liu, Chuanyun
    Gao, Yacui
    Liu, Shuangyin (hdlsyxlq@126.com), 1600, Hindawi Limited (2020):
  • [22] Inverse design of broadband metasurface absorber based on convolutional autoencoder network and inverse design network
    Ma, Ju
    Huang, Yijia
    Pu, Mingbo
    Xu, Dong
    Luo, Jun
    Guo, Yinghui
    Luo, Xiangang
    JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2020, 53 (46)
  • [23] Deep Learning-Enhanced Inverse Modeling of Terahertz Metasurface Based on a Convolutional Neural Network Technique
    Gao, Muzhi
    Jiang, Dawei
    Zhu, Gaoyang
    Wang, Bin
    PHOTONICS, 2024, 11 (05)
  • [24] Inverse design of electromagnetically induced transparency(EIT) metasurface based on deep convolutional generative adversarial network
    Zhu, Lei
    Zhang, Cong
    Dong, Liang
    Rong, Miao Xin
    Gong, Jin Yue
    Meng, Fan-Yi
    PHYSICA SCRIPTA, 2023, 98 (10)
  • [25] Industrial defective chips detection using deep convolutional neural network with inverse feature matching mechanism
    Ullah, Waseem
    Khan, Samee Ullah
    Kim, Min Je
    Hussain, Altaf
    Munsif, Muhammad
    Lee, Mi Young
    Seo, Daeho
    Baik, Sung Wook
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (03) : 326 - 336
  • [26] DEEP CONVOLUTIONAL NEURAL NETWORK-BASED INVERSE FILTERING APPROACH FOR SPEECH DE-REVERBERATION
    Chung, Hanwook
    Tomar, Vikrant Singh
    Champagne, Benoit
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [27] Deep convolutional network for urbansound classification
    Karthika, N.
    Janet, B.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2020, 45 (01):
  • [28] Deep convolutional network for urbansound classification
    N Karthika
    B Janet
    Sādhanā, 2020, 45
  • [29] Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training
    Liu, Sheng
    Li, Xiao
    Zhai, Yuexiang
    You, Chong
    Zhu, Zhihui
    Fernandez-Granda, Carlos
    Qu, Qing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [30] MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration
    Li, Shiyu
    Liu, Zehao
    Gao, Meijing
    Bai, Yang
    Yin, Haozheng
    VISUAL COMPUTER, 2025, 41 (03): : 1999 - 2010