PixelPyramids: Exact Inference Models from Lossless Image Pyramids

被引:2
作者
Mahajan, Shweta [1 ]
Roth, Stefan [1 ,2 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[2] Hessian AI, Darmstadt, Germany
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
基金
欧洲研究理事会;
关键词
D O I
10.1109/ICCV48922.2021.00657
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autoregressive models are a class of exact inference approaches with highly flexible functional forms, yielding state-of-the-art density estimates for natural images. Yet, the sequential ordering on the dimensions makes these models computationally expensive and limits their applicability to low-resolution imagery. In this work, we propose PixelPyramids,(1) a block-autoregressive approach employing a lossless pyramid decomposition with scale-specific representations to encode the joint distribution of image pixels. Crucially, it affords a sparser dependency structure compared to fully autoregressive approaches. Our PixelPyramids yield state-of-the-art results for density estimation on various image datasets, especially for high-resolution data. For CelebA-HQ 1024 x 1024, we observe that the density estimates (in terms of bits/dim) are improved to similar to 44% of the baseline despite sampling speeds superior even to easily parallelizable flow-based models.
引用
收藏
页码:6619 / 6628
页数:10
相关论文
共 50 条
[1]  
[Anonymous], 2017, ICML
[2]  
[Anonymous], 2017, NIPS
[3]  
[Anonymous], NeurIPS
[4]  
[Anonymous], 2016, P INT C LEARN REPR I
[5]  
[Anonymous], 2019, ICML
[6]  
[Anonymous], 2018, ICML
[7]   Long short-term memory [J].
Hochreiter, S ;
Schmidhuber, J .
NEURAL COMPUTATION, 1997, 9 (08) :1735-1780
[8]  
[Anonymous], 2018, ICML
[9]  
[Anonymous], 2018, ICML
[10]  
[Anonymous], 2016, ICML