Sequential Point Cloud Upsampling by Exploiting Multi-Scale Temporal Dependency

被引:15
作者
Wang, Kaisiyuan [1 ]
Sheng, Lu [2 ]
Gu, Shuhang [1 ]
Xu, Dong [1 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
[2] Beihang Univ, Sch Software, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Feature extraction; Shape; Task analysis; Superresolution; Estimation; Solid modeling; Point cloud upsampling; dynamic point cloud sequences; spatio-temporal exploration;
D O I
10.1109/TCSVT.2021.3104304
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work, we propose a new sequential point cloud upsampling method called SPU, which aims to upsample sparse, non-uniform, and orderless point cloud sequences by effectively exploiting rich and complementary temporal dependency from multiple inputs. Specifically, these inputs include a set of multi-scale short-term features from the 3D points in three consecutive frames (i.e., the previous/current/subsequent frame) and a long-term latent representation accumulated throughout the point cloud sequence. Considering that these temporal clues are not well aligned in the coordinate space, we propose a new temporal alignment module (TAM) based on the cross-attention mechanism to transform each individual feature into the feature space of the current frame. We also propose a new gating mechanism to learn the optimal weights for these transformed features, based on which the transformed features can be effectively aggregated as the final fused feature. The fused feature can be readily fed into the existing single frame-based point cloud upsampling methods (e.g., PU-Net, MPU and PU-GAN) to generate the dense point cloud for the current frame. Comprehensive experiments on three benchmark datasets DYNA, COMA, and MSR Action3D demonstrate the effectiveness of our method for upsampling point cloud sequences.
引用
收藏
页码:4686 / 4696
页数:11
相关论文
共 52 条
[1]   PointFlowNet: Learning Representations for Rigid Motion Estimation from Point Clouds [J].
Behl, Aseem ;
Paschalidou, Despoina ;
Donne, Simon ;
Geiger, Andreas .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7954-7963
[2]   4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks [J].
Choy, Christopher ;
Gwak, JunYoung ;
Savarese, Silvio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3070-3079
[3]   Efficient and Flexible Sampling with Blue Noise Properties of Triangular Meshes [J].
Corsini, Massimiliano ;
Cignoni, Paolo ;
Scopigno, Roberto .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (06) :914-924
[4]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[5]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[6]   FlowNet: Learning Optical Flow with Convolutional Networks [J].
Dosovitskiy, Alexey ;
Fischer, Philipp ;
Ilg, Eddy ;
Haeusser, Philip ;
Hazirbas, Caner ;
Golkov, Vladimir ;
van der Smagt, Patrick ;
Cremers, Daniel ;
Brox, Thomas .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2758-2766
[7]   HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-scale Point Clouds [J].
Gu, Xiuye ;
Wang, Yijie ;
Wu, Chongruo ;
Lee, Yong Jae ;
Wang, Panqu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3249-3258
[8]   Recurrent Back-Projection Network for Video Super-Resolution [J].
Haris, Muhammad ;
Shakhnarovich, Greg ;
Ukita, Norimichi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3892-3901
[9]  
Kappeler A, 2016, IEEE IMAGE PROC, P1150, DOI 10.1109/ICIP.2016.7532538
[10]   Spatio-Temporal Transformer Network for Video Restoration [J].
Kim, Tae Hyun ;
Sajjadi, Mehdi S. M. ;
Hirsch, Michael ;
Schoelkopf, Bernhard .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :111-127