GMA3D: Local-Global Attention Learning to Estimate Occluded Motions of Scene Flow

被引:0
作者
Lu, Zhiyang [1 ]
Cheng, Ming [2 ]
机构
[1] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II | 2024年 / 14426卷
关键词
Scene flow estimation; Deep learning; Point clouds; Local-global attention;
D O I
10.1007/978-981-99-8432-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene flow plays a pivotal role in the 3D perception task by capturing motion information between consecutive frames. However, occlusion phenomena are commonly observed in dynamic scenes, whether from the sparsity data sampling or real-world occlusion. In this paper, we focus on addressing occlusion issues in scene flow by the semantic self-similarity and motion consistency of the moving objects. We propose a GMA3D module based on the transformer framework, which utilizes local and global semantic similarity to infer the motion information of occluded points from the motion information of local and global non-occluded points respectively, and then uses the Offset Aggregator to aggregate them. Our module is the first to apply the transformer-based architecture to gauge the scene flow occlusion problem on point clouds. Experiments show that our GMA3D can solve the occlusion problem in the scene flow, especially in the real scene. We evaluated the proposed method on the occluded version of point cloud datasets and get state-of-the-art results on the real-world KITTI dataset. To testify that GMA3D is still beneficial to non-occluded scene flow, we also conducted experiments on non-occluded version datasets and achieved promising performance on FlyThings3D and KITTI. The code is available at https://github.com/O-VIGIA/GMA3D.
引用
收藏
页码:16 / 27
页数:12
相关论文
共 28 条
[1]   Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation [J].
Cheng, Wencan ;
Ko, Jong Hwan .
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 :108-124
[2]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[3]   HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-scale Point Clouds [J].
Gu, Xiuye ;
Wang, Yijie ;
Wu, Chongruo ;
Lee, Yong Jae ;
Wang, Panqu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3249-3258
[4]   PCT: Point cloud transformer [J].
Guo, Meng-Hao ;
Cai, Jun-Xiong ;
Liu, Zheng-Ning ;
Mu, Tai-Jiang ;
Martin, Ralph R. ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199
[5]  
Hur J., 2020, P IEEECVF C COMPUTER
[6]   Occlusions, Motion and Depth Boundaries with a Generic Network for Disparity, Optical Flow or Scene Flow Estimation [J].
Ilg, Eddy ;
Saikia, Tonmoy ;
Keuper, Margret ;
Brox, Thomas .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :626-643
[7]   Learning to Estimate Hidden Motions with Global Motion Aggregation [J].
Jiang, Shihao ;
Campbell, Dylan ;
Lu, Yao ;
Li, Hongdong ;
Hartley, Richard .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9752-9761
[8]   FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation [J].
Kittenplon, Yair ;
Eldar, Yonina C. ;
Raviv, Dan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4112-4121
[9]   SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow [J].
Lang, Itai ;
Aiger, Dror ;
Cole, Forrester ;
Avidan, Shai ;
Rubinstein, Michael .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :5281-5290
[10]   FlowNet3D: Learning Scene Flow in 3D Point Clouds [J].
Liu, Xingyu ;
Qi, Charles R. ;
Guibas, Leonidas J. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :529-537