Learning graph-based representations for scene flow estimation

被引:0
作者
Mingliang Zhai
Hao Gao
Ye Liu
Jianhui Nie
Kang Ni
机构
[1] Nanjing University of Posts and Telecommunications,School of Automation
[2] Nanjing University of Posts and Telecommunications,School of Computer Science
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Deep learning; Scene flow estimation; Graph convolutional networks; 3D point cloud; Scene understanding;
D O I
暂无
中图分类号
学科分类号
摘要
Scene flow estimation is a fundamental task of autonomous driving. Compared with optical flow, scene flow can provide sufficient 3D motion information of the dynamic scene. With the increasing popularity of 3D LiDAR sensors and deep learning technology, 3D LiDAR-based scene flow estimation methods have achieved outstanding results on public benchmarks. Current methods usually adopt Multiple Layer Perceptron (MLP) or traditional convolution-like operation for feature extraction. However, the characteristics of point clouds are not exploited adequately in these methods, and thus some key semantic and geometric structures are not well captured. To address this issue, we propose to introduce graph convolution to exploit the structural features adaptively. In particular, multiple graph-based feature generators and a graph-based flow refinement module are deployed to encode geometric relations among points. Furthermore, residual connections are used in the graph-based feature generator to enhance feature representation and deep supervision of the graph-based network. In addition, to focus on short-term dependencies, we introduce a single gate-based recurrent unit to refine scene flow predictions iteratively. The proposed network is trained on the FlyingThings3D dataset and evaluated on the FlyingThings3D, KITTI, and Argoverse datasets. Comprehensive experiments show that all proposed components contribute to the performance of scene flow estimation, and our method can achieve potential performance compared to the recent approaches.
引用
收藏
页码:7317 / 7334
页数:17
相关论文
共 50 条
[31]   A Survey of CNN-Based Techniques for Scene Flow Estimation [J].
Muthu, Sundaram ;
Tennakoon, Ruwan ;
Hoseinnezhad, Reza ;
Bab-Hadiashar, Alireza .
IEEE ACCESS, 2023, 11 :99289-99303
[32]   Graph-based deep learning techniques for remote sensing applications: Techniques, taxonomy, and applications - A comprehensive review [J].
Khlifi, Manel Khazri ;
Boulila, Wadii ;
Farah, Imed Riadh .
COMPUTER SCIENCE REVIEW, 2023, 50
[33]   Soteria: Detecting Adversarial Examples in Control Flow Graph-based Malware Classifiers [J].
Alasmary, Hisham ;
Abusnaina, Ahmed ;
Jang, Rhongho ;
Abuhamad, Mohammed ;
Anwar, Afsah ;
Nyang, DaeHun ;
Mohaisen, David .
2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, :888-898
[34]   Optical flow and scene flow estimation: A survey [J].
Zhai, Mingliang ;
Xiang, Xuezhi ;
Lv, Ning ;
Kong, Xiangdong .
PATTERN RECOGNITION, 2021, 114
[35]   End-To-End Graph-Based Deep Semi-Supervised Learning with Extended Graph Laplacian [J].
Wang, Zihao ;
Tu, Enmei ;
Zhou, Meng ;
Yang, Jie .
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, :5948-5953
[36]   Deep learning, graph-based text representation and classification: a survey, perspectives and challenges [J].
Phu Pham ;
Loan T. T. Nguyen ;
Witold Pedrycz ;
Bay Vo .
Artificial Intelligence Review, 2023, 56 :4893-4927
[37]   Graph-Based Feature Learning for Cross-Project Software Defect Prediction [J].
Abdu, Ahmed ;
Zhai, Zhengjun ;
Abdo, Hakim A. ;
Algabri, Redhwan ;
Lee, Sungon .
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01) :161-180
[38]   Learning Graph Representations With Maximal Cliques [J].
Molaei, Soheila ;
Bousejin, Nima Ghanbari ;
Zare, Hadi ;
Jalili, Mahdi ;
Pan, Shirui .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) :1089-1096
[39]   Deep learning, graph-based text representation and classification: a survey, perspectives and challenges [J].
Phu Pham ;
Loan T T Nguyen ;
Pedrycz, Witold ;
Vo, Bay .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (06) :4893-4927
[40]   Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future [J].
Ahmedt-Aristizabal, David ;
Armin, Mohammad Ali ;
Denman, Simon ;
Fookes, Clinton ;
Petersson, Lars .
SENSORS, 2021, 21 (14)