CV-MOS: A Cross-View Model for Motion Segmentation

被引:0
作者
Tang, Xiaoyu [1 ,2 ]
Chen, Zeyu [1 ,2 ]
Cheng, Jintao [1 ,2 ]
Chen, Xieyuanli [3 ]
Wu, Jin [4 ]
Xue, Bohuan [5 ]
机构
[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China
[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;
D O I
10.1109/TIM.2024.3458036
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In autonomous driving, accurately distinguishing between static and moving objects is crucial for the autonomous driving system. When performing the motion object segmentation (MOS) task, effectively leveraging motion information from objects becomes a primary challenge in improving the recognition of moving objects. Previous methods either utilized range view (RV) or bird's eye view (BEV) residual maps to capture motion information. Unlike traditional approaches, we propose combining RV and BEV residual maps to exploit a greater potential of motion information jointly. Thus, we introduce CV-MOS, a cross-view model for moving object segmentation. Novelty, we decouple spatial-temporal information by capturing the motion from BEV and RV residual maps and generating semantic features from range images, which are used as moving object guidance for the motion branch. Our direct and unique solution maximizes the use of range images and RV and BEV residual maps, significantly enhancing the performance of LiDAR-based MOS task. Our method achieved leading IoU (%) scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset. In particular, CV-MOS demonstrates SOTA performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.
引用
收藏
页数:10
相关论文
共 29 条
[1]   Automatic Labeling to Generate Training Data for Online LiDAR-Based Moving Object Segmentation [J].
Chen, Xieyuanli ;
Mersch, Benedikt ;
Nunes, Lucas ;
Marcuzzi, Rodrigo ;
Vizzo, Ignacio ;
Behley, Jens ;
Stachniss, Cyrill .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) :6107-6114
[2]   Moving Object Segmentation in 3D LiDAR Data: A Learning-Based Approach Exploiting Sequential Data [J].
Chen, Xieyuanli ;
Li, Shijie ;
Mersch, Benedikt ;
Wiesmann, Louis ;
Gall, Jurgen ;
Behley, Jens ;
Stachniss, Cyrill .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) :6529-6536
[3]  
Chen XYL, 2019, IEEE INT C INT ROBOT, P4530, DOI 10.1109/IROS40897.2019.8967704
[4]   MF-MOS: A Motion-Focused Model for Moving Object Segmentation [J].
Cheng, Jintao ;
Zeng, Kang ;
Huang, Zhuoxu ;
Tang, Xiaoyu ;
Wu, Jin ;
Zhang, Chengxi ;
Chen, Xieyuanli ;
Fan, Rui .
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, :12499-12505
[5]  
Cortinhal Tiago, 2020, Advances in Visual Computing. 15th International Symposium, ISVC 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12510), P207, DOI 10.1007/978-3-030-64559-5_16
[6]   Remove, then Revert: Static Point cloud Map Construction using Multiresolution Range Images [J].
Kim, Giseop ;
Kim, Ayoung .
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :10758-10765
[7]   RVMOS: Range-View Moving Object Segmentation Leveraged by Semantic and Motion Features [J].
Kim, Jaeyeul ;
Woo, Jungwan ;
Im, Sunghoon .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) :8044-8051
[8]   Rethinking Range View Representation for LiDAR Segmentation [J].
Kong, Lingdong ;
Liu, Youquan ;
Chen, Runnan ;
Ma, Yuexin ;
Zhu, Xinge ;
Li, Yikang ;
Hou, Yuenan ;
Qiao, Yu ;
Liu, Ziwei .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :228-240
[9]   Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings with Multivariate Occupancy Time Series [J].
Kreutz, Thomas ;
Muehlhaeuser, Max ;
Guinea, Alejandro Sanchez .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :1644-1653
[10]   Autonomous Robot Navigation in Highly Populated Pedestrian Zones [J].
Kuemmerle, Rainer ;
Ruhnke, Michael ;
Steder, Bastian ;
Stachniss, Cyrill ;
Burgard, Wolfram .
JOURNAL OF FIELD ROBOTICS, 2015, 32 (04) :565-589