CV-MOS: A Cross-View Model for Motion Segmentation

被引:0
|
作者
Tang, Xiaoyu [1 ,2 ]
Chen, Zeyu [1 ,2 ]
Cheng, Jintao [1 ,2 ]
Chen, Xieyuanli [3 ]
Wu, Jin [4 ]
Xue, Bohuan [5 ]
机构
[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China
[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;
D O I
10.1109/TIM.2024.3458036
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In autonomous driving, accurately distinguishing between static and moving objects is crucial for the autonomous driving system. When performing the motion object segmentation (MOS) task, effectively leveraging motion information from objects becomes a primary challenge in improving the recognition of moving objects. Previous methods either utilized range view (RV) or bird's eye view (BEV) residual maps to capture motion information. Unlike traditional approaches, we propose combining RV and BEV residual maps to exploit a greater potential of motion information jointly. Thus, we introduce CV-MOS, a cross-view model for moving object segmentation. Novelty, we decouple spatial-temporal information by capturing the motion from BEV and RV residual maps and generating semantic features from range images, which are used as moving object guidance for the motion branch. Our direct and unique solution maximizes the use of range images and RV and BEV residual maps, significantly enhancing the performance of LiDAR-based MOS task. Our method achieved leading IoU (%) scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset. In particular, CV-MOS demonstrates SOTA performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Cross-view and multi-view gait recognitions based on view transformation model using multi-layer perceptron
    Kusakunniran, Worapan
    Wu, Qiang
    Zhang, Jian
    Li, Hongdong
    PATTERN RECOGNITION LETTERS, 2012, 33 (07) : 882 - 889
  • [32] GaitSet: Cross-View Gait Recognition Through Utilizing Gait As a Deep Set
    Chao, Hanqing
    Wang, Kun
    He, Yiwei
    Zhang, Junping
    Feng, Jianfeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3467 - 3478
  • [33] Mining Semantically Consistent Patterns for Cross-View Data
    Zhang, Lei
    Zhao, Yao
    Zhu, Zhenfeng
    Wei, Shikui
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (11) : 2745 - 2758
  • [34] NiteDR: Nighttime Image De-Raining With Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes
    Shi, Cidan
    Fang, Lihuang
    Wu, Han
    Xian, Xiaoyu
    Shi, Yukai
    Lin, Liang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9203 - 9215
  • [35] SEMI-SUPERVISED CROSS-VIEW SCENE MODEL ADAPTATION FOR REMOTE SENSING IMAGE CLASSIFICATION
    Deng, Zhipeng
    Sun, Hao
    Zhou, Shilin
    Ji, Kefeng
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2376 - 2379
  • [36] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
    Liu, Yang
    Lu, Zhaoyang
    Li, Jing
    Yang, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2416 - 2430
  • [37] Multi-View Gait Image Generation for Cross-View Gait Recognition
    Chen, Xin
    Luo, Xizhao
    Weng, Jian
    Luo, Weiqi
    Li, Huiting
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3041 - 3055
  • [38] UAV-Satellite View Synthesis for Cross-View Geo-Localization
    Tian, Xiaoyang
    Shao, Jie
    Ouyang, Deqiang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4804 - 4815
  • [39] Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval
    Zeng, Zelong
    Wang, Zheng
    Yang, Fan
    Satoh, Shin'ichi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2176 - 2188
  • [40] Cross-View Action Recognition by Projection-Based Augmentation
    Le, Chien-Quang
    Thanh Duc Ngo
    Duy-Dinh Le
    Satoh, Shin'ichi
    Duc Anh Duong
    IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015, 2016, 9431 : 215 - 227