CV-MOS: A Cross-View Model for Motion Segmentation

Cited by: 0

Authors:

Tang, Xiaoyu ^{[1,2]}

Chen, Zeyu ^{[1,2]}

Cheng, Jintao ^{[1,2]}

Chen, Xieyuanli ^{[3]}

Wu, Jin ^{[4]}

Xue, Bohuan ^{[5]}

Affiliations:

[1] South China Normal Univ, Fac Engn, Sch Elect & Informat Engn, Foshan 528225, Guangdong, Peoples R China

[2] South China Normal Univ, Xingzhi Coll, Guangzhou 510000, Guangdong, Peoples R China

[3] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

[4] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

[5] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024 / Vol. 73

Funding:

National Natural Science Foundation of China;

Keywords:

Point cloud compression; Semantics; Three-dimensional displays; Feature extraction; Laser radar; Periodic structures; Motion segmentation; Autonomous driving; cross view; LiDAR motion segmentation;

DOI:

10.1109/TIM.2024.3458036

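Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract: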

In autonomous driving, accurately distinguishing between static and moving objects is crucial. In the moving object segmentation (MOS) task, effectively leveraging objects' motion information is a primary challenge in improving the recognition of moving objects. Previous methods used either range view (RV) or bird's eye view (BEV) residual maps to capture motion information. Unlike these approaches, we propose combining RV and BEV residual maps to jointly exploit more of the available motion information. Thus, we introduce CV-MOS, a cross-view model for moving object segmentation. Notably, we decouple spatial-temporal information by capturing motion from BEV and RV residual maps and generating semantic features from range images, which serve as moving object guidance for the motion branch. Our direct and unique solution maximizes the use of range images and RV and BEV residual maps, significantly enhancing the performance of the LiDAR-based MOS task. Our method achieved leading IoU scores of 77.5% and 79.2% on the validation and test sets of the SemanticKITTI dataset, respectively. In particular, CV-MOS demonstrates state-of-the-art (SOTA) performance to date on various datasets. The CV-MOS implementation is available at https://github.com/SCNU-RISLAB/CV-MOS.

Pages: 10

50 items in total

[41] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
Liu, Yang
Lu, Zhaoyang
Li, Jing
Yang, Tao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2416 - 2430
[42] Multi-View Gait Image Generation for Cross-View Gait Recognition
Chen, Xin
Luo, Xizhao
Weng, Jian
Luo, Weiqi
Li, Huiting
Tian, Qi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3041 - 3055
[43] Joint Versus Independent Multiview Hashing for Cross-View Retrieval
Hu, Peng
Peng, Xi
Zhu, Hongyuan
Lin, Jie
Zhen, Liangli
Peng, Dezhong
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (10) : 4982 - 4993
[44] Pairwise-Covariance Multi-view Discriminant Analysis for Robust Cross-View Human Action Recognition
Tran, Hoang-Nhat
Nguyen, Hong-Quan
Doan, Huong-Giang
Tran, Thanh-Hai
Le, Thi-Lan
Vu, Hai
IEEE ACCESS, 2021, 9 : 76097 - 76111
[45] Cross-view Matching Neural Network for Remote Sensing Images
Hui, Tian
Chen, Xiaoqing
Zhu, Ronggang
Xu, YueLei
Zhang, Zhaoxiang
2021 IEEE 6TH INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2021), 2021, : 138 - 143
[46] Human pose estimation based on cross-view feature fusion
Sun, Dandan
Wang, Siqi
Xia, Hailun
Zhang, Changan
Gao, Jianlong
Mao, Mingyu
VISUAL COMPUTER, 2024, 40 (09) : 6581 - 6597
[47] Cross-view learning with scatters and manifold exploitation in geodesic space
Tian, Qing
Zhang, Heng
Xia, Shiyu
Xu, Heng
Ma, Chuang
ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (09) : 5425 - 5441
[48] Cross-View Action Recognition Based on Hierarchical View-Shared Dictionary Learning
Zhang, Chengkun
Zheng, Huicheng
Lai, Jianhuang
IEEE ACCESS, 2018, 6 : 16855 - 16868
[49] Horizontal and Vertical Part-Wise Feature Extraction for Cross-View Gait Recognition
Uddin, Md. Zasim
Hasan, Kamrul
Ahad, Md Atiqur Rahman
Alnajjar, Fady
IEEE ACCESS, 2024, 12 : 185511 - 185527
[50] Person Reidentification via Unsupervised Cross-View Metric Learning
Feng, Yachuang
Yuan, Yuan
Lu, Xiaoqiang
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) : 1849 - 1859