SELMA: SEmantic Large-Scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints

被引：16

作者：

Testolina, Paolo ^{[1
]}

Barbato, Francesco ^{[1
]}

Michieli, Umberto ^{[1
]}

Giordani, Marco ^{[1
]}

Zanuttigh, Pietro ^{[1
]}

Zorzi, Michele ^{[1
]}

机构：

[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 07期

关键词：

Cameras; Sensors; Semantics; Meteorology; Autonomous vehicles; Task analysis; Synthetic data; Synthetic dataset; CARLA; autonomous driving; domain adaptation; semantic segmentation; sensor fusion; UNSUPERVISED DOMAIN ADAPTATION; CHALLENGES; BENCHMARK; NETWORKS;

D O I：

10.1109/TITS.2023.3257086

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Accurate scene understanding from multiple sensors mounted on cars is a key requirement for autonomous driving systems. Nowadays, this task is mainly performed through data-hungry deep learning techniques that need very large amounts of data to be trained. Due to the high cost of performing segmentation labeling, many synthetic datasets have been proposed. However, most of them miss the multi-sensor nature of the data, and do not capture the significant changes introduced by the variation of daytime and weather conditions. To fill these gaps, we introduce SELMA, a novel synthetic dataset for semantic segmentation that contains more than 30K unique waypoints acquired from 24 different sensors including RGB, depth, semantic cameras and LiDARs, in 27 different weather and daytime conditions, for a total of more than 20M samples. SELMA is based on CARLA, an open-source simulator for generating synthetic data in autonomous driving scenarios, that we modified to increase the variability and the diversity in the scenes and class sets, and to align it with other benchmark datasets. As shown by the experimental evaluation, SELMA allows the efficient training of standard and multi-modal deep learning architectures, and achieves remarkable results on real-world data. SELMA is free and publicly available, thus supporting open science and research.

引用

页码：7012 / 7024

页数：13

共 50 条

[41] Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval
Lu, Xu
Liu, Li
Nie, Liqiang
Chang, Xiaojun
Zhang, Huaxiang
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 4541 - 4554
[42] The Tesserae Project: Large-Scale, Longitudinal, In Situ, Multimodal Sensing of Information Workers
Mattingly, Stephen M.
Gregg, Julie M.
Audia, Pino
Bayraktaroglu, Ayse Elvan
Campbell, Andrew T.
Chawla, Nitesh V.
Das Swains, Vedant
De Choudhury, Munmun
D'Mello, Sidney K.
Dey, Anind K.
Gao, Ge
Jagannath, Krithika
Jiang, Kaifeng
Lin, Suwen
Liu, Qiang
Marko, Gloria
Martinez, Gonzalo J.
Masaba, Kizito
Mirjafari, Shayan
Moskal, Edward
Mulukutla, Raghu
Nies, Kari
Reddy, Manikanta D.
Robles-Granda, Pablo
Saha, Koustuv
Sirigiri, Anusha
Striegel, Aaron
CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[43] Context-Aware Network for Semantic Segmentation Toward Large-Scale Point Clouds in Urban Environments
Liu, Chun
Zeng, Doudou
Akbar, Akram
Wu, Hangbin
Jia, Shoujun
Xu, Zeran
Yue, Han
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[44] Empirical analysis of large-scale multimodal traffic with multi-sensor data
Fu, Hui
Wang, Yefei
Tang, Xianma
Zheng, Nan
Geroliminis, Nikolaos
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 118
[45] Enhanced Local Feature Learning With Simple Offset Attention for Semantic Segmentation of Large-Scale Point Clouds
Chen, Dong
Wang, Yuebin
Zhang, Liqiang
Kang, Zhizhong
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[46] Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications
Cheng, Qing
Zeller, Niclas
Cremers, Daniel
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9235 - 9242
[47] Hybrid Offset Position Encoding for Large-Scale Point Cloud Semantic Segmentation
Xiao, Yu
Wu, Hui
Chen, Yisheng
Chen, Chongcheng
Dong, Ruihai
Lin, Ding
REMOTE SENSING, 2025, 17 (02)
[48] Learning Binary Semantic Embedding for Large-Scale Breast Histology Image Analysis
Liu, Xingbo
Kang, Xiao
Nie, Xiushan
Guo, Jie
Wang, Shaohua
Yin, Yilong
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (07) : 3240 - 3250
[49] PSSNet: Planarity-sensible Semantic Segmentation of large-scale urban meshes
Gao, Weixiao
Nan, Liangliang
Boom, Bas
Ledoux, Hugo
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 196 : 32 - 44
[50] SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispectral Images
Ji, Wei
Li, Jingjing
Bian, Cheng
Zhang, Zhicheng
Cheng, Li
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3307 - 3316

← 1 2 3 4 5 →