SELMA: SEmantic Large-Scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints

被引:16
|
作者
Testolina, Paolo [1 ]
Barbato, Francesco [1 ]
Michieli, Umberto [1 ]
Giordani, Marco [1 ]
Zanuttigh, Pietro [1 ]
Zorzi, Michele [1 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
Cameras; Sensors; Semantics; Meteorology; Autonomous vehicles; Task analysis; Synthetic data; Synthetic dataset; CARLA; autonomous driving; domain adaptation; semantic segmentation; sensor fusion; UNSUPERVISED DOMAIN ADAPTATION; CHALLENGES; BENCHMARK; NETWORKS;
D O I
10.1109/TITS.2023.3257086
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate scene understanding from multiple sensors mounted on cars is a key requirement for autonomous driving systems. Nowadays, this task is mainly performed through data-hungry deep learning techniques that need very large amounts of data to be trained. Due to the high cost of performing segmentation labeling, many synthetic datasets have been proposed. However, most of them miss the multi-sensor nature of the data, and do not capture the significant changes introduced by the variation of daytime and weather conditions. To fill these gaps, we introduce SELMA, a novel synthetic dataset for semantic segmentation that contains more than 30K unique waypoints acquired from 24 different sensors including RGB, depth, semantic cameras and LiDARs, in 27 different weather and daytime conditions, for a total of more than 20M samples. SELMA is based on CARLA, an open-source simulator for generating synthetic data in autonomous driving scenarios, that we modified to increase the variability and the diversity in the scenes and class sets, and to align it with other benchmark datasets. As shown by the experimental evaluation, SELMA allows the efficient training of standard and multi-modal deep learning architectures, and achieves remarkable results on real-world data. SELMA is free and publicly available, thus supporting open science and research.
引用
收藏
页码:7012 / 7024
页数:13
相关论文
共 50 条
  • [41] Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval
    Lu, Xu
    Liu, Li
    Nie, Liqiang
    Chang, Xiaojun
    Zhang, Huaxiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 4541 - 4554
  • [42] The Tesserae Project: Large-Scale, Longitudinal, In Situ, Multimodal Sensing of Information Workers
    Mattingly, Stephen M.
    Gregg, Julie M.
    Audia, Pino
    Bayraktaroglu, Ayse Elvan
    Campbell, Andrew T.
    Chawla, Nitesh V.
    Das Swains, Vedant
    De Choudhury, Munmun
    D'Mello, Sidney K.
    Dey, Anind K.
    Gao, Ge
    Jagannath, Krithika
    Jiang, Kaifeng
    Lin, Suwen
    Liu, Qiang
    Marko, Gloria
    Martinez, Gonzalo J.
    Masaba, Kizito
    Mirjafari, Shayan
    Moskal, Edward
    Mulukutla, Raghu
    Nies, Kari
    Reddy, Manikanta D.
    Robles-Granda, Pablo
    Saha, Koustuv
    Sirigiri, Anusha
    Striegel, Aaron
    CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [43] Context-Aware Network for Semantic Segmentation Toward Large-Scale Point Clouds in Urban Environments
    Liu, Chun
    Zeng, Doudou
    Akbar, Akram
    Wu, Hangbin
    Jia, Shoujun
    Xu, Zeran
    Yue, Han
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [44] Empirical analysis of large-scale multimodal traffic with multi-sensor data
    Fu, Hui
    Wang, Yefei
    Tang, Xianma
    Zheng, Nan
    Geroliminis, Nikolaos
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 118
  • [45] Enhanced Local Feature Learning With Simple Offset Attention for Semantic Segmentation of Large-Scale Point Clouds
    Chen, Dong
    Wang, Yuebin
    Zhang, Liqiang
    Kang, Zhizhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [46] Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications
    Cheng, Qing
    Zeller, Niclas
    Cremers, Daniel
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9235 - 9242
  • [47] Hybrid Offset Position Encoding for Large-Scale Point Cloud Semantic Segmentation
    Xiao, Yu
    Wu, Hui
    Chen, Yisheng
    Chen, Chongcheng
    Dong, Ruihai
    Lin, Ding
    REMOTE SENSING, 2025, 17 (02)
  • [48] Learning Binary Semantic Embedding for Large-Scale Breast Histology Image Analysis
    Liu, Xingbo
    Kang, Xiao
    Nie, Xiushan
    Guo, Jie
    Wang, Shaohua
    Yin, Yilong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (07) : 3240 - 3250
  • [49] PSSNet: Planarity-sensible Semantic Segmentation of large-scale urban meshes
    Gao, Weixiao
    Nan, Liangliang
    Boom, Bas
    Ledoux, Hugo
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 196 : 32 - 44
  • [50] SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispectral Images
    Ji, Wei
    Li, Jingjing
    Bian, Cheng
    Zhang, Zhicheng
    Cheng, Li
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3307 - 3316