SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection

Times cited: 0
Authors
Dai, Kun [1 ,2 ]
Jiang, Zhiqiang [1 ]
Xie, Tao [1 ,2 ]
Wang, Ke [1 ]
Liu, Dedong [1 ]
Fan, Zhendong [1 ]
Li, Ruifeng [1 ]
Zhao, Lijun [1 ,2 ]
Omar, Mohamed [1 ]
Affiliations
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[2] State Key Yangtze River Delta HIT Robot Technol Re, Wuhu 241000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Feature extraction; Object detection; Point cloud compression; Optimization; Accuracy; Shape; Proposals; Sun; Kernel; 3D object detection; synergistic optimization; deep learning;
DOI
10.1109/TMM.2024.3521782
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
In this work, we observe that indoor 3D object detection across varied scene domains involves both universal attributes and domain-specific features. Based on this insight, we propose SOFW, a synergistic optimization framework that investigates the feasibility of optimizing 3D object detection tasks concurrently across several dataset domains. The core of SOFW is to identify domain-shared parameters that encode universal scene attributes, while employing domain-specific parameters to capture the particularities of each scene domain. Technically, we introduce a set abstraction alteration strategy (SAAS) that embeds learnable domain-specific features into set abstraction layers, giving the network a refined understanding of each scene domain. In addition, we develop an element-wise sharing strategy (ESS) that enables fine-grained, adaptive discrimination between domain-shared and domain-specific parameters within network layers. Benefiting from the proposed techniques, SOFW builds feature representations for each scene domain by learning domain-specific parameters, while encoding generic attributes and contextual interdependencies via domain-shared parameters. Built upon the classical detection framework VoteNet without any complicated modules, SOFW delivers impressive performance on multiple benchmarks with a much smaller total storage footprint. Additionally, we demonstrate that the proposed ESS is a universal strategy: applying it to the voxel-based approach TR3D achieves cutting-edge detection accuracy on all of the S3DIS, ScanNet, and SUN RGB-D datasets.
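The abstract does not give the formulation of ESS, so the following is only an illustrative sketch of what element-wise sharing between domain-shared and domain-specific parameters could look like. It assumes a per-element gate (a sigmoid over a hypothetical learnable logit) that blends one shared weight vector with one weight vector per domain. The names `effective_weights`, `gate_logits`, and the example values are assumptions made for illustration and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def effective_weights(shared, specific, gate_logits):
    """Blend shared and domain-specific weights element by element.

    ASSUMPTION: a gate close to 1 means the element uses the shared
    weight; a gate close to 0 means it uses the domain-specific weight.
    This blending rule is hypothetical, not the paper's definition.
    """
    if not (len(shared) == len(specific) == len(gate_logits)):
        raise ValueError("all inputs must have the same length")
    blended = []
    for w_shared, w_domain, logit in zip(shared, specific, gate_logits):
        gate = sigmoid(logit)
        blended.append(gate * w_shared + (1.0 - gate) * w_domain)
    return blended


# Hypothetical toy parameters for one layer with three weights.
shared = [0.5, -1.0, 2.0]
specific = {
    "scannet": [0.7, -1.0, 0.0],
    "sunrgbd": [0.1, -1.0, 4.0],
}
# Element 0 is half shared, element 1 is shared, element 2 is specific.
gate_logits = [0.0, 20.0, -20.0]

weights = {
    name: effective_weights(shared, domain_weights, gate_logits)
    for name, domain_weights in specific.items()
}
```

In this toy setup, the second element is effectively identical across domains and the third is effectively private to each domain, which is the kind of fine-grained split the abstract attributes to ESS.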
Pages: 637 - 651
Page count: 15