SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection

Cited by: 0
Authors
Dai, Kun [1, 2]
Jiang, Zhiqiang [1]
Xie, Tao [1, 2]
Wang, Ke [1]
Liu, Dedong [1]
Fan, Zhendong [1]
Li, Ruifeng [1]
Zhao, Lijun [1, 2]
Omar, Mohamed [1]
Affiliations
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[2] State Key Yangtze River Delta HIT Robot Technol Re, Wuhu 241000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Three-dimensional displays; Feature extraction; Object detection; Point cloud compression; Optimization; Accuracy; Shape; Proposals; Sun; Kernel; 3D object detection; synergistic optimization; deep learning
DOI
10.1109/TMM.2024.3521782
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
In this work, we observe that indoor 3D object detection across varied scene domains involves both universal attributes and domain-specific features. Based on this insight, we propose SOFW, a synergistic optimization framework that investigates the feasibility of optimizing 3D object detection tasks concurrently across several dataset domains. The core of SOFW is to identify domain-shared parameters that encode universal scene attributes, while employing domain-specific parameters to capture the particularities of each scene domain. Technically, we introduce a set abstraction alteration strategy (SAAS) that embeds learnable domain-specific features into set abstraction layers, thus giving the network a refined comprehension of each scene domain. In addition, we develop an element-wise sharing strategy (ESS) to facilitate fine-grained adaptive discernment between domain-shared and domain-specific parameters for network layers. Benefiting from the proposed techniques, SOFW crafts feature representations for each scene domain by learning domain-specific parameters, while encoding generic attributes and contextual interdependencies via domain-shared parameters. Built upon the classical detection framework VoteNet without any complicated modules, SOFW delivers impressive performance on multiple benchmarks with a much smaller total storage footprint. Additionally, we demonstrate that the proposed ESS is a universal strategy, and that applying it to the voxel-based approach TR3D achieves cutting-edge detection accuracy on all of the S3DIS, ScanNet, and SUN RGB-D datasets.
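The abstract does not give the formulation of ESS, so the following is only an illustrative sketch of what element-wise sharing between domain-shared and domain-specific parameters could look like. It assumes a sigmoid gate per weight element that blends one shared matrix with one matrix per domain. The function name `effective_weights`, the gate formulation, and the 4 x 3 layer shape are all hypothetical and are not taken from the paper.

```python
import numpy as np


def effective_weights(shared, specific, gate_logits):
    """Blend shared and domain-specific weights element by element.

    Assumed formulation (not from the paper): a sigmoid gate near 1 keeps
    the shared value for that element, a gate near 0 keeps the
    domain-specific value.
    """
    gate = 1.0 / (1.0 + np.exp(-gate_logits))
    return gate * shared + (1.0 - gate) * specific


rng = np.random.default_rng(0)
domains = ["S3DIS", "ScanNet", "SUN RGB-D"]

# One shared weight matrix plus one hypothetical matrix per domain.
shared = rng.normal(size=(4, 3))
specific = {name: rng.normal(size=(4, 3)) for name in domains}

# Hypothetical gate logits; in a real network these would be learned.
gate_logits = rng.normal(size=(4, 3))

# Per-domain weights actually used by the layer under this sketch.
weights = {
    name: effective_weights(shared, specific[name], gate_logits)
    for name in domains
}

for name in domains:
    print(name, weights[name].shape)
```

Under this assumption, elements whose gate saturates toward 1 are effectively stored once for all domains, which is one way a sharing scheme could reduce the total storage footprint relative to training a separate network per dataset.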
Pages: 637-651
Number of pages: 15