SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection

被引:0
|
作者
Dai, Kun [1 ,2 ]
Jiang, Zhiqiang [1 ]
Xie, Tao [1 ,2 ]
Wang, Ke [1 ]
Liu, Dedong [1 ]
Fan, Zhendong [1 ]
Li, Ruifeng [1 ]
Zhao, Lijun [1 ,2 ]
Omar, Mohamed [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[2] State Key Yangtze River Delta HIT Robot Technol Re, Wuhu 241000, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Feature extraction; Object detection; Point cloud compression; Optimization; Accuracy; Shape; Proposals; Sun; Kernel; 3D object detection; synergistic optimization; deep learning;
D O I
10.1109/TMM.2024.3521782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we observe that indoor 3D object detection across varied scene domains encompasses both universal attributes and specific features. Based on this insight, we propose SOFW, a synergistic optimization framework that investigates the feasibility of optimizing 3D object detection tasks concurrently spanning several dataset domains. The core of SOFW is identifying domain-shared parameters to encode universal scene attributes, while employing domain-specific parameters to delve into the particularities of each scene domain. Technically, we introduce a set abstraction alteration strategy (SAAS) that embeds learnable domain-specific features into set abstraction layers, thus empowering the network with a refined comprehension for each scene domain. Besides, we develop an element-wise sharing strategy (ESS) to facilitate fine-grained adaptive discernment between domain-shared and domain-specific parameters for network layers. Benefited from the proposed techniques, SOFW crafts feature representations for each scene domain by learning domain-specific parameters, whilst encoding generic attributes and contextual interdependencies via domain-shared parameters. Built upon the classical detection framework VoteNet without any complicated modules, SOFW delivers impressive performances under multiple benchmarks with much fewer total storage footprint. Additionally, we demonstrate that the proposed ESS is a universal strategy and applying it to a voxels-based approach TR3D can realize cutting-edge detection accuracy on all S3DIS, ScanNet, and SUN RGB-D datasets.
引用
收藏
页码:637 / 651
页数:15
相关论文
共 50 条
  • [31] TSFF: a two-stage fusion framework for 3D object detection
    Jiang, Guoqing
    Li, Saiya
    Huang, Ziyu
    Cai, Guorong
    Su, Jinhe
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [32] Framework for 3D Object Hole Filling
    Setty, Shankar
    Ganihar, Syed Altaf
    Mudenagudi, Uma
    2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
  • [33] A Framework for 3D Object Identification and Tracking
    Chliveros, Georgios
    Figueiredo, Rui P.
    Moreno, Plinio
    Pateraki, Maria
    Bernardino, Alexandre
    Santos-Victor, Jose
    Trahanias, Panos
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 672 - 677
  • [34] MonoDFNet: Monocular 3D Object Detection with Depth Fusion and Adaptive Optimization
    Gao, Yuhan
    Wang, Peng
    Li, Xiaoyan
    Sun, Mengyu
    Di, Ruohai
    Li, Liangliang
    Hong, Wei
    SENSORS, 2025, 25 (03)
  • [35] A robust 3D unique descriptor for 3D object detection
    Joshi, Piyush
    Rastegarpanah, Alireza
    Stolkin, Rustam
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [36] GFENet: Group-Free Enhancement Network for Indoor Scene 3D Object Detection
    Zhou, Feng
    Dai, Ju
    Pan, Junjun
    Zhu, Mengxiao
    Cai, Xingquan
    Huang, Bin
    Wang, Chen
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT III, 2024, 14497 : 119 - 136
  • [37] SOA: Seed point offset attention for indoor 3D object detection in point clouds
    Shu, Jun
    Yu, Shiqi
    Shu, Xinyi
    Hu, Jiewen
    COMPUTERS & GRAPHICS-UK, 2024, 123
  • [38] 3D Scene Reconstruction and Object Recognition for Indoor Scene
    Shen, Yangping
    Manabe, Yoshitsugu
    Yata, Noriko
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [39] ActiveAnno3D-An Active Learning Framework for Multi-Modal 3D Object Detection
    Ghita, Ahmed
    Antoniussen, Bjork
    Zimmer, Walter
    Greer, Ross
    Cress, Christian
    Mogelmose, Andreas
    Trivedi, Mohan M.
    Knoll, Alois C.
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1699 - 1706
  • [40] ETS-3D: An Efficient Two-Stage Framework for Stereo 3D Object Detection
    Ji, Chaofeng
    Liu, Guizhong
    Zhao, Dan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88