SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection

Citations: 0

Authors:

Dai, Kun [1,2]

Jiang, Zhiqiang [1]

Xie, Tao [1,2]

Wang, Ke [1]

Liu, Dedong [1]

Fan, Zhendong [1]

Li, Ruifeng [1]

Zhao, Lijun [1,2]

Omar, Mohamed [1]

Affiliations:

[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China

[2] State Key Yangtze River Delta HIT Robot Technol Re, Wuhu 241000, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2025 / Vol. 27

Funding:

National Natural Science Foundation of China;

Keywords:

Three-dimensional displays; Feature extraction; Object detection; Point cloud compression; Optimization; Accuracy; Shape; Proposals; Sun; Kernel; 3D object detection; synergistic optimization; deep learning;

DOI:

10.1109/TMM.2024.3521782

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this work, we observe that indoor 3D object detection across varied scene domains encompasses both universal attributes and specific features. Based on this insight, we propose SOFW, a synergistic optimization framework that investigates the feasibility of concurrently optimizing 3D object detection tasks across several dataset domains. The core of SOFW is identifying domain-shared parameters to encode universal scene attributes, while employing domain-specific parameters to capture the particularities of each scene domain. Technically, we introduce a set abstraction alteration strategy (SAAS) that embeds learnable domain-specific features into set abstraction layers, thus giving the network a refined comprehension of each scene domain. In addition, we develop an element-wise sharing strategy (ESS) to facilitate fine-grained adaptive discernment between domain-shared and domain-specific parameters for network layers. Benefiting from these techniques, SOFW crafts feature representations for each scene domain by learning domain-specific parameters, while encoding generic attributes and contextual interdependencies via domain-shared parameters. Built upon the classical detection framework VoteNet without any complicated modules, SOFW delivers strong performance on multiple benchmarks with a much smaller total storage footprint. Additionally, we demonstrate that the proposed ESS is a universal strategy, and that applying it to the voxel-based approach TR3D achieves cutting-edge detection accuracy on all of the S3DIS, ScanNet, and SUN RGB-D datasets.
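The abstract does not spell out how ESS decides which parameters are shared, so the following is only a minimal sketch of one way element-wise sharing could work, not the authors' implementation. It assumes a hypothetical per-element gate (the class `ElementwiseSharedWeight`, the attribute `gate_logits`, and the sigmoid blending are all illustrative choices) that mixes a domain-shared weight with a domain-specific one. The domain names are borrowed from the datasets listed in the abstract.

```python
import math
import random


def sigmoid(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


class ElementwiseSharedWeight:
    """Hypothetical flat weight vector mixing shared and per-domain values.

    Assumption (not from the paper): each element has a gate in (0, 1);
    a gate near 1 means the element is domain-shared, a gate near 0 means
    each domain uses its own value for that element.
    """

    def __init__(self, size, domains, seed=0):
        rng = random.Random(seed)
        # One copy of the weights shared by every domain.
        self.shared = [rng.gauss(0.0, 1.0) for _ in range(size)]
        # One extra copy per domain for the domain-specific part.
        self.specific = {
            d: [rng.gauss(0.0, 1.0) for _ in range(size)] for d in domains
        }
        # Gate logits would be learned in practice; here they start neutral.
        self.gate_logits = [0.0] * size

    def effective(self, domain):
        """Return the weights actually used for `domain`."""
        out = []
        for logit, s, p in zip(self.gate_logits, self.shared, self.specific[domain]):
            g = sigmoid(logit)
            out.append(g * s + (1.0 - g) * p)
        return out


# Usage: the first two elements are pushed toward "shared", the rest toward "specific".
w = ElementwiseSharedWeight(4, ["ScanNet", "SUN RGB-D"])
w.gate_logits = [50.0, 50.0, -50.0, -50.0]
scannet_w = w.effective("ScanNet")
sunrgbd_w = w.effective("SUN RGB-D")
```

With these gate values, the first two entries of `scannet_w` and `sunrgbd_w` coincide with the shared weights, while the last two follow each domain's own parameters. This is the kind of fine-grained split between domain-shared and domain-specific parameters that the abstract describes.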


Pages: 637-651

Page count: 15

50 items in total

[1] A LIDAR-BASED 3D INDOOR MAPPING FRAMEWORK WITH MISMATCH DETECTION AND OPTIMIZATION
Wang, Zhiyong
Liu, Weiquan
Wen, Chenglu
Shi, Yongfei
Yan, Xiaocheng
Tan, Jinbin
Wang, Cheng
Li, Jonathan
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 7499 - 7502
[2] Hierarchical Point Attention for Indoor 3D Object Detection
Shu, Manli
Xue, Le
Yu, Ning
Martin-Martin, Roberto
Xiong, Caiming
Goldstein, Tom
Niebles, Juan Carlos
Xu, Ran
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 4245 - 4251
[3] Monocular 3D object detection for an indoor robot environment
Kim, Jiwon
Lee, GiJae
Kim, Jun-Sik
Kim, Hyunwoo J.
Kim, KangGeon
2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 438 - 445
[4] Image attention transformer network for indoor 3D object detection
REN KeYan
YAN Tong
HU ZhaoXin
HAN HongGui
ZHANG YunLu
Science China(Technological Sciences), 2024, (07) : 2176 - 2190
[5] Image attention transformer network for indoor 3D object detection
Ren, Keyan
Yan, Tong
Hu, Zhaoxin
Han, Honggui
Zhang, Yunlu
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (07) : 2176 - 2190
[6] Image attention transformer network for indoor 3D object detection
REN KeYan
YAN Tong
HU ZhaoXin
HAN HongGui
ZHANG YunLu
Science China(Technological Sciences), 2024, 67 (07) : 2176 - 2190
[7] Spatial and Semantic Information Enhancement for Indoor 3D Object Detection
Chen, Chunmei
Liang, Zhiqiang
Liu, Haitao
Liu, Xin
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (05) : 831 - 839
[8] MapFusion: A General Framework for 3D Object Detection with HDMaps
Fang, Jin
Zhou, Dingfu
Song, Xibin
Zhang, Liangjun
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3406 - 3413
[9] MonoGRNet: A General Framework for Monocular 3D Object Detection
Qin, Zengyi
Wang, Jinglu
Lu, Yan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5170 - 5184
[10] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection
Zhu, Yun
Hui, Le
Shen, Yaqi
Xie, Jin
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7811 - 7819
