EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection

Cited by: 1
Authors
Duan, Yao [1]
Yi, Renjiao [1]
Gao, Yuanming [1]
Xu, Kai [1]
Zhu, Chenyang [1]
Affiliation
[1] National University of Defense Technology, School of Computer, Changsha 410000, People's Republic of China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
indoor scene; object detection; contrastive learning; feature enhancement
DOI
10.1007/s41095-023-0366-0
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Good initial proposals are critical for 3D object detection applications. However, due to the significant geometry variation of indoor scenes, incomplete and noisy proposals are inevitable in most cases. Mining feature information among these "bad" proposals may mislead the detection. Contrastive learning provides a feasible way to represent proposals, as it can align complete and incomplete/noisy proposals in feature space. The aligned feature space helps build robust 3D representations even when bad proposals are given. Therefore, we devise a new contrastive learning framework for indoor 3D object detection, called EFECL, that learns robust 3D representations by contrastive learning of proposals on two different levels. Specifically, we optimize both instance-level and category-level contrasts to align features by capturing instance-specific characteristics and semantic-aware common patterns. Furthermore, we propose an enhanced feature aggregation module to extract more general and informative features for contrastive learning. Evaluations on the ScanNet V2 and SUN RGB-D benchmarks demonstrate the generalizability and effectiveness of our method, which achieves improvements of 12.3% and 7.3% on the two datasets, respectively, over the benchmark alternatives. The code and models are publicly available at https://github.com/YaraDuan/EFECL.
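The two contrast levels named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (the linked repository is the reference for that). It assumes an InfoNCE-style loss at the instance level and a supervised-contrastive-style loss at the category level; the function names, the temperature value, and the pairing of complete and noisy proposals by row index are all hypothetical choices made for illustration.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def instance_contrast(complete, noisy, temperature=0.1):
    """Assumed instance-level loss (InfoNCE-style).

    Row i of `noisy` is treated as the positive for row i of `complete`;
    every other row of `complete` acts as a negative.
    """
    z_c = l2_normalize(np.asarray(complete, dtype=float))
    z_n = l2_normalize(np.asarray(noisy, dtype=float))
    logits = z_n @ z_c.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))


def category_contrast(features, labels, temperature=0.1):
    """Assumed category-level loss (supervised-contrastive-style).

    Proposals sharing a class label are positives for each other;
    all remaining proposals are negatives.
    """
    z = l2_normalize(np.asarray(features, dtype=float))
    labels = np.asarray(labels)
    n = z.shape[0]
    not_self = ~np.eye(n, dtype=bool)
    logits = z @ z.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    exp = np.exp(logits) * not_self                      # drop self-similarity
    log_prob = logits - np.log(exp.sum(axis=1, keepdims=True))
    positives = (labels[:, None] == labels[None, :]) & not_self
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0                                # anchors with a positive
    if not valid.any():
        return 0.0
    per_anchor = -(log_prob * positives).sum(axis=1)[valid] / pos_count[valid]
    return float(per_anchor.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    complete = rng.normal(size=(8, 16))                  # stand-in proposal features
    noisy = complete + 0.1 * rng.normal(size=(8, 16))    # perturbed counterparts
    labels = np.array([0, 0, 1, 1, 2, 2, 3, 3])          # stand-in class labels
    print("instance-level loss:", instance_contrast(complete, noisy))
    print("category-level loss:", category_contrast(noisy, labels))
```

Both losses fall when matching features move closer together in the shared space, which is the alignment effect the abstract attributes to contrastive learning over complete and incomplete/noisy proposals.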
Pages: 875-892
Page count: 18
Related papers
  • [1] EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection
    Yao Duan
    Renjiao Yi
    Yuanming Gao
    Kai Xu
    Chenyang Zhu
    Computational Visual Media, 2023, 9 (4) : 875 - 892
  • [2] MonoFENet: Monocular 3D Object Detection With Feature Enhancement Networks
    Bao, Wentao
    Xu, Bin
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2753 - 2765
  • [3] SimLOG: Simultaneous Local-Global Feature Learning for 3D Object Detection in Indoor Point Clouds
    Wei, Mingqiang
    Chen, Baian
    Nan, Liangliang
    Xie, Haoran
    Gu, Lipeng
    Lu, Dening
    Wang, Fu Lee
    Li, Qing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 19482 - 19495
  • [4] Point-Guided Contrastive Learning for Monocular 3-D Object Detection
    Feng, Dapeng
    Han, Songfang
    Xu, Hang
    Liang, Xiaodan
    Tan, Xiaojun
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 954 - 966
  • [5] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [6] SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection
    Dai, Kun
    Jiang, Zhiqiang
    Xie, Tao
    Wang, Ke
    Liu, Dedong
    Fan, Zhendong
    Li, Ruifeng
    Zhao, Lijun
    Omar, Mohamed
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 637 - 651
  • [7] Foreground Feature Enhancement for Object Detection
    Jiang, Shenwang
    Xu, Tingfa
    Li, Jianan
    Shen, Ziyi
    Guo, Jie
    IEEE ACCESS, 2019, 7 : 49223 - 49231
  • [8] Weighted Unsupervised Learning for 3D Object Detection
    Kowsari, Kamran
    Alassaf, Manal H.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (01) : 584 - 593
  • [9] Feature enhancement and supervised contrastive learning for image splicing forgery detection
    Xu, Yanzhi
    Zheng, Jiangbin
    Fang, Aiqing
    Irfan, Muhammad
    DIGITAL SIGNAL PROCESSING, 2023, 136
  • [10] FETR: Feature Transformer for vehicle-infrastructure cooperative 3D object detection
    Yan, Wenchao
    Cao, Hua
    Chen, Jiazhong
    Wu, Tao
    NEUROCOMPUTING, 2024, 600