Bilateral transformer 3D planar recovery

被引:0
|
作者
Ren, Fei [2 ]
Liao, Chunhua [1 ]
Xie, Zhina [1 ]
机构
[1] Jiangmen Cent Hosp, Jiangmen 550025, Guangdong, Peoples R China
[2] Chinasoft Int Ltd, Shenzhen 518129, Peoples R China
关键词
Deep learning; 3D planar recovery; Planar segmentation; Bilateral networks;
D O I
10.1016/j.gmod.2024.101221
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, deep learning based methods for single image 3D planar recovery have made significant progress, but most of the research has focused on overall plane segmentation performance rather than the accuracy of small scale plane segmentation. In order to solve the problem of feature loss in the feature extraction process of small target object features, a three dimensional planar recovery method based on bilateral transformer was proposed. The two sided network branches capture rich small object target features through different scale sampling, and are used for detecting planar and non-planar regions respectively. In addition, the loss of variational information is used to share the parameters of the bilateral network, which achieves the output consistency of the bilateral network and alleviates the problem of feature loss of small target objects. The method is verified on Scannet and Nyu V2 datasets, and a variety of evaluation indexes are superior to the current popular algorithms, proving the effectiveness of the method in three dimensional planar recovery.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Multi-scale Transformer 3D Plane Recovery
    Ren, Fei
    Chang, Qingling
    Liu, Xinglin
    Cui, Yan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [2] 3D Medical Axial Transformer: A Lightweight Transformer Model for 3D Brain Tumor Segmentation
    Liu, Cheng
    Kiryu, Hisanori
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 799 - 813
  • [3] Position Encoding for 3D Lane Detection via Perspective Transformer
    Zhang, Meng Li
    Wang, Ming Wei
    Deng, Yan Yang
    Lei, Xin Yu
    IEEE ACCESS, 2024, 12 : 106480 - 106487
  • [4] FATUnetr:fully attention Transformer for 3D medical image segmentation
    Li, QingFeng
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1415 - 1419
  • [5] 3D point cloud object detection algorithm based on Transformer
    Liu M.
    Yang Q.
    Hu G.
    Guo Y.
    Zhang J.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2023, 41 (06): : 1190 - 1197
  • [6] Applying Attention Transformer Module to 3D Lip Sequence Identification
    Pian, Xinyang
    Wang, Yu
    Zhang, Jie
    Computer Engineering and Applications, 2024, 60 (07) : 141 - 146
  • [7] Graph Transformer for 3D point clouds classification and semantic segmentation
    Zhou, Wei
    Wang, Qian
    Jin, Weiwei
    Shi, Xinzhe
    He, Ying
    COMPUTERS & GRAPHICS-UK, 2024, 124
  • [8] AIFormer: Adaptive Interaction Transformer for 3D Point Cloud Understanding
    Chu, Xutao
    Zhao, Shengjie
    Dai, Hongwei
    REMOTE SENSING, 2024, 16 (21)
  • [9] Automatic Scan Registration Using 3D Linear and Planar Features
    Yao, Jian
    Ruggeri, Mauro R.
    Taddei, Pierluigi
    Sequeira, Vitor
    3D RESEARCH, 2010, 1 (03) : 1 - 18
  • [10] PLANAR SEGMENTATION BASED ON PLANE FIT AND QUATERNION FOR 3D REGISTRATION
    Wang, Hongke
    2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 2, 2012, : 341 - 345