Bilateral transformer 3D planar recovery

被引:0
|
作者
Ren, Fei [2 ]
Liao, Chunhua [1 ]
Xie, Zhina [1 ]
机构
[1] Jiangmen Cent Hosp, Jiangmen 550025, Guangdong, Peoples R China
[2] Chinasoft Int Ltd, Shenzhen 518129, Peoples R China
关键词
Deep learning; 3D planar recovery; Planar segmentation; Bilateral networks;
D O I
10.1016/j.gmod.2024.101221
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, deep learning based methods for single image 3D planar recovery have made significant progress, but most of the research has focused on overall plane segmentation performance rather than the accuracy of small scale plane segmentation. In order to solve the problem of feature loss in the feature extraction process of small target object features, a three dimensional planar recovery method based on bilateral transformer was proposed. The two sided network branches capture rich small object target features through different scale sampling, and are used for detecting planar and non-planar regions respectively. In addition, the loss of variational information is used to share the parameters of the bilateral network, which achieves the output consistency of the bilateral network and alleviates the problem of feature loss of small target objects. The method is verified on Scannet and Nyu V2 datasets, and a variety of evaluation indexes are superior to the current popular algorithms, proving the effectiveness of the method in three dimensional planar recovery.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Transformer-based 3D Human pose estimation and action achievement evaluation
    Yang, Aolei
    Zhou, Yinghong
    Yang, Banghua
    Xu, Yulin
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 136 - 144
  • [22] A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images
    Liu, Chiyu
    Sun, Cunjie
    IEEE ACCESS, 2024, 12 : 93389 - 93397
  • [23] A Hybrid Transformer-LSTM Model With 3D Separable Convolution for Video Prediction
    Mathai, Mareeta
    Liu, Ying
    Ling, Nam
    IEEE ACCESS, 2024, 12 : 39589 - 39602
  • [24] Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection
    Kim, Taeho
    Kim, Joohee
    SENSORS, 2023, 23 (16)
  • [25] 3D ResNets for 3D Object Classification
    Ioannidou, Anastasia
    Chatzilari, Elisavet
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 495 - 506
  • [26] Pavement Point Cloud Upsampling Based on Transformer: Toward Enhancing 3D Pavement Data
    Bu, Tianxiang
    Zhu, Junqing
    Ma, Tao
    Jiang, Shun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 21647 - 21657
  • [27] Automatic Vertebral Rotation Angle Measurement of 3D Vertebrae Based on an Improved Transformer Network
    Huo, Xing
    Li, Hao
    Shao, Kun
    ENTROPY, 2024, 26 (02)
  • [28] Deep learning for 3D human pose estimation and mesh recovery: A survey
    Liu, Yang
    Qiu, Changzhen
    Zhang, Zhiyong
    NEUROCOMPUTING, 2024, 596
  • [29] Group-in-Group Relation-Based Transformer for 3D Point Cloud Learning
    Liu, Shaolei
    Fu, Kexue
    Wang, Manning
    Song, Zhijian
    REMOTE SENSING, 2022, 14 (07)
  • [30] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387