Bilateral transformer 3D planar recovery

被引：0

作者：

Ren, Fei ^{[2
]}

Liao, Chunhua ^{[1
]}

Xie, Zhina ^{[1
]}

机构：

[1] Jiangmen Cent Hosp, Jiangmen 550025, Guangdong, Peoples R China

[2] Chinasoft Int Ltd, Shenzhen 518129, Peoples R China

来源：

GRAPHICAL MODELS | 2024年 / 134卷

关键词：

Deep learning; 3D planar recovery; Planar segmentation; Bilateral networks;

D O I：

10.1016/j.gmod.2024.101221

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In recent years, deep learning based methods for single image 3D planar recovery have made significant progress, but most of the research has focused on overall plane segmentation performance rather than the accuracy of small scale plane segmentation. In order to solve the problem of feature loss in the feature extraction process of small target object features, a three dimensional planar recovery method based on bilateral transformer was proposed. The two sided network branches capture rich small object target features through different scale sampling, and are used for detecting planar and non-planar regions respectively. In addition, the loss of variational information is used to share the parameters of the bilateral network, which achieves the output consistency of the bilateral network and alleviates the problem of feature loss of small target objects. The method is verified on Scannet and Nyu V2 datasets, and a variety of evaluation indexes are superior to the current popular algorithms, proving the effectiveness of the method in three dimensional planar recovery.

引用

页数：9

共 50 条

[21] Transformer-based 3D Human pose estimation and action achievement evaluation
Yang, Aolei
Zhou, Yinghong
Yang, Banghua
Xu, Yulin
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 136 - 144
[22] A Fusion Deep Learning Model of ResNet and Vision Transformer for 3D CT Images
Liu, Chiyu
Sun, Cunjie
IEEE ACCESS, 2024, 12 : 93389 - 93397
[23] A Hybrid Transformer-LSTM Model With 3D Separable Convolution for Video Prediction
Mathai, Mareeta
Liu, Ying
Ling, Nam
IEEE ACCESS, 2024, 12 : 39589 - 39602
[24] Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection
Kim, Taeho
Kim, Joohee
SENSORS, 2023, 23 (16)
[25] 3D ResNets for 3D Object Classification
Ioannidou, Anastasia
Chatzilari, Elisavet
Nikolopoulos, Spiros
Kompatsiaris, Ioannis
MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 495 - 506
[26] Pavement Point Cloud Upsampling Based on Transformer: Toward Enhancing 3D Pavement Data
Bu, Tianxiang
Zhu, Junqing
Ma, Tao
Jiang, Shun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 21647 - 21657
[27] Automatic Vertebral Rotation Angle Measurement of 3D Vertebrae Based on an Improved Transformer Network
Huo, Xing
Li, Hao
Shao, Kun
ENTROPY, 2024, 26 (02)
[28] Deep learning for 3D human pose estimation and mesh recovery: A survey
Liu, Yang
Qiu, Changzhen
Zhang, Zhiyong
NEUROCOMPUTING, 2024, 596
[29] Group-in-Group Relation-Based Transformer for 3D Point Cloud Learning
Liu, Shaolei
Fu, Kexue
Wang, Manning
Song, Zhijian
REMOTE SENSING, 2022, 14 (07)
[30] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
Yu, Bin
Zhou, Quan
Zhang, Xuming
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387

← 1 2 3 4 5 →