Heterogeneous Feature Fusion Module Based on CNN and Transformer for Multiview Stereo Reconstruction

被引:4
|
作者
Gao, Rui [1 ]
Xu, Jiajia [1 ]
Chen, Yipeng [2 ]
Cho, Kyungeun [1 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea
[2] Dongguk Univ Seoul, Dept Autonomous Things Intelligence, 30 Pildongro 1 Gil, Seoul 04620, South Korea
基金
新加坡国家研究基金会;
关键词
multi-view stereo; 3D reconstruction; deep learning; transformer;
D O I
10.3390/math11010112
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates 3D models of a scene using photographs. This study presents an effective MVS network for 3D reconstruction utilizing multiview pictures. Alternative learning-based reconstruction techniques work well, because CNNs (convolutional neural network) can extract only the image's local features; however, they contain many artifacts. Herein, a transformer and CNN are used to extract the global and local features of the image, respectively. Additionally, hierarchical aggregation and heterogeneous interaction modules were used to improve these features. They are based on the transformer and can extract dense features with 3D consistency and global context that are necessary to provide accurate matching for MVS.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A CNN-Based Fusion Method for Feature Extraction from Sentinel Data
    Scarpa, Giuseppe
    Gargiulo, Massimiliano
    Mazza, Antonio
    Gaetano, Raffaele
    REMOTE SENSING, 2018, 10 (02)
  • [42] Uniting Stereo and Depth-from-Defocus: A Thin Lens-based Variational Framework for Multiview Reconstruction
    Friedlander, Robert D.
    Yang, Huizong
    Yezzi, Anthony J.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 4401 - 4410
  • [43] MF-CNN: a New Approach for LDoS Attack Detection Based on Multi-feature Fusion and CNN
    Tang, Dan
    Tang, Liu
    Shi, Wei
    Zhan, Sijia
    Yang, Qiuwei
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (04) : 1705 - 1722
  • [44] EVPTMFF: Bridge Crack Detection Based on Efficient Visual Pyramid Transformer and Multiple-Feature Fusion
    Li, Gang
    Zhou, Pan
    Shen, Dan
    Zhao, Shanmeng
    JOURNAL OF PERFORMANCE OF CONSTRUCTED FACILITIES, 2024, 38 (04)
  • [45] MF-CNN: a New Approach for LDoS Attack Detection Based on Multi-feature Fusion and CNN
    Dan Tang
    Liu Tang
    Wei Shi
    Sijia Zhan
    Qiuwei Yang
    Mobile Networks and Applications, 2021, 26 : 1705 - 1722
  • [46] Single-Image Superresolution for RGB Remote Sensing Imagery via Multiscale CNN-Transformer Feature Fusion
    Yao, Xudong
    Zhang, Haopeng
    Wen, Sizhe
    Shi, Zhenwei
    Jiang, Zhiguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 1302 - 1316
  • [47] Multi-scale feature fusion for pavement crack detection based on Transformer
    Yang, Yalong
    Niu, Zhen
    Su, Liangliang
    Xu, Wenjing
    Wang, Yuanhang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 14920 - 14937
  • [48] Underwater image enhancement algorithm based on dynamic heterogeneous feature fusion
    Han X.-W.
    Zhang Y.-Z.
    Xie Y.-H.
    Wu B.-J.
    Zhao Y.-Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (06): : 1560 - 1568
  • [49] MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings
    Madan, Surbhi
    Jain, Rishabh
    Sharma, Gulshan
    Subramanian, Ramanathan
    Dhall, Abhinav
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9526 - 9530
  • [50] A Transformer-Based Network With Feature Complementary Fusion for Crack Defect Detection
    Ma, Mingyang
    Yang, Lei
    Liu, Yanhong
    Yu, Hongnian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16989 - 17006