Heterogeneous Feature Fusion Module Based on CNN and Transformer for Multiview Stereo Reconstruction

被引:4
|
作者
Gao, Rui [1 ]
Xu, Jiajia [1 ]
Chen, Yipeng [2 ]
Cho, Kyungeun [1 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, 30 Pildongro 1 Gil, Seoul 04620, South Korea
[2] Dongguk Univ Seoul, Dept Autonomous Things Intelligence, 30 Pildongro 1 Gil, Seoul 04620, South Korea
基金
新加坡国家研究基金会;
关键词
multi-view stereo; 3D reconstruction; deep learning; transformer;
D O I
10.3390/math11010112
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates 3D models of a scene using photographs. This study presents an effective MVS network for 3D reconstruction utilizing multiview pictures. Alternative learning-based reconstruction techniques work well, because CNNs (convolutional neural network) can extract only the image's local features; however, they contain many artifacts. Herein, a transformer and CNN are used to extract the global and local features of the image, respectively. Additionally, hierarchical aggregation and heterogeneous interaction modules were used to improve these features. They are based on the transformer and can extract dense features with 3D consistency and global context that are necessary to provide accurate matching for MVS.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Super-resolution Reconstruction of Remote Sensing Image Based on Transformer of Multi-scale Feature Fusion
    Wang, Zhi
    Wang, Kun
    Wang, Meng-Qing
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (08): : 1178 - 1184
  • [22] RGB-INFRARED MULTI-MODAL REMOTE SENSING OBJECT DETECTION USING CNN AND TRANSFORMER BASED FEATURE FUSION
    Tian, Tao
    Cai, Jiang
    Xu, Yang
    Wu, Zebin
    Wei, Zhihui
    Chanussot, Jocelyn
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5728 - 5731
  • [23] Remote Sensing Image Change Detection Based on Lightweight Transformer and Multiscale Feature Fusion
    Li, Jingming
    Zheng, Panpan
    Wang, Liejun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 5460 - 5473
  • [24] A Road Crack Segmentation Method Based on Transformer and Multi-Scale Feature Fusion
    Xu, Yang
    Xia, Yonghua
    Zhao, Quai
    Yang, Kaihua
    Li, Qiang
    ELECTRONICS, 2024, 13 (12)
  • [25] CNN-EFF: CNN Based Edge Feature Fusion in Semantic Image Labelling and Parsing
    Vishal Srivastava
    Bhaskar Biswas
    Neural Processing Letters, 2022, 54 : 1753 - 1781
  • [26] Multi-feature decomposition and transformer-fusion: an infrared and visible image fusion network based on multi-feature decomposition and transformer
    Li, Xujun
    Duan, Zhicheng
    Chang, Jia
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [27] CNN-EFF: CNN Based Edge Feature Fusion in Semantic Image Labelling and Parsing
    Srivastava, Vishal
    Biswas, Bhaskar
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1753 - 1781
  • [28] CTAFFNet: CNN-Transformer Adaptive Feature Fusion Object Detection Algorithm for Complex Traffic Scenarios
    Dong, Xinlong
    Shi, Peicheng
    Liang, Taonian
    Yang, Aixi
    TRANSPORTATION RESEARCH RECORD, 2024, : 1947 - 1965
  • [29] Metal Defect Image Recognition Method Based on Shallow CNN Fusion Transformer
    Tang D.
    Yang Z.
    Cheng H.
    Liu M.
    Zhou L.
    Ding C.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (19): : 2298 - 2305and2316
  • [30] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
    Wang, Hongmei
    Li, Lin
    Li, Chenkai
    Lu, Xuanyu
    IEEE ACCESS, 2023, 11 : 78956 - 78969