HEAT: Holistic Edge Attention Transformer for Structured Reconstruction

Cited by: 16
Authors
Chen, Jiacheng [1]
Qian, Yiming [2]
Furukawa, Yasutaka [1]
Affiliations
[1] Simon Fraser Univ, Burnaby, BC, Canada
[2] Univ Manitoba, Winnipeg, MB, Canada
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022
Funding
Natural Sciences and Engineering Research Council of Canada
DOI
10.1109/CVPR52688.2022.00384
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel attention-based neural network for structured reconstruction, which takes a 2D raster image as input and reconstructs a planar graph depicting the underlying geometric structure. The approach detects corners and classifies edge candidates between corners in an end-to-end manner. Our contribution is a holistic edge classification architecture, which 1) initializes the feature of an edge candidate by a trigonometric positional encoding of its end-points; 2) fuses image features to each edge candidate by deformable attention; 3) employs two weight-sharing Transformer decoders to learn holistic structural patterns over the graph edge candidates; and 4) is trained with a masked learning strategy. The corner detector is a variant of the edge classification architecture, adapted to operate on pixels as corner candidates. We conduct experiments on two structured reconstruction tasks: outdoor building architecture and indoor floorplan planar graph reconstruction. Extensive qualitative and quantitative evaluations demonstrate the superiority of our approach over the state of the art. Code and pre-trained models are available at https://heat-structured-reconstruction.github.io/
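As a rough illustration of step 1 in the abstract, the sketch below builds an initial feature for every edge candidate from a sinusoidal (trigonometric) encoding of its two end-points. It is not taken from the authors' released code: the function names, the feature width `dim`, the `temperature` constant, and the choice to concatenate the four coordinate encodings are all assumptions made for this example.

```python
import math
from itertools import combinations


def sinusoidal_encoding(value, dim=8, temperature=10000.0):
    """Encode one scalar coordinate as interleaved sin/cos features.

    `dim` and `temperature` are illustrative defaults, not values from the paper.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    features = []
    for i in range(dim // 2):
        # Frequencies decay geometrically, as in standard Transformer encodings.
        freq = 1.0 / (temperature ** (2 * i / dim))
        features.append(math.sin(value * freq))
        features.append(math.cos(value * freq))
    return features


def edge_feature(corner_a, corner_b, dim=8):
    """Concatenate the encodings of x1, y1, x2, y2 (an assumed layout)."""
    feature = []
    for coord in (*corner_a, *corner_b):
        feature.extend(sinusoidal_encoding(coord, dim))
    return feature


def edge_candidates(corners, dim=8):
    """Treat every unordered pair of detected corners as an edge candidate."""
    return {
        (i, j): edge_feature(corners[i], corners[j], dim)
        for i, j in combinations(range(len(corners)), 2)
    }


if __name__ == "__main__":
    corners = [(0, 0), (10, 0), (10, 5)]  # hypothetical corner detections
    candidates = edge_candidates(corners)
    print(len(candidates), "edge candidates,", len(candidates[(0, 1)]), "dims each")
```

In the pipeline the abstract describes, features like these would then be fused with image features and passed to the Transformer decoders; those stages are omitted here.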
Pages: 3856-3865
Number of pages: 10