EVALUATING CONVNET AND TRANSFORMER BASED SELF-SUPERVISED ALGORITHMS FOR BUILDING ROOF FORM CLASSIFICATION

被引：0

作者：

Mutreja, G. ^{[1
]}

Bittner, K. ^{[1
]}

机构：

[1] German Aerosp Ctr DLR, Remote Sensing Technol Inst, Wessling, Germany

来源：

GEOSPATIAL WEEK 2023, VOL. 48-1 | 2023年

关键词：

Roof-form classification; Self-supervised learning; SimCLR; MoCo; ConvNets; Vision transformers; BYOL; BEiT;

D O I：

10.5194/isprs-archives-XLVIII-1-W2-2023-315-2023

中图分类号：

K85 [文物考古];

学科分类号：

0601 ;

摘要：

This research paper presents a comprehensive evaluation of various self-supervised learning models for building roof type classification. We conduct linear evaluation experiments for the models pretrained on both the ImageNet1K dataset and a custom building roof type dataset to assess the models' performance for the roof type classification task. The results demonstrate the effectiveness of the ViT-based BEiTV2 model, which outperforms other models on both datasets, achieving an accuracy of 96.8% from the model pretrained on ImageNet1K dataset and 92.67% on the model pretrained on building roof type dataset. The class activation maps further validate the strong performance of MoCoV3, BarlowTwins, and DenseCL models. These findings emphasize the potential of self-supervised learning for accurate building roof type classification, with the ViT-based BEiTV2 model showcasing state-of-the-art results.

引用

页码：315 / 321

页数：7

共 26 条

[11]

Gidaris Spyros, 2018, ICLR

[12]

Grill J.B., 2020, P 34 INT C NEUR INF, V33, P21271

[13] Masked Autoencoders Are Scalable Vision Learners [J].

He, Kaiming ;

Chen, Xinlei ;

Xie, Saining ;

Li, Yanghao ;

Dollar, Piotr ;

Girshick, Ross .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :15979-15988

[14] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[15]

He Kaiming, 2019, ARXIV

[16]

Hensel S., 2021, ISPRS-International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, V46W4, P85

[17] Momentum Contrast for Unsupervised Visual Representation Learning [J].

He, Kaiming ;

Fan, Haoqi ;

Wu, Yuxin ;

Xie, Saining ;

Girshick, Ross .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735

[18] Colorization as a Proxy Task for Visual Understanding [J].

Larsson, Gustav ;

Maire, Michael ;

Shakhnarovich, Gregory .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :840-849

[19]

Pathak D., 2016, PROC CVPR IEEE

[20]

Peng Z., 2022, BEiT v2: masked image modeling with vector-quantized visual tokenizers

← 1 2 3 →