EVALUATING CONVNET AND TRANSFORMER BASED SELF-SUPERVISED ALGORITHMS FOR BUILDING ROOF FORM CLASSIFICATION

Times cited: 0
Authors
Mutreja, G. [1]
Bittner, K. [1]
Affiliations
[1] German Aerosp Ctr DLR, Remote Sensing Technol Inst, Wessling, Germany
Source
GEOSPATIAL WEEK 2023, VOL. 48-1 | 2023
Keywords
Roof-form classification; Self-supervised learning; SimCLR; MoCo; ConvNets; Vision transformers; BYOL; BEiT;
DOI
10.5194/isprs-archives-XLVIII-1-W2-2023-315-2023
Chinese Library Classification number
K85 [Cultural relics and archaeology];
Discipline classification code
0601;
Abstract
This paper presents a comprehensive evaluation of self-supervised learning models for building roof type classification. We conduct linear evaluation experiments on models pretrained on the ImageNet1K dataset and on a custom building roof type dataset to assess their performance on the roof type classification task. The results demonstrate the effectiveness of the ViT-based BEiTV2 model, which outperforms the other models with both pretraining datasets, achieving an accuracy of 96.8% when pretrained on ImageNet1K and 92.67% when pretrained on the building roof type dataset. Class activation maps further validate the strong performance of the MoCoV3, BarlowTwins, and DenseCL models. These findings emphasize the potential of self-supervised learning for accurate building roof type classification, with the ViT-based BEiTV2 model achieving state-of-the-art results.
Pages: 315-321
Number of pages: 7