Effectiveness of Image Augmentation Techniques on Detection of Building Characteristics from Street View Images Using Deep Learning

Cited by: 16

Authors:

Han, Jongwon [1]

Kim, Jaejun [1]

Kim, Seongkyung [1]

Wang, Seunghyeon [1]

Affiliation:

[1] Hanyang Univ, Dept Architectural Engn, Seoul 133791, South Korea

Source:

JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT | 2024, Vol. 150, No. 10

Keywords:

Building characteristics; Urban analysis; Number of stories; Building typologies; Street view images; Image augmentation; Image processing; Deep learning; DAMAGE DETECTION

DOI:

10.1061/JCEMD4.COENG-15075

Chinese Library Classification:

TU [Building Science]

Discipline classification code:

0813

Abstract:

Two key building characteristics, namely the number of stories and typology, are vital across various domains, such as construction management and architectural design. These characteristics are particularly critical for disaster risk assessment and infrastructure planning. Although deep learning models are adept at extracting this information from street view images (SVIs), their success is contingent upon the availability of large, diverse, and highly accurate data sets. Image augmentation presents an alternative method to artificially broaden data set diversity. However, the impact of image augmentation techniques on identifying building stories and typologies from SVIs has not been adequately explored. This study proposes a methodology employing eight image augmentation techniques: brightness, contrast, perspective, rotation, scale, shearing, and translation augmentations, together with a combined approach using all of these methods. The study evaluates the efficacy of models trained with these techniques by comparing the accuracy of different classes and architectures for each task, both with and without the application of augmentation. The findings revealed that while most augmentation methods enhance model accuracy, their effectiveness is task-dependent. Furthermore, the most effective augmentation techniques differ among building classes and architectures within each task. This suggests that augmentation strategies need to be custom-designed to align with the unique features of each class and architecture for precise estimation of the number of stories and building typologies. While the focus of this research is on specific tasks, the evaluated augmentation techniques could also extend to related areas, such as ascertaining the age of buildings or identifying window types. In this study, the efficacy of augmentation techniques is explored within the framework of identifying the number of stories and building typologies.
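For orientation, the seven individual augmentations named in the abstract can be sketched in plain Python. This is only an illustrative sketch, not the authors' implementation: the abstract does not state parameter ranges, libraries, or the order of operations, so every function name and parameter below is hypothetical. Photometric changes (brightness, contrast) act on pixel values of a small grayscale image stored as a list of rows, while geometric changes (perspective, rotation, scale, shearing, translation) are written as 3x3 homogeneous matrices acting on pixel coordinates.

```python
import math

# Hypothetical helper names; parameters are illustrative, not taken from the paper.

def adjust_brightness(img, delta):
    """Add delta to every pixel of a grayscale image, clamped to [0, 255]."""
    return [[min(255, max(0, p + delta)) for p in row] for row in img]

def adjust_contrast(img, factor):
    """Scale each pixel's distance from the image mean by factor, clamped to [0, 255]."""
    flat = [p for row in img for p in row]
    mean = sum(flat) / len(flat)
    return [[min(255, max(0, round(mean + factor * (p - mean)))) for p in row]
            for row in img]

def rotation_matrix(degrees):
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def scale_matrix(s):
    return [[s, 0.0, 0.0], [0.0, s, 0.0], [0.0, 0.0, 1.0]]

def shear_matrix(k):
    """Horizontal shear: x is shifted in proportion to y."""
    return [[1.0, k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def translation_matrix(tx, ty):
    return [[1.0, 0.0, tx], [0.0, 1.0, ty], [0.0, 0.0, 1.0]]

def perspective_matrix(px, py):
    """Projective transform: the bottom row makes the divisor depend on x and y."""
    return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [px, py, 1.0]]

def _matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def compose(*mats):
    """Combine transforms so that they are applied in the order given."""
    result = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    for m in mats:
        result = _matmul(m, result)
    return result

def apply(m, x, y):
    """Map the point (x, y) through a 3x3 homogeneous transform."""
    xh = m[0][0] * x + m[0][1] * y + m[0][2]
    yh = m[1][0] * x + m[1][1] * y + m[1][2]
    w = m[2][0] * x + m[2][1] * y + m[2][2]
    return (xh / w, yh / w)
```

Under these assumptions, a combined setting would correspond to chaining the geometric transforms with `compose(...)` and then applying the photometric functions to the resampled image. In practice an image library would also handle interpolation and random parameter sampling, which this sketch leaves out.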
The models were assessed for average accuracy and class-specific accuracy across various architectures, comparing outcomes with and without the implementation of the proposed augmentation methods. A key finding is that the most effective augmentation method varies between architectures and individual classes. Contrary to common practice in deep learning, where applying multiple augmentation techniques is standard for accuracy enhancement, this study observed that such a strategy did not uniformly improve performance. Specifically, while combining augmentation methods generally resulted in higher average accuracy, this was not the case for some classes within MobileNetV3 when detecting the number of stories. Similarly, for ResNet-152, employing all augmentation techniques together led to the lowest accuracy in certain classes for building typology classification. These results indicate that augmentation strategies may require customization to cater to the distinct characteristics of each class and architecture for accurate estimation of the number of stories and building typologies.
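The distinction between average and class-specific accuracy drawn above can be illustrated with a short sketch. It assumes that "average accuracy" means overall accuracy across all test images (the abstract does not define it), and the labels, predictions, and setting names are toy values invented for illustration, not results from the study.

```python
from collections import defaultdict

def per_class_accuracy(y_true, y_pred):
    """Fraction of correctly predicted samples within each true class."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    return {c: correct[c] / total[c] for c in total}

def overall_accuracy(y_true, y_pred):
    """Fraction of correctly predicted samples over the whole test set."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def best_setting_per_class(y_true, preds_by_setting):
    """For each class, name the augmentation setting with the highest class accuracy."""
    acc = {name: per_class_accuracy(y_true, preds)
           for name, preds in preds_by_setting.items()}
    return {c: max(acc, key=lambda name: acc[name][c])
            for c in sorted(set(y_true))}

# Toy data only (hypothetical, not from the paper).
y_true = ["1-story", "1-story", "2-story", "2-story"]
preds_by_setting = {
    "none":     ["1-story", "2-story", "2-story", "1-story"],
    "rotation": ["1-story", "1-story", "2-story", "1-story"],
    "combined": ["1-story", "2-story", "2-story", "2-story"],
}
best = best_setting_per_class(y_true, preds_by_setting)
```

In this toy case, "rotation" and "combined" reach the same overall accuracy, yet each is the best setting for a different class. That is the kind of divergence between average and class-level results that the abstract describes.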


Pages: 18
