A convolutional neural network approach to classifying urban spaces using generative tools for data augmentation

被引:2
作者
Medel-Vera, Carlos [1 ,5 ]
Vidal-Estevez, Pelayo [2 ]
Madler, Thomas [3 ,4 ]
机构
[1] Univ Liverpool, Sch Architecture, Liverpool, England
[2] Univ Diego Portales, Fac Engn & Sci, Sch Civil Engn, Santiago, Chile
[3] Univ Diego Portales, Fac Engn & Sci, Sch Civil Engn, Santiago, Chile
[4] Univ Diego Portales, Inst Astrophys Studies, Fac Engn & Sci, Santiago, Chile
[5] Univ Liverpool, Sch Architecture, 25 Abercromby Sq, Liverpool L697ZG, England
关键词
Urban categories; classification; machine learning; deep learning; diffusion models; neural architecture; CONTEMPORARY PUBLIC SPACE; RESIDUAL NETWORK; CLASSIFICATION; INCEPTION; RESNET;
D O I
10.1177/14780771231225697
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
This article discusses an application for classifying urban spaces using convolutional neural networks (CNNs). A seed dataset was initially generated composed of 630 photographs of urban spaces from the Adobe Stock repository. This dataset was topped up with images produced by two generative artificial intelligence (AI) engines, namely, Deep Dream Generator and Midjourney, making two additional augmented datasets, each composed of 2200 images. The training process was carried out using four well-known CNNs, namely, GoogLeNet, ResNet-18, ShuffleNet, and MobileNet-v2. The results show an increase of roughly 30% in the predicting capabilities in both augmented datasets when compared to the seed dataset. Furthermore, performance metrics are generally higher when using ResNet-18 which may suggest that this CNN architecture is more applicable to urban classification projects. Finally, although both generative AI engines have similar performance, Midjourney seems to slightly outperform Deep Dream Generator as a data augmentation engine for urban spaces.
引用
收藏
页码:392 / 411
页数:20
相关论文
共 71 条
[1]   Leveraging ShuffleNet transfer learning to enhance handwritten character recognition [J].
Abu Al-Haija, Qasem .
GENE EXPRESSION PATTERNS, 2022, 45
[2]  
Adobe stock, About us
[3]  
Aggarwal A., 2021, INT J INF MANAG DATA, V1
[4]  
Ali A., 2013, Int. J. Adv. Soft Comput. Appl., V5, P176
[5]   Classification of Urban Spaces: An Attempt to Classify Al-Baha City Urban Spaces Using Carmona's Classification [J].
Alzahrani, Abdulaziz .
SAGE OPEN, 2022, 12 (02)
[6]  
[Anonymous], ARXIV181010863
[7]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[8]  
Carmona M.Wunderlich., 2012, Capital Spaces: The Multiple Complex Public Spaces of a Global City
[9]   Contemporary Public Space, Part Two: Classification [J].
Carmona, Matthew .
JOURNAL OF URBAN DESIGN, 2010, 15 (02) :157-173
[10]   Contemporary Public Space: Critique and Classification, Part One: Critique [J].
Carmona, Matthew .
JOURNAL OF URBAN DESIGN, 2010, 15 (01) :123-148