Classification of the qilou (arcade building) using a robust image processing framework based on the Faster R-CNN with ResNet50

被引:4
作者
Li, Ming Ho [1 ]
Yu, Yi [2 ]
Wei, Hongni [3 ]
Chan, Ting On [1 ,4 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Geog & Planning, Guangzhou, Peoples R China
[2] East China Normal Univ, Sch Urban & Reg Sci, Shanghai, Peoples R China
[3] Guangdong Univ Foreign Studies, Sch Business, Guangzhou, Peoples R China
[4] Sun Yat Sen Univ, Sch Geog & Planning, Guangdong Prov Key Lab Urbanizat & Geosimulat, Guangzhou, Peoples R China
[5] Sun Yat Sen Univ, Sch Geog & Planning, 135 Xigangxi Rd, Guangzhou, Peoples R China
关键词
Qilou; object detection; Faster R-CNN; CLAHE; classification; CLOSE-RANGE PHOTOGRAMMETRY; STYLE CLASSIFICATION; DOCUMENTATION; TERRESTRIAL;
D O I
10.1080/13467581.2023.2238038
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Qilou (arcade building) is a particular type of Chinese historical architecture combined with western and eastern building elements, which plays a significant role in the history of modern Chinese architecture. However, the recognition and classification of the qilou mainly rely on manual inspection, suppressing the cultural dissemination and protection of qilou relics. In this paper, we present a new framework that adopts multiple image processing algorithms and a deep learning network to automate qilou classification. First, image dataset of the qilou is enhanced based on the Contrast Limited Adaptive Histogram Equalization (CLAHE) algorithm. Then, an improved Faster R-CNN with ResNet50 (Faster R-CNN-R) is deployed for qilou image recognition. A total of 760 images captured in Guangzhou were used for training, validation, and accuracy check of the proposed framework and several contrastive networks under the same conditions. Compared to other networks, the proposed framework works better than Faster R-CNN with VGG16 (Faster R-CNN-V) and FCOS. The accuracy of the proposed framework embedded with the Faster R-CNN-R, Faster R-CNN-V, and FCOS are 80.12%, 65.17%, and 66.35%, respectively. Based on digital images captured under different lighting conditions, the proposed framework can be used to classify nine different types of qilous, with high robustness.
引用
收藏
页码:595 / 612
页数:18
相关论文
共 62 条
[1]  
Abed M.H., 2020, Architectural heritage images classification using deep learning with CNN
[2]   Attention-based generative adversarial network with internal damage segmentation using thermography [J].
Ali, Rahmat ;
Cha, Young-Jin .
AUTOMATION IN CONSTRUCTION, 2022, 141
[3]   Real-time multiple damage mapping using autonomous UAV and deep faster region-based neural networks for GPS-denied structures [J].
Ali, Rahmat ;
Kang, Dongho ;
Suh, Gahyun ;
Cha, Young-Jin .
AUTOMATION IN CONSTRUCTION, 2021, 130
[4]  
Anderson Christy., 2013, Renaissance Architecture
[5]  
Babinovich N. U., 2021, Journal of Physics: Conference Series, V1926, DOI 10.1088/1742-6596/1926/1/012042
[6]   Deep learning-based automatic volumetric damage quantification using depth camera [J].
Beckman, Gustavo H. ;
Polyzois, Dimos ;
Cha, Young-Jin .
AUTOMATION IN CONSTRUCTION, 2019, 99 :114-124
[7]  
Castleman K. R., 1996, Digital image processing
[8]   Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types [J].
Cha, Young-Jin ;
Choi, Wooram ;
Suh, Gahyun ;
Mahmoudkhani, Sadegh ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2018, 33 (09) :731-747
[9]   Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].
Cha, Young-Jin ;
Choi, Wooram ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) :361-378
[10]   Haikou QiLou historical and artistic value [J].
Chen, Jing ;
Wang, Fang .
SUSTAINABLE DEVELOPMENT AND ENVIRONMENT II, PTS 1 AND 2, 2013, 409-410 :404-+