Pre-clustering active learning method for automatic classification of building structures in urban areas

被引:1
作者
Zhou P. [1 ]
Zhang T. [2 ]
Zhao L. [3 ]
Qi Y. [1 ]
Chang Y. [1 ]
Bai L. [4 ,5 ]
机构
[1] School of Management Science and Engineering, Central University of Finance and Economics, Beijing
[2] Postal Savings Bank of China Co., Ltd., Beijing
[3] Industry Internet Operation Center, China United Network Communications Corporation Beijing Branch, Beijing
[4] School of Artificial Intelligence, Beijing Normal Universit, Beijing
[5] School of Information, Central University of Finance and Economics, Beijing
基金
中国国家自然科学基金;
关键词
Active learning; Automated classification; Machine learning; Pre-clustering; Urban building structures;
D O I
10.1016/j.engappai.2023.106382
中图分类号
学科分类号
摘要
Identifying the structures of buildings in urban areas is a prerequisite for robust urban planning and regeneration. Owing to the diverse structural designs of urban buildings, automated approaches are required to classify building structures. Supervised machine learning is usually employed to classify various building characteristics. However, this approach requires significant labeling effort. Therefore, this paper proposes a new pre-clustering active learning method for building structure classification. The proposed method captures the statistical characteristics of samples and enhances the recognition of the most valuable training samples, thereby substantially reducing the labeling workload and improving the efficiency and effectiveness of classification. This method was tested via the classification of 3718 buildings in Beijing, China, into five common structures. The results showed that the proposed method could reduce labeling effort by 60% while achieving a promising 90% F1 score for overall classification performance, thus indicating its effectiveness. © 2023 Elsevier Ltd
引用
收藏
相关论文
共 64 条
  • [41] Ministry of Housing and Urban-Rural Development of the People's Republic of China, General code for seismic precaution of buildings and municipal engineering GB 55002-2021, (2021)
  • [42] Mullner D., Modern hierarchical, agglomerative clustering algorithms, (2011)
  • [43] Ng A.Y., Jordan M.I., Weiss Y., On spectral clustering: analysis and an algorithm, Proceedings of the Advances in Neural Information Processing Systems, pp. 849-856, (2001)
  • [44] Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., Vanderplas J., Passos A., Cournapeau D., Brucher M., Perrot M., Duchesnay E., Scikit-learn: machine learning in python, J. Mach. Learn. Res., 12, 85, pp. 2825-2830, (2011)
  • [45] Pomberger A., McCarthy A.P., Khan A., Sung S., Taylor C.J., Gaunt M.J., Colwell L., Walz D., Lapkin A.A., The effect of chemical representation on active machine learning towards closed-loop optimization, React. Chem. Eng., 7, 6, pp. 1368-1379, (2022)
  • [46] Ramirez-Loaiza M.E., Sharma M., Kumar G., Bilgic M., Active learning: an empirical study of common baselines, Data Min. Knowl. Discov., 31, 2, pp. 287-313, (2017)
  • [47] Ren P.Z., Xiao Y., Chang X.J., Huang P.Y., Li Z.H., Gupta B.B., Chen X.J., Wang X., A survey of deep active learning, ACM Comput. Surv., 54, 9, pp. 1-40, (2021)
  • [48] Rosser J.F., Boyd D.S., Long G., Zakhary S., Mao Y., Robinson D., Predicting residential building age from map data, Comput. Environ. Urban Syst., 73, pp. 163-222, (2019)
  • [49] Schubert E., Sander J., Ester M., Kriegel H.P., Xu X.W., DBSCAN revisited, revisited: why and how you should (still) use DBSCAN, ACM Trans. Database Syst., 42, 3, pp. 1-21, (2017)
  • [50] Sebastiani F., Machine learning in automated text categorization, ACM Comput. Surv., 34, 1, pp. 1-47, (2002)