Semi-supervised learning approach for construction object detection by integrating super-resolution and mean teacher network

被引:0
作者
Zhang, Wen-Jie [1 ]
Wan, Hua-Ping [1 ]
Hu, Peng-Hua [1 ]
Ge, Hui-Bin [1 ]
Luo, Yaozhi [1 ]
Todd, Michael D. [2 ]
机构
[1] Zhejiang Univ, Coll Civil Engn & Architecture, Hangzhou 310058, Peoples R China
[2] Univ Calif San Diego, Dept Struct Engn, 9500 Gilman Dr 0085, La Jolla, CA 92093 USA
来源
JOURNAL OF INFRASTRUCTURE INTELLIGENCE AND RESILIENCE | 2024年 / 3卷 / 04期
关键词
Construction object detection; Deep learning; Mean teacher network; Super-resolution; Semi-supervised learning; CHALLENGES;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep learning-based object detection methods are utilized for safety management at construction sites, which require large-scale, high-quality, and well-labeled datasets for training. The existing construction datasets are relatively small due to the high expense of labor-intensive annotation, and the varying quality of the construction images also affects the detection performance of the model. To address the limitations of datasets, this study proposes a new method for construction object detection by integrating super-resolution and semi-supervised learning. The proposed method improves the quality of construction images and achieves excellent detection performance with limited labeled data. First, the Real-ESRGAN model is introduced to improve the quality of construction images and make the construction objects visible. The proposed super-resolution method can enhance the texture details of low-resolution images, hence improving the performance of object detection models. Second, the mean-teacher network is adopted to expand the training set, thus avoiding the laborintensive annotation work. To verify the effectiveness of the proposed method, the method is applied to the state-of-the-art Yolov5 object detection model, and construction images from the Site Object Detection Dataset (SODA) with different labeled data proportions (from 10% to 50% in 10% intervals with an extreme case of 5%) are used as the training set. By comparing with the existing supervised learning method, it is shown that the proposed method can achieve better detection performance. In particular, the method is more effective in enhancing detection performance when the proportion of the labeled data is smaller, which is of great practical value in real-world engineering. The experimental results show the potential of the proposed method in improving image quality and reducing the expense of developing construction datasets.
引用
收藏
页数:12
相关论文
共 41 条
[1]   Reference synthetic-dataset for novelty detection in oil production data: A perceptive evaluation along with case studies from 51 oilfields [J].
Abdelaziem, Osama Elsayed ;
Gawish, Ahmed Ahmed ;
Farrag, Sayed Fadel .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
[2]  
Arayici Y, 2012, J INF TECHNOL CONSTR, V17, P75
[3]   Using Context-Guided data Augmentation, lightweight CNN, and proximity detection techniques to improve site safety monitoring under occlusion conditions [J].
Chen, Haosen ;
Hou, Lei ;
Zhang, Guomin ;
Wu, Shaoze .
SAFETY SCIENCE, 2023, 158
[4]   Dynamic identification of crane load fall zone: A computer vision approach [J].
Chian, Eugene Yan Tao ;
Goh, Yang Miang ;
Tian, Jing ;
Guo, Brian H. W. .
SAFETY SCIENCE, 2022, 156
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]   A lightweight vehicles detection network model based on YOLOv5 [J].
Dong, Xudong ;
Yan, Shuai ;
Duan, Chaoqun .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 113
[7]   SODA: A large-scale open site object detection dataset for deep learning in construction [J].
Duan, Rui ;
Deng, Hui ;
Tian, Mao ;
Deng, Yichuan ;
Lin, Jiarui .
AUTOMATION IN CONSTRUCTION, 2022, 142
[8]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[9]   Balanced semisupervised generative adversarial network for damage assessment from low-data imbalanced-class regime [J].
Gao, Yuqing ;
Zhai, Pengyuan ;
Mosalam, Khalid M. .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2021, 36 (09) :1094-1113
[10]  
Guanhao Yang, 2020, 2020 IEEE 6th International Conference on Computer and Communications (ICCC), P1398, DOI 10.1109/ICCC51575.2020.9345042