Deep Convolutional Networks for Construction Object Detection Under Different Visual Conditions

被引:41
作者
Nath, Nipun D. [1 ]
Behzadan, Amir H. [2 ]
机构
[1] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX USA
[2] Texas A&M Univ, Dept Construct Sci, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
visual recognition; deep learning; object detection; computer vision; content retrieval; NEURAL-NETWORKS; RECOGNITION; RESOURCES; CLASSIFICATION; RECONSTRUCTION; ANNOTATION; PROGRESS; KINECT; MODEL;
D O I
10.3389/fbuil.2020.00097
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Sensing and reality capture devices are widely used in construction sites. Among different technologies, vision-based sensors are by far the most common and ubiquitous. A large volume of images and videos is collected from construction projects every day to track work progress, measure productivity, litigate claims, and monitor safety compliance. Manual interpretation of such colossal amounts of data, however, is non-trivial, error-prone, and resource-intensive. This has motivated new research on soft computing methods that utilize high-power data processing, computer vision, and deep learning (DL) in the form of convolutional neural networks (CNNs). A fundamental step toward machine-driven interpretation of construction site scenery is to accurately identify objects of interest for a particular problem. The accuracy requirement, however, may offset the computational speed of the candidate method. While lightweight DL algorithms (e.g., Mask R-CNN) can perform visual recognition with relatively high accuracy, they suffer from low processing efficacy, which hinders their use in real-time decision-making. One of the most promising DL algorithms that balance speed and accuracy is YOLO (you-only-look-once). This paper investigates YOLO-based CNN models in fast detection of construction objects. First, a large-scale image dataset, named Pictor-v2, is created, which contains about 3,500 images and approximately 11,500 instances of common construction site objects (e.g., building, equipment, worker). To assess the agility of object detection, transfer learning is used to train two variations of this model, namely, YOLO-v2 and YOLO-v3, and test them on different data combinations (crowdsourced, web-mined, or both). Results indicate that performance is higher if the model is trained on both crowdsourced and web-mined images. Additionally, YOLO-v3 outperforms YOLO-v2 by focusing on smaller, harder-to-detect objects. The best-performing YOLO-v3 model has a 78.2% mAP when tested on crowdsourced data. Sensitivity analysis of the output shows that the model's strong suit is in detecting larger objects in less crowded and well-lit spaces. The proposed methodology can also be extended to predict the relative distance of the detected objects with reliable accuracy. Findings of this work lay the foundation for further research on technology-assistive systems to augment human capacities in quickly and reliably interpreting visual data in complex environments.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A New Method Based on Deep Convolutional Neural Networks for Object Detection and Classification
    Yan Liu
    Zhu Zhuxngjie
    Zhang, Qiuhui
    Ding, Xiaotian
    Wang, Ruonan
    Han, Senyao
    Chi Li
    AATCC JOURNAL OF RESEARCH, 2021, 8 : 37 - 45
  • [2] A New Method Based on Deep Convolutional Neural Networks for Object Detection and Classification
    Liu, Yan
    Zhuxngjie, Zhu
    Zhang, Qiuhui
    Ding, Xiaotian
    Wang, Ruonan
    Han, Senyao
    Li, Chi
    AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL) : 38 - 46
  • [3] Object Detection Using Deep Convolutional Neural Networks
    Qian, Huimin
    Xu, Jiawei
    Zhou, Jun
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1151 - 1156
  • [4] Convolutional SVM Networks for Object Detection in UAV Imagery
    Bazi, Yakoub
    Melgani, Farid
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (06): : 3107 - 3118
  • [5] Object Detection and Depth Estimation Approach Based on Deep Convolutional Neural Networks
    Wang, Huai-Mu
    Lin, Huei-Yung
    Chang, Chin-Chen
    SENSORS, 2021, 21 (14)
  • [6] A Review on Object Detection Based on Deep Convolutional Neural Networks for Autonomous Driving
    Lu, Jialin
    Tang, Shuming
    Wang, Jinqiao
    Zhu, Haibing
    Wang, Yunkuan
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5301 - 5308
  • [7] Efficient convolutional neural networks and network compression methods for object detection: a survey
    Zhou, Yong
    Xia, Lei
    Zhao, Jiaqi
    Yao, Rui
    Liu, Bing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10167 - 10209
  • [8] Object detection of transmission line visual images based on deep convolutional neural network
    Zhou Zhu-bo
    Gao Jiao
    Zhang Wei
    Wang Xiao-jing
    Zhang Jiang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2018, 33 (04) : 317 - 325
  • [9] Detecting visual design principles in art and architecture through deep convolutional neural networks
    Demir, Gozdenur
    Cekmis, Asli
    Yesilkaynak, Vahit Bugra
    Unal, Gozde
    AUTOMATION IN CONSTRUCTION, 2021, 130
  • [10] Automated visual detection of geometrical defects in composite manufacturing processes using deep convolutional neural networks
    Djavadifar, Abtin
    Graham-Knight, John Brandon
    Korber, Marian
    Lasserre, Patricia
    Najjaran, Homayoun
    JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (08) : 2257 - 2275