Deep Convolutional Networks for Construction Object Detection Under Different Visual Conditions

被引：41

作者：

Nath, Nipun D. ^{[1
]}

Behzadan, Amir H. ^{[2
]}

机构：

[1] Texas A&M Univ, Zachry Dept Civil Engn, College Stn, TX USA

[2] Texas A&M Univ, Dept Construct Sci, College Stn, TX 77843 USA

来源：

FRONTIERS IN BUILT ENVIRONMENT | 2020年 / 6卷

基金：

美国国家科学基金会;

关键词：

visual recognition; deep learning; object detection; computer vision; content retrieval; NEURAL-NETWORKS; RECOGNITION; RESOURCES; CLASSIFICATION; RECONSTRUCTION; ANNOTATION; PROGRESS; KINECT; MODEL;

D O I：

10.3389/fbuil.2020.00097

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Sensing and reality capture devices are widely used in construction sites. Among different technologies, vision-based sensors are by far the most common and ubiquitous. A large volume of images and videos is collected from construction projects every day to track work progress, measure productivity, litigate claims, and monitor safety compliance. Manual interpretation of such colossal amounts of data, however, is non-trivial, error-prone, and resource-intensive. This has motivated new research on soft computing methods that utilize high-power data processing, computer vision, and deep learning (DL) in the form of convolutional neural networks (CNNs). A fundamental step toward machine-driven interpretation of construction site scenery is to accurately identify objects of interest for a particular problem. The accuracy requirement, however, may offset the computational speed of the candidate method. While lightweight DL algorithms (e.g., Mask R-CNN) can perform visual recognition with relatively high accuracy, they suffer from low processing efficacy, which hinders their use in real-time decision-making. One of the most promising DL algorithms that balance speed and accuracy is YOLO (you-only-look-once). This paper investigates YOLO-based CNN models in fast detection of construction objects. First, a large-scale image dataset, named Pictor-v2, is created, which contains about 3,500 images and approximately 11,500 instances of common construction site objects (e.g., building, equipment, worker). To assess the agility of object detection, transfer learning is used to train two variations of this model, namely, YOLO-v2 and YOLO-v3, and test them on different data combinations (crowdsourced, web-mined, or both). Results indicate that performance is higher if the model is trained on both crowdsourced and web-mined images. Additionally, YOLO-v3 outperforms YOLO-v2 by focusing on smaller, harder-to-detect objects. The best-performing YOLO-v3 model has a 78.2% mAP when tested on crowdsourced data. Sensitivity analysis of the output shows that the model's strong suit is in detecting larger objects in less crowded and well-lit spaces. The proposed methodology can also be extended to predict the relative distance of the detected objects with reliable accuracy. Findings of this work lay the foundation for further research on technology-assistive systems to augment human capacities in quickly and reliably interpreting visual data in complex environments.

引用

页数：22

共 50 条

[41] Deep Convolutional Object Detection and Search Area Prediction for UAV Tracking
Boirel, Nicolas
Akhloufi, Moulay A.
[J]. UNMANNED SYSTEMS TECHNOLOGY XXIII, 2021, 11758
[42] Human and object detection using Hybrid Deep Convolutional Neural Network
P. Mukilan
Wogderess Semunigus
[J]. Signal, Image and Video Processing, 2022, 16 : 1913 - 1923
[43] Enhance Visual Recognition Under Adverse Conditions via Deep Networks
Liu, Ding
Cheng, Bowen
Wang, Zhangyang
Zhang, Haichao
Huang, Thomas S.
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4401 - 4412
[44] A novel deep convolutional encoder-decoder network: application to moving object detection in videos
Ganivada, Avatharam
Yara, Srinivas
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29) : 22027 - 22041
[45] Invariance of object detection in untrained deep neural networks
Cheon, Jeonghwan
Baek, Seungdae
Paik, Se-Bum
[J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2022, 16
[46] Breast cancer detection: Shallow convolutional neural network against deep convolutional neural networks based approach
Das, Himanish Shekhar
Das, Akalpita
Neog, Anupal
Mallik, Saurav
Bora, Kangkana
Zhao, Zhongming
[J]. FRONTIERS IN GENETICS, 2023, 13
[47] Classification of physiological disorders in apples using deep convolutional neural network under different lighting conditions
Birkan Buyukarikan
Erkan Ulker
[J]. Multimedia Tools and Applications, 2023, 82 : 32463 - 32483
[48] Classification of physiological disorders in apples using deep convolutional neural network under different lighting conditions
Buyukarikan, Birkan
Ulker, Erkan
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 32463 - 32483
[49] Deep Learning for Visual Indonesian Place Classification with Convolutional Neural Networks
Chowanda, Andry
Sutoyo, Rhio
[J]. 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 436 - 443
[50] Object Detection by a Super-Resolution Method and a Convolutional Neural Networks
Na, Bokyoon
Fox, Geoffrey C.
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2263 - 2269

← 1 2 3 4 5 →