A few-shot learning approach for database-free vision-based monitoring on construction sites

Cited by: 46
Authors
Kim, Jinwoo [1, 3]
Chi, Seokho [2, 3]
Affiliations
[1] Univ Michigan, Dept Civil & Environm Engn, Ann Arbor, MI 48109 USA
[2] Seoul Natl Univ, Dept Civil & Environm Engn, Seoul 08826, South Korea
[3] Seoul Natl Univ, Inst Construct & Environm Engn, Seoul 08826, South Korea
Keywords
Construction site; Vision-based; Site monitoring; Database-free (DB-free); Few-shot learning; Meta-learning; EARTHMOVING EXCAVATORS; SURVEILLANCE VIDEOS; WORKERS ACTIVITIES; ACTION RECOGNITION; NEURAL-NETWORK; PRODUCTIVITY; INSPECTION; EQUIPMENT; CONTEXT
DOI
10.1016/j.autcon.2021.103566
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
This paper proposes a few-shot learning approach that can learn and detect new construction objects when only a few training images are available. The approach comprises few-shot model design and meta-learning processes. To validate it, the authors conducted experiments on a popular construction benchmark dataset, AIMDataset. With only 20 training images provided for a new construction object, the few-shot learning approach built an object detection model with a mean Average Precision (mAP) of 73.1% on average, whereas existing supervised learning was limited to 36.5%. The results imply that the proposed approach can learn and detect new types of construction objects from only a few labeled images, reducing the number of training images required while maximizing model performance. This could in turn reduce the human effort required for data labeling and enhance the practicality of vision-based construction monitoring systems.
Pages: 11