TY - JOUR
T1 - Development of training image database using web crawling for vision-based site monitoring
AU - Hwang, Jeongbin
AU - Kim, Jinwoo
AU - Chi, Seokho
AU - Seo, Joon Oh
N1 - Funding Information:
This research was supported by a grant ( 21CTAP-C151784-03 ) from the Technology Advancement Research Program funded by the Ministry of Land, Infrastructure, and Transport of the Korean government , the BK21 PLUS research program of the National Research Foundation of Korea . and by the National Research Foundation of Korea grant funded by the Korea government (No. 2021R1A2C2003696 ).
Publisher Copyright:
© 2022 The Authors
PY - 2022/3
Y1 - 2022/3
N2 - As most of the state-of-the-art technologies for vision-based monitoring were originated from machine learning or deep learning algorithms, it is crucial to build a large and rich training image database (DB). For this purpose, this paper proposes an automated framework that builds a large, high-quality training DB for construction site monitoring. The framework consists of three main processes: (1) automated construction image collection using web crawling, (2) automated image labeling using an image segmentation model, and (3) fully randomized foreground-background cross-oversampling. Using the developed framework, it was possible to automatically construct a training DB, composed of 5864 images, for the detection of construction objects in 53.5 min. The deep learning model trained by the DB successfully detected construction resources with an average precision of 92.71% and a recall rate of 88.14%. The findings of this study can reduce the time and effort required to develop vision-based site monitoring technologies.
AB - As most of the state-of-the-art technologies for vision-based monitoring were originated from machine learning or deep learning algorithms, it is crucial to build a large and rich training image database (DB). For this purpose, this paper proposes an automated framework that builds a large, high-quality training DB for construction site monitoring. The framework consists of three main processes: (1) automated construction image collection using web crawling, (2) automated image labeling using an image segmentation model, and (3) fully randomized foreground-background cross-oversampling. Using the developed framework, it was possible to automatically construct a training DB, composed of 5864 images, for the detection of construction objects in 53.5 min. The deep learning model trained by the DB successfully detected construction resources with an average precision of 92.71% and a recall rate of 88.14%. The findings of this study can reduce the time and effort required to develop vision-based site monitoring technologies.
KW - Automated labeling
KW - Construction site
KW - Training image database
KW - Vision-based monitoring
KW - Web crawling
UR - http://www.scopus.com/inward/record.url?scp=85123019989&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2022.104141
DO - 10.1016/j.autcon.2022.104141
M3 - Journal article
AN - SCOPUS:85123019989
SN - 0926-5805
VL - 135
JO - Automation in Construction
JF - Automation in Construction
M1 - 104141
ER -