TY - JOUR
T1 - A transformer-based deep learning model for recognizing communication-oriented entities from patents of ICT in construction
AU - Wu, Hengqin
AU - Shen, Geoffrey Qiping
AU - Lin, Xue
AU - Li, Minglei
AU - Li, Clyde Zhengdao
N1 - Funding Information:
This research was supported by the National Natural Science Foundation of China (NSFC).
Funding Information:
(No. 71771067 , No. 71801159 and No. 52078302 ), the National Natural Science Foundation of Guangdong Province (No. 2018A030310534 ), and Youth Fund of Humanities and Social Sciences Research of the Ministry of Education (No. 18YJCZH090 ).
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/5
Y1 - 2021/5
N2 - The patents of information and communication technology (ICT) in construction are valuable sources of technological solutions to communication problems in the construction practice. However, it is often difficult for practitioners and stakeholders to identify the key communication functionalities from complicated expressions in the patent documents. Addressing such challenges, this study develops a deep learning model to enable automatic recognition of communication-oriented entities (CEs) from patent documents. The proposed model is structured based on the Transformer, consisting of feed-forward and self-attention neural networks to better recognize ambiguous and unknown entities by utilizing contextual information. The validation results showed that the proposed model has superior performance in CE recognition than traditional recurrent neural networks (RNN)-based models, especially in recognizing ambiguous and unknown entities. Moreover, experimental results on some research literature and a real-life project report showed satisfactory performance of the model in CE recognition across different document types.
AB - The patents of information and communication technology (ICT) in construction are valuable sources of technological solutions to communication problems in the construction practice. However, it is often difficult for practitioners and stakeholders to identify the key communication functionalities from complicated expressions in the patent documents. Addressing such challenges, this study develops a deep learning model to enable automatic recognition of communication-oriented entities (CEs) from patent documents. The proposed model is structured based on the Transformer, consisting of feed-forward and self-attention neural networks to better recognize ambiguous and unknown entities by utilizing contextual information. The validation results showed that the proposed model has superior performance in CE recognition than traditional recurrent neural networks (RNN)-based models, especially in recognizing ambiguous and unknown entities. Moreover, experimental results on some research literature and a real-life project report showed satisfactory performance of the model in CE recognition across different document types.
KW - Construction industry
KW - Contextual information
KW - Deep learning
KW - Entity recognition
KW - Information and communications technology (ICT)
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85100627233&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2021.103608
DO - 10.1016/j.autcon.2021.103608
M3 - Journal article
AN - SCOPUS:85100627233
SN - 0926-5805
VL - 125
JO - Automation in Construction
JF - Automation in Construction
M1 - 103608
ER -