Abstract
Mining entities from open Chinese mechanical equipment texts have become prevailing in the intelligent manufacturing field. However, compared to named entities in other domains, it is very hard to determine entity boundaries in mining Chinese mechanical equipment texts because there is no unified standard for entity simplification, and digits and units are mixed in the text. To address the issue, this paper presents an entity boundary-define strategy and constructs a mechanical equipment-oriented corpus called MECorpus using open Chinese mechanical equipment texts by combining domain knowledge. A multi-neural network collaboration model Adaptive-BERT-BiLSTM-CRF-Rating (ABBCR) is then proposed for mechanical equipment named entity recognition. The novelty of ABBCR is characterised by its adaptive input mechanism and rating score ability for identified entities. Various experiments about ABBCR model selection, evaluation and application are conducted on MECorpus. Experimental results show that the ABBCR model provides high-quality mechanical equipment entities for constructing the mechanical equipment knowledge graph. ABBCR combined with the large language model has proved to be a promising method to manage complex mechanical equipment expertise.
Original language | English |
---|---|
Number of pages | 26 |
Journal | Journal of Engineering Design |
DOIs | |
Publication status | E-pub ahead of print - Jul 2024 |
Keywords
- adaptive mechanism
- bidirectional encoder representations from transformers
- fuzzy boundary
- multi-neural network collaboration
- Named entity recognition
ASJC Scopus subject areas
- General Engineering