TY - GEN
T1 - Cross-Lingual Name Entity Recognition from Clinical Text Using Mixed Language Query
AU - Shi, Kunli
AU - Chen, Gongchi
AU - Gu, Jinghang
AU - Qian, Longhua
AU - Zhou, Guodong
N1 - Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2024/2/1
Y1 - 2024/2/1
N2 - Cross-lingual Named Entity Recognition (Cross-Lingual NER) addresses the challenge of NER with limited annotated data in low-resource languages by transferring knowledge from high-resource languages. Particularly, in the clinical domain, the lack of annotated corpora for Cross-Lingual NER hinders the development of cross-lingual clinical text named entity recognition. By leveraging the English clinical text corpus I2B2 2010 and the Chinese clinical text corpus CCKS2019, we construct a cross-lingual clinical text named entity recognition corpus (CLC-NER) via label alignment. Further, we propose a machine reading comprehension framework for Cross-Lingual NER using mixed language queries to enhance model transfer capabilities. We conduct comprehensive experiments on the CLC-NER corpus, and the results demonstrate the superiority of our approach over other systems.
AB - Cross-lingual Named Entity Recognition (Cross-Lingual NER) addresses the challenge of NER with limited annotated data in low-resource languages by transferring knowledge from high-resource languages. Particularly, in the clinical domain, the lack of annotated corpora for Cross-Lingual NER hinders the development of cross-lingual clinical text named entity recognition. By leveraging the English clinical text corpus I2B2 2010 and the Chinese clinical text corpus CCKS2019, we construct a cross-lingual clinical text named entity recognition corpus (CLC-NER) via label alignment. Further, we propose a machine reading comprehension framework for Cross-Lingual NER using mixed language queries to enhance model transfer capabilities. We conduct comprehensive experiments on the CLC-NER corpus, and the results demonstrate the superiority of our approach over other systems.
KW - Clinical Text
KW - Cross-Lingual NER
KW - Machine Reading Comprehension
KW - Mixed Language Query
UR - http://www.scopus.com/inward/record.url?scp=85186660504&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-9864-7_1
DO - 10.1007/978-981-99-9864-7_1
M3 - Conference article published in proceeding or book
AN - SCOPUS:85186660504
SN - 9789819998630
T3 - Communications in Computer and Information Science
SP - 3
EP - 21
BT - Health Information Processing
A2 - Xu, Hua
A2 - Chen, Qingcai
A2 - Lin, Hongfei
A2 - Wu, Fei
A2 - Liu, Lei
A2 - Tang, Buzhou
A2 - Hao, Tianyong
A2 - Huang, Zhengxing
PB - Springer Science and Business Media Deutschland GmbH
T2 - 9th China Health Information Processing Conference, CHIP 2023
Y2 - 27 October 2023 through 29 October 2023
ER -