Cross-Lingual Name Entity Recognition from Clinical Text Using Mixed Language Query

Kunli Shi, Gongchi Chen, Jinghang Gu, Longhua Qian (Corresponding Author), Guodong Zhou

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Cross-lingual Named Entity Recognition (Cross-Lingual NER) addresses the challenge of NER with limited annotated data in low-resource languages by transferring knowledge from high-resource languages. Particularly, in the clinical domain, the lack of annotated corpora for Cross-Lingual NER hinders the development of cross-lingual clinical text named entity recognition. By leveraging the English clinical text corpus I2B2 2010 and the Chinese clinical text corpus CCKS2019, we construct a cross-lingual clinical text named entity recognition corpus (CLC-NER) via label alignment. Further, we propose a machine reading comprehension framework for Cross-Lingual NER using mixed language queries to enhance model transfer capabilities. We conduct comprehensive experiments on the CLC-NER corpus, and the results demonstrate the superiority of our approach over other systems.

Original languageEnglish
Title of host publicationHealth Information Processing
Subtitle of host publication9th China Health Information Processing Conference, CHIP 2023, Proceedings
EditorsHua Xu, Qingcai Chen, Hongfei Lin, Fei Wu, Lei Liu, Buzhou Tang, Tianyong Hao, Zhengxing Huang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages3-21
Number of pages19
ISBN (Print)9789819998630
DOIs
Publication statusPublished - 1 Feb 2024
Event9th China Health Information Processing Conference, CHIP 2023 - Hangzhou, China
Duration: 27 Oct 202329 Oct 2023

Publication series

NameCommunications in Computer and Information Science
Volume1993 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference9th China Health Information Processing Conference, CHIP 2023
Country/TerritoryChina
CityHangzhou
Period27/10/2329/10/23

Keywords

  • Clinical Text
  • Cross-Lingual NER
  • Machine Reading Comprehension
  • Mixed Language Query

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Cross-Lingual Name Entity Recognition from Clinical Text Using Mixed Language Query'. Together they form a unique fingerprint.

Cite this