Medical literature’s semantic status and solutiona corpus-based study on the categorization and composition of personal names and terms of address in Chinese classical novels

D. Xiong, Qin Lu, F. Lo, D. Shi, T.S. Chiu

Research output: Journal article publicationJournal articleAcademic research

Abstract

As an important linguistic category, terms of address not only carry particular information, but also express feelings and emotions. They are therefore widely used in literature works. In natural language processing (NLP) and its applications, terms of address are one of the key elements in named entity recognition, which can affect the overall performance of an NLP system. Based on the analysis of a manually-annotated corpus of four Chinese classical novels in the Ming and Qing dynasties, this paper presents a classification and annotation system for personal names and terms of address from the perspective of named entity recognition and information extraction in NLP. Personal names and terms of address are categorized into simple types and compound types and the compound-type is further categorized into four subtypes, namely, fixed expressions, appositive constructions, subordinate constructions of affiliation, and other subordinate constructions.
Original languageEnglish
Pages (from-to)10-20
Number of pages11
JournalInternational journal of knowledge and language processing
Volume4
Issue number4
Publication statusPublished - 2013

Keywords

  • Terms of address
  • NLP
  • Novels in the Ming and Qing dynasties
  • Corpus
  • Named entity recognition

Cite this