The formal representation for Chinese characters

Y.M. Chou, Chu-ren Huang

Research output: Journal article publicationJournal articleAcademic research


汉字的知识本体和形式表达的研究不仅有助于计算机处理汉语,更能够突显汉字的特色和丰富知识内涵。本文旨在说明如何在计算机建立汉字知识,以及如何用形式语言表达汉字知识。与过去的汉字数据库不同的是,本研究以语意网的形式语言描述汉字的知识,希望能够对这方面的研究有所启发。汉字知识的形式表达内容包括:字形外在结构和演变的描述、意符与声符的描述、字形内在结构的描述、字义与衍生词的描述、异体字关系的描述、字音演变的描述、时间的描述,其中,意符和字义皆与IEEE建议上层共享知识本体(SUMO)对应,作为汉字知识的上层知识。本研究采用的形式语言是OWL-DL,有助于汉字知识与其他知识本体分享知识。||The formal representation of Chinese characters using ontology is an important research area,and advantageous to process Chinese language. This paper aims to describe the methodology of constructing the ontology of Chinese characters and its formal representation. The formal representation proposed herein includes the external structure and derivation of Chinese characters,semantic and phonetic symbols,internal structure,sense and derived words,the relations of variants,and the pronunciations. The semantic symbols and senses of characters are connected with IEEE Suggested Upper Merged Ontology (SUMO) . This study uses the OWL (Web Ontology Language)-DL to describe the knowledge of Chinese characters and share with other ontology.
Original languageChinese (Simplified)
Pages (from-to)142-161
Number of pages20
Journal当代语言学 (Contemporary linguistics)
Issue number2
Publication statusPublished - 2013


  • Formal representation
  • Chinese characters ontology
  • SUMO

Cite this