Introduction to CKIP’s Language Resources and Their Applications

Zhao-Ming Gao (Corresponding Author), Chu-Ren Huang, Keh-Jiann Chen

Research output: Chapter in book / Conference proceeding › Chapter in an edited book (as author) › Academic research › peer-review

Abstract

In this chapter, we will introduce the language resources developed by the Chinese Knowledge Information Processing (CKIP) Group at Academia Sinica in Taiwan over the past 30 years. These include monolingual and bilingual lexical knowledge bases (CKIP lexical knowledge base, Hantology, Chinese WordNet, Sinica BOW, and E-HowNet), Chinese grammar (Information-based Case Grammar), annotated corpora (Sinica Chinese Corpus, Sinica Ancient Chinese Corpus, Sinica Chinese Treebank, and Chinese Sketch Engine), and online Chinese word segmentation and parsing systems. After a brief overview, we will show how some of these resources can be employed to generate natural language processing (NLP) applications using machine learning algorithms.
Original language: English
Title of host publication: Chinese Language Resources
Subtitle of host publication: Data Collection, Linguistic Analysis, Annotation and Language Processing
Editors: Chu-Ren Huang, Shu-Kai Hsieh, Peng Jin
Publisher: Springer
Chapter: 3
Pages: 27-56
ISBN (Electronic): 978-3-031-38913-9
ISBN (Print): 978-3-031-38912-2, 978-3-031-38915-3
Publication status: Published - 19 Dec 2023

Publication series

Name: Text, Speech and Language Technology
Publisher: Springer
Volume: 49
ISSN (Print): 1386-291X
ISSN (Electronic): 2542-9388

Keywords

  • Hantology
  • Chinese WordNet
  • E-HowNet
  • Sinica Chinese Treebank
  • Sinica Chinese Corpus
