Chinese sketch engine and the extraction of grammatical collocations

Chu Ren Huang, Adam Kilgarriff, Yiching Wu, Chih Ming Chiu, Simon Smith, Pavel Rychly, Ming Hong Bai, Keh Jiann Chen

Research output: Chapter in book / Conference proceeding; Conference article published in proceeding or book; Academic research; peer-review

50 Citations (Scopus)

Abstract

This paper introduces a new technology for collocation extraction in Chinese. The Sketch Engine (Kilgarriff et al., 2004) has proven to be a very effective tool for the automatic description of lexical information, including collocation extraction, from large-scale corpora. The original Sketch Engine work was based on the British National Corpus (BNC). We extend the Sketch Engine to Chinese using the Gigaword corpus from the LDC. We discuss the functions available in the prototype Chinese Sketch Engine (CSE) as well as the robustness of the language-independent adaptation of the Sketch Engine. We conclude by discussing how Chinese-specific linguistic information can be incorporated to improve the CSE prototype.

Original language: English
Title of host publication: 4th SIGHAN Workshop on Chinese Language Processing, Proceedings of the Workshop
Publisher: Association for Computational Linguistics (ACL)
Pages: 48-55
Number of pages: 8
Publication status: Published - 2005
Externally published: Yes
Event: 4th SIGHAN Workshop on Chinese Language Processing at the 2nd International Joint Conference on Natural Language Processing, SIGHAN@IJCNLP 2005 - Jeju Island, Korea, Republic of
Duration: 14 Oct 2005 to 15 Oct 2005

Conference

Conference: 4th SIGHAN Workshop on Chinese Language Processing at the 2nd International Joint Conference on Natural Language Processing, SIGHAN@IJCNLP 2005
Country/Territory: Korea, Republic of
City: Jeju Island
Period: 14/10/05 to 15/10/05

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
