Chinese typed collocation extraction using corpus-based syntactic collocation patterns

Wanyin Li, Qin Lu, James Liu

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

4 Citations (Scopus)

Abstract

Collocations play significant role in many application and extraction them automatically is useful in NLP. Syntactic-based phrase patterns used in collocation extraction have brought advantages due to the well-formedness of results and automatically classifying the candidates into syntactically congeneric categories. However, due to the language independency, the arbitrary choice of syntactic patterns for target collocations brings drawbacks for evaluation as well as adaptation for a new language. This work presents a corpus-driven framework to generate collocation templates for nouns and verbs phrase at first and then integrate them with statistical association measures for noun/verb phrase collocation extraction, namely typed collocation extraction. The experiment results show a higher average precision of 84.80% and a so called local recall value of 55.99% based on a randomly selected noun and verb headwords.
Original languageEnglish
Title of host publicationIEEE NLP-KE 2007 - Proceedings of International Conference on Natural Language Processing and Knowledge Engineering
Pages248-255
Number of pages8
DOIs
Publication statusPublished - 1 Dec 2007
EventInternational Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007 - Beijing, China
Duration: 30 Aug 20071 Sep 2007

Conference

ConferenceInternational Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007
CountryChina
CityBeijing
Period30/08/071/09/07

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Information Systems and Management

Cite this