Abstract
Collocations play significant role in many application and extraction them automatically is useful in NLP. Syntactic-based phrase patterns used in collocation extraction have brought advantages due to the well-formedness of results and automatically classifying the candidates into syntactically congeneric categories. However, due to the language independency, the arbitrary choice of syntactic patterns for target collocations brings drawbacks for evaluation as well as adaptation for a new language. This work presents a corpus-driven framework to generate collocation templates for nouns and verbs phrase at first and then integrate them with statistical association measures for noun/verb phrase collocation extraction, namely typed collocation extraction. The experiment results show a higher average precision of 84.80% and a so called local recall value of 55.99% based on a randomly selected noun and verb headwords.
Original language | English |
---|---|
Title of host publication | IEEE NLP-KE 2007 - Proceedings of International Conference on Natural Language Processing and Knowledge Engineering |
Pages | 248-255 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 1 Dec 2007 |
Event | International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007 - Beijing, China Duration: 30 Aug 2007 → 1 Sep 2007 |
Conference
Conference | International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007 |
---|---|
Country | China |
City | Beijing |
Period | 30/08/07 → 1/09/07 |
ASJC Scopus subject areas
- Computer Science Applications
- Information Systems
- Information Systems and Management