Annotation and Classification of Light Verbs and Light Verb Variations in Mandarin Chinese

Jingxia Lin, Hongzhi Xu, Menghan Jiang, Chu-Ren Huang

Research output: Chapter in book / Conference proceeding; Conference article published in proceeding or book; Academic research; peer-review

8 Citations (Scopus)

Abstract

Light verbs pose a challenge in linguistics because of their syntactic and semantic versatility and their unique distribution, which differs from that of regular verbs with higher semantic content and selectional restrictions. Because light verbs have light grammatical content, earlier natural language processing (NLP) studies typically put them in a stop word list and ignored them. Recently, however, the classification and identification of light verbs and light verb constructions have become a focus of study in computational linguistics, especially in the context of multi-word expressions, information retrieval, disambiguation, and parsing. Past linguistic and computational studies on light verbs have had very different foci. Linguistic studies tend to focus on the status of light verbs and their various selectional constraints, while NLP studies have focused on light verbs in the context of either a multi-word expression (MWE) or a construction to be identified, classified, or translated, trying to overcome the apparent poverty of semantic content of light verbs. There has been nearly no work attempting to bridge these two lines of research. This paper takes up this challenge by proposing a corpus-based study which classifies and captures syntactic-semantic differences among all light verbs. In this study, we first incorporate results from past linguistic studies to create annotated light verb corpora with syntactic-semantic features. We next adopt a statistical method for automatic identification of light verbs based on these annotated corpora. Our results show that a language-resource-based methodology optimally incorporating linguistic information can resolve challenges posed by light verbs in NLP.

Original language: English
Title of host publication: Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing, LG-LP 2014 - in conjunction with 25th International Conference on Computational Linguistics, COLING 2014
Editors: Jorge Baptista, Pushpak Bhattacharyya, Christiane Fellbaum, Mikel Forcada, Chu-Ren Huang, Svetla Koeva, Cvetana Krstev, Eric Laporte
Publisher: Association for Computational Linguistics (ACL)
Pages: 75-82
Number of pages: 8
ISBN (Electronic): 9781873769447
Publication status: Published - Aug 2014
Event: 2014 Workshop on Lexical and Grammatical Resources for Language Processing, LG-LP 2014 - Dublin, Ireland
Duration: 24 Aug 2014 → …

Publication series

Name: Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing, LG-LP 2014 - in conjunction with 25th International Conference on Computational Linguistics, COLING 2014

Conference

Conference: 2014 Workshop on Lexical and Grammatical Resources for Language Processing, LG-LP 2014
Country/Territory: Ireland
City: Dublin
Period: 24/08/14 → …

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
