Developing position structure-based framework for chinese entity relation extraction

Peng Zhang, Wenjie Li, Yuexian Hou, Dawei Song

Research output: Journal article publicationJournal articleAcademic researchpeer-review

17 Citations (Scopus)

Abstract

Relation extraction is the task of finding semantic relations between two entities in text, and is often cast as a classification problem. In contrast to the significant achievements on English language, research progress in Chinese relation extraction is relatively limited. In this article, we present a novel Chinese relation extraction framework, which is mainly based on a 9-position structure. The design of this proposed structure is motivated by the fact that there are some obvious connections between relation types/subtypes and position structures of two entities. The 9-position structure can be captured with less effort than applying deep natural language processing, and is effective to relieve the class imbalance problem which often hurts the classification performance. In our framework, all involved features do not require Chinese word segmentation, which has long been limiting the performance of Chinese language processing. We also utilize some correction and inference mechanisms to further improve the classified results. Experiments on the ACE 2005 Chinese data set show that the 9-position structure feature can provide strong support for Chinese relation extraction. As well as this, other strategies are also effective to further improve the performance.
Original languageEnglish
Article number14
JournalACM Transactions on Asian Language Information Processing
Volume10
Issue number3
DOIs
Publication statusPublished - 1 Sept 2011

Keywords

  • Chinese language
  • Entity relation extraction
  • Imbalance class classification
  • Position structure

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Developing position structure-based framework for chinese entity relation extraction'. Together they form a unique fingerprint.

Cite this