Abstract
Person description extraction is an important task in biography generation, question answering and summarization, etc. While most of the previous extraction methods mainly depended on structural information, the work presented in the paper will focus on extraction verification by integrating linguistic knowledge provided by HowNet (with semantic knowledge) and the newswire corpus (with statistical information), from which the associations between terms (i.e. the words in HowNet) and person entities are measured. With Term-Entity association, ineligible descriptions extracted could be filtered out, and a higher precision is achieved in turn.
Original language | English |
---|---|
Title of host publication | 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005 |
Pages | 50-56 |
Number of pages | 7 |
Publication status | Published - 12 Dec 2005 |
Event | International Conference on Machine Learning and Cybernetics, ICMLC 2005 - Guangzhou, China Duration: 18 Aug 2005 → 21 Aug 2005 |
Conference
Conference | International Conference on Machine Learning and Cybernetics, ICMLC 2005 |
---|---|
Country/Territory | China |
City | Guangzhou |
Period | 18/08/05 → 21/08/05 |
Keywords
- Description
- Information extraction
- Natural language processing
- Term-entity association
ASJC Scopus subject areas
- General Engineering