Abstract
Automatic terminology extraction requires verifying the termhood of extracted candidate terms within a specific domain. Chinese terminology extraction suffers from a shortage of domain corpora for this verification, even though abundant information is available in other languages. This paper presents a novel approach that overcomes the problem by using word translations and bilingual web resources to improve both coverage and precision. The proposed approach draws on bilingual information from within the candidate terms themselves and from existing domain knowledge to calculate termhood. In contrast to previous research, the method is not confined to pre-determined corpora. Preliminary experiments show a 14.8% improvement in coverage and a 26.3% improvement in precision.
Original language | English |
---|---|
Title of host publication | IEEE NLP-KE 2007 - Proceedings of International Conference on Natural Language Processing and Knowledge Engineering |
Pages | 347-354 |
Number of pages | 8 |
Publication status | Published - 1 Dec 2007 |
Event | International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007 - Beijing, China. Duration: 30 Aug 2007 → 1 Sept 2007 |
Conference
Conference | International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2007 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 30 Aug 2007 → 1 Sept 2007 |
ASJC Scopus subject areas
- Computer Science Applications
- Information Systems
- Information Systems and Management