Abstract
2014 by Ge Xu and Chu-Ren Huang. Chinese radicals are linguistic elements smaller than Chinese characters1. Normally, a radical is a semantic category and almost all characters contain radicals or are radicals themselves. In subjectivity classification on sentences, we can use radicals to represent characters, which reduce the scale of word space while keep the subjectivity information. In this paper, we manually labeled a character set to build a high-quality radical-character mapping, and then the mapping is used to generalize character-based features with radicals. In experiments, we at first evaluated the performance when directly generalizing characters with radicals, and then offer a hypothesis that can reduce noises. Experiments show that this approach based on our hypothesis can reduce feature space while keep or improve the performance, which is especially useful when the training samples are scarce.
Original language | English |
---|---|
Title of host publication | Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014 |
Publisher | Faculty of Pharmaceutical Sciences, Chulalongkorn University |
Pages | 495-502 |
Number of pages | 8 |
ISBN (Electronic) | 9786165518871 |
Publication status | Published - 1 Jan 2014 |
Event | 28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014 - Cape Panwa Hotel, Phuket, Thailand Duration: 12 Dec 2014 → 14 Dec 2014 |
Conference
Conference | 28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014 |
---|---|
Country/Territory | Thailand |
City | Phuket |
Period | 12/12/14 → 14/12/14 |
Keywords
- Chinese character
- Radical
- Sentiment analysis
- Subjectivity classification
ASJC Scopus subject areas
- Language and Linguistics
- Computer Science (miscellaneous)