An analysis of radicals-based features in subjectivity classification on simplified Chinese sentences

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

2014 by Ge Xu and Chu-Ren Huang. Chinese radicals are linguistic elements smaller than Chinese characters1. Normally, a radical is a semantic category and almost all characters contain radicals or are radicals themselves. In subjectivity classification on sentences, we can use radicals to represent characters, which reduce the scale of word space while keep the subjectivity information. In this paper, we manually labeled a character set to build a high-quality radical-character mapping, and then the mapping is used to generalize character-based features with radicals. In experiments, we at first evaluated the performance when directly generalizing characters with radicals, and then offer a hypothesis that can reduce noises. Experiments show that this approach based on our hypothesis can reduce feature space while keep or improve the performance, which is especially useful when the training samples are scarce.
Original languageEnglish
Title of host publicationProceedings of the 28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014
PublisherFaculty of Pharmaceutical Sciences, Chulalongkorn University
Pages495-502
Number of pages8
ISBN (Electronic)9786165518871
Publication statusPublished - 1 Jan 2014
Event28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014 - Cape Panwa Hotel, Phuket, Thailand
Duration: 12 Dec 201414 Dec 2014

Conference

Conference28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014
Country/TerritoryThailand
CityPhuket
Period12/12/1414/12/14

Keywords

  • Chinese character
  • Radical
  • Sentiment analysis
  • Subjectivity classification

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science (miscellaneous)

Cite this