Using genetic algorithms for optimizing part of speech tagset

H.L. Sun, Qin Lu, S.W. Yu

Research output: Journal article publication › Journal article › Academic research

21 Citations (Scopus)

Abstract

In the past, part-of-speech (POS) tagsets were chosen mainly on the basis of expert knowledge and experience, with no automatic or semi-automatic method to assist the process. This paper proposes a novel method that uses genetic algorithms (GA) to search for an optimized POS tagset. The method automatically searches a set of candidate tagsets for an optimal or near-optimal one, and its parameters can be adjusted to suit the requirements of a particular application. Experiments show that GA optimizes the POS tagset efficiently and offers a systematic aid for making an informed choice of tagset.
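The abstract gives no details of the encoding or the fitness function, so the following is only a minimal sketch of how a genetic search over candidate tagsets might look, not the authors' implementation. It assumes each candidate tagset is a bit string in which bit i says whether a fine-grained distinction is kept. The names `genetic_search`, `GAIN`, `PENALTY`, and `toy_fitness`, and all numeric values, are hypothetical and chosen for illustration.

```python
import random

def genetic_search(fitness, n_genes, pop_size=30, generations=40,
                   crossover_rate=0.8, mutation_rate=0.02, seed=0):
    """Generic GA over bit strings with tournament selection and elitism.

    Requires n_genes >= 2 (one-point crossover needs an interior cut point).
    """
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(pop_size)]
    best = max(pop, key=fitness)

    def pick():
        # Binary tournament: sample two individuals, keep the fitter one.
        a, b = rng.sample(pop, 2)
        return a if fitness(a) >= fitness(b) else b

    for _ in range(generations):
        nxt = [best[:]]  # elitism: the best individual always survives
        while len(nxt) < pop_size:
            p1, p2 = pick(), pick()
            if rng.random() < crossover_rate:
                cut = rng.randint(1, n_genes - 1)  # one-point crossover
                child = p1[:cut] + p2[cut:]
            else:
                child = p1[:]
            # Bit-flip mutation, applied independently to each gene.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=fitness)
    return best

# Hypothetical toy problem: six fine-grained distinctions, each with an
# assumed benefit if kept; PENALTY is an assumed per-tag cost that a user
# could adjust to favor smaller or larger tagsets.
GAIN = [0.9, 0.1, 0.6, 0.05, 0.4, 0.2]
PENALTY = 0.3

def toy_fitness(bits):
    return sum(g * b for g, b in zip(GAIN, bits)) - PENALTY * sum(bits)

if __name__ == "__main__":
    chosen = genetic_search(toy_fitness, n_genes=len(GAIN))
    print(chosen, round(toy_fitness(chosen), 3))
```

In this sketch, raising `PENALTY` pushes the search toward coarser tagsets, which is one way to picture the adjustable parameters mentioned in the abstract. A real fitness function would presumably be measured on tagged data rather than taken from a fixed table.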
Original language: Chinese (Simplified)
Pages (from-to): 19-27
Number of pages: 9
Journal: 中文信息学报 (Journal of Chinese information processing)
Volume: 15
Issue number: 1
Publication status: Published - 2001

Keywords

  • POS tagging
  • Word class
  • POS tagset
  • Genetic algorithm
