Evalution-man 2.0: Expand the evaluation dataset for vector space models

Hongchao Liu, Chu-ren Huang

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

3 Citations (Scopus)

Abstract

We introduce EVALution 2.0, a simplified Mandarin dataset for the evaluation of Vector Space Models. We take a psycholinguistics-based methodology through the use of a verbal association task, which differs from previous datasets that use corpus and ontology to construct word relation pairs. Semantic neighbors were created for 100 target words and surprisingly, to which participants produced 1129 word relation pairs. In a separate agreement-rating task, only 62 pairs showed were rejected. The methodology has proven to be a way to expand the existing resources quickly while maintaining a high level of quality.
Original languageEnglish
Title of host publicationChinese Lexical Semantics - 17th Workshop, CLSW 2016, Revised Selected Papers
PublisherSpringer Verlag
Pages261-268
Number of pages8
ISBN (Print)9783319495071
DOIs
Publication statusPublished - 1 Jan 2016
Event17th Chinese Lexical Semantics Workshop, CLSW 2016 - Singapore, Singapore
Duration: 20 May 201622 May 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10085 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th Chinese Lexical Semantics Workshop, CLSW 2016
CountrySingapore
CitySingapore
Period20/05/1622/05/16

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this