Evalution-man 2.0: Expand the evaluation dataset for vector space models

Hongchao Liu, Chu-ren Huang

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

3 Citations (Scopus)


We introduce EVALution 2.0, a simplified Mandarin dataset for the evaluation of Vector Space Models. We take a psycholinguistics-based methodology through the use of a verbal association task, which differs from previous datasets that use corpus and ontology to construct word relation pairs. Semantic neighbors were created for 100 target words and surprisingly, to which participants produced 1129 word relation pairs. In a separate agreement-rating task, only 62 pairs showed were rejected. The methodology has proven to be a way to expand the existing resources quickly while maintaining a high level of quality.
Original languageEnglish
Title of host publicationChinese Lexical Semantics - 17th Workshop, CLSW 2016, Revised Selected Papers
PublisherSpringer Verlag
Number of pages8
ISBN (Print)9783319495071
Publication statusPublished - 1 Jan 2016
Event17th Chinese Lexical Semantics Workshop, CLSW 2016 - Singapore, Singapore
Duration: 20 May 201622 May 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10085 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference17th Chinese Lexical Semantics Workshop, CLSW 2016

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this