ChiWUG: A Graph-based Evaluation Dataset for Chinese Lexical Semantic Change Detection

Jing Chen, Emmanuele Chersoni, Dominik Schlechtweg, Jelena Prokic, Chu Ren Huang

Research output: Chapter in book / Conference proceeding; Conference article published in proceeding or book; Academic research; peer-review

19 Citations (Scopus)

Abstract

Recent studies have suggested that language models are efficient tools for measuring lexical semantic change. In this paper, we present the compilation of the first graph-based evaluation dataset for semantic change in Chinese, covering the periods before and after the Reform and Opening Up. Using the existing DURel framework, we collect over 61,000 human semantic relatedness judgments for 40 target words. The inferred word usage graphs and semantic change scores provide a basis for visualizing and evaluating semantic change.

Original language: English
Title of host publication: LChange 2023 - 4th International Workshop on Computational Approaches to Historical Language Change 2023, Proceedings
Editors: Nina Tahmasebi, Syrielle Montariol, Haim Dubossarsky, Andrey Kutuzov, Simon Hengchen, David Alfter, Francesco Periti, Pierluigi Cassotti
Publisher: Association for Computational Linguistics (ACL)
Pages: 93-99
Number of pages: 7
ISBN (Electronic): 9798891760431
Publication status: Published - 6 Dec 2023
Event: 4th International Workshop on Computational Approaches to Historical Language Change, LChange 2023 - Singapore, Singapore
Duration: 6 Dec 2023 → …

Publication series

Name: LChange 2023 - 4th International Workshop on Computational Approaches to Historical Language Change 2023, Proceedings

Conference

Conference: 4th International Workshop on Computational Approaches to Historical Language Change, LChange 2023
Country/Territory: Singapore
City: Singapore
Period: 6/12/23 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Computer Science Applications
  • Software
