Skip to main navigation Skip to search Skip to main content

Exploring Layer-wise Representations of English and Chinese Homonymy in Pre-trained Language Models

  • Matthew King-Hang Ma
  • , Chenwei Xie
  • , Wenbo Wang
  • , William Shiyuan Wang

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Homonymy can easily raise lexical ambiguity due to the misunderstanding of its multiple senses. Correct recognition of homonym sense greatly relies on its surrounding context. This ambiguous nature makes homonyms an appropriate testbed for examining the contextualization capability of pre-trained (PLM) and large language models (LLMs). Considering the impact of part of speech (POS) on homonym disambiguation and the prevalence of English-focused studies in word embedding research, this study extends to Chinese and provides a comprehensive layer-wise analysis of homonym representations in both languages, spanning same and different POS categories, across four families of PLMs/LLMs (BERT, GPT-2, Llama 3, Qwen 2.5). Through the creation of a synthetic dataset and computation of disambiguation score (D-Score), we found that: (1) no universal layer depth excels in differentiating homonym representations; (2) bidirectional models produce better contextualized homonym representations compared to much larger autoregressive models; (3) most importantly, POS affects homonym representations in models in ways that differ from human research findings. The individual differences between LLMs uncovered in our study challenge the simplistic understanding of their inner workings. This reveals a compelling research frontier: conducting controlled experiments with purposefully manipulated inputs to enhance the interpretability of LLMs. We have made our dataset and codes available publicly at https://github.com/neurothew/exploring-homonym-rep-in-llm.
Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: ACL 2025
EditorsWanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
PublisherAssociation for Computational Linguistics (ACL)
Pages19705-19724
ISBN (Electronic)9798891762565
DOIs
Publication statusPublished - Aug 2025
EventThe 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025) - Vienna, Austria
Duration: 27 Jul 20251 Aug 2025

Conference

ConferenceThe 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Country/TerritoryAustria
CityVienna
Period27/07/251/08/25

Fingerprint

Dive into the research topics of 'Exploring Layer-wise Representations of English and Chinese Homonymy in Pre-trained Language Models'. Together they form a unique fingerprint.

Cite this