Investigating Aspect Features in Contextualized Embeddings with Semantic Scales and Distributional Similarity

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Aspect, a linguistic category describing how actions and events unfold over time, is traditionally characterized by three semantic properties: stativity, durativity and telicity. In this study, we investigate whether and to what extent these properties are encoded in the verb token embeddings of the contextualized spaces of two English language models – BERT and GPT-2. First, we propose an experiment using semantic projections to examine whether the values of the vector dimensions of annotated verbs for stativity, durativity and telicity reflect human linguistic distinctions. Second, we use distributional similarity to replicate the notorious Imperfective Paradox described by Dowty (1977), and assess whether the embedding models are sensitive to capture contextual nuances of the verb telicity. Our results show that both models encode the semantic distinctions for the aspect properties of stativity and telicity in most of their layers, while durativity is the most challenging feature. As for the Imperfective Paradox, only the embedding similarities computed with the vectors from the early layers of the BERT model align with the expected pattern.

Original languageEnglish
Title of host publicationProceedings of The 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024)
EditorsDanushka Bollegala, Vered Shwartz
PublisherAssociation for Computational Linguistics (ACL)
Pages80-92
Number of pages13
ISBN (Electronic)9798891761063
DOIs
Publication statusPublished - Jun 2024
Event13th Joint Conference on Lexical and Computational Semantics, StarSEM 2024 - Mexico City, Mexico
Duration: 20 Jun 202421 Jun 2024

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference13th Joint Conference on Lexical and Computational Semantics, StarSEM 2024
Country/TerritoryMexico
CityMexico City
Period20/06/2421/06/24

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'Investigating Aspect Features in Contextualized Embeddings with Semantic Scales and Distributional Similarity'. Together they form a unique fingerprint.

Cite this