Abstract
Collecting modality exclusivity norms for lexical items has recently become a common practice in psycholinguistics and cognitive research. However, these norms are available only for a relatively small number of languages and often involve a costly and time-consuming collection of ratings. In this work, we aim at learning a mapping between word embeddings and modality norms. Our experiments focused on crosslingual word embeddings, in order to predict modality association scores by training on a high-resource language and testing on a low-resource one. We ran two experiments, one in a monolingual and the other one in a crosslingual setting. Results show that modality prediction using off-the-shelf crosslingual embeddings indeed has moderate-to-high correlations with human ratings even when regression algorithms are trained on an English resource and tested on a completely unseen language.
Original language | English |
---|---|
Title of host publication | Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics (*SEM) |
Editors | Iryna Gurevych, Marianna Apidianaki, Manaal Faruqui |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 32–38 |
ISBN (Electronic) | 978-1-952148-32-3 |
Publication status | Published - Dec 2020 |
Event | Ninth Joint Conference on Lexical and Computational Semantics - Online Duration: 12 Dec 2020 → 13 Dec 2020 https://sites.google.com/view/starsem2020/ |
Conference
Conference | Ninth Joint Conference on Lexical and Computational Semantics |
---|---|
Abbreviated title | *SEM 2020 |
Period | 12/12/20 → 13/12/20 |
Internet address |