Abstract
In this paper, we describe the system we presented at the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022) regarding the shared task on Lexical Simplification for English, Portuguese, and Spanish. We proposed an unsupervised approach in two steps: First, we used a masked language model with word masking for each language to extract possible candidates for the replacement of a difficult word; second, we ranked the candidates according to three different Transformer-based metrics. Finally, we determined our list of candidates based on the lowest average rank across different metrics.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022) |
| Editors | Sanja Štajner, Horacio Saggion, Daniel Ferrés, Matthew Shardlow, Kim Cheng Sheang, Kai North, Marcos Zampieri, Wei Xu |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 225–230 |
| ISBN (Print) | 978-1-959429-25-8 |
| Publication status | Published - Dec 2022 |
| Event | EMNLP Workshop on Workshop on Text Simplification, Accessibility and Readability - Abu Dhabi National Exhibition Center, Abu Dhabi, United Arab Emirates Duration: 8 Dec 2022 → … |
Conference
| Conference | EMNLP Workshop on Workshop on Text Simplification, Accessibility and Readability |
|---|---|
| Country/Territory | United Arab Emirates |
| City | Abu Dhabi |
| Period | 8/12/22 → … |