Abstract
This paper describes the system we submitted to the shared task on Lexical Simplification for English, Portuguese, and Spanish at the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). We proposed an unsupervised approach in two steps: first, for each language, we used a masked language model with word masking to extract candidate replacements for a difficult word; second, we ranked the candidates according to three different Transformer-based metrics. We then ordered the final list of candidates by lowest average rank across the three metrics.
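
The final aggregation step, ordering candidates by their average rank across metrics, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `rank_candidates`, the metric names, the example words, and all scores are hypothetical, and the sketch assumes that a higher score means a better candidate and that every metric scores every candidate.

```python
from statistics import mean


def rank_candidates(scores_by_metric):
    """Order candidates by average rank across metrics (lower is better).

    scores_by_metric maps a metric name to {candidate: score}. Assumption:
    a higher score means a better candidate under that metric.
    """
    ranks = {}
    for scores in scores_by_metric.values():
        # Rank 1 goes to the best-scoring candidate under this metric.
        ordered = sorted(scores, key=scores.get, reverse=True)
        for position, candidate in enumerate(ordered, start=1):
            ranks.setdefault(candidate, []).append(position)
    # Sort by mean rank; break ties alphabetically for determinism.
    return sorted(ranks, key=lambda c: (mean(ranks[c]), c))


# Hypothetical scores for three placeholder metrics (not from the paper).
example_scores = {
    "metric_a": {"required": 0.91, "mandatory": 0.85, "obligatory": 0.40},
    "metric_b": {"required": 0.70, "mandatory": 0.75, "obligatory": 0.20},
    "metric_c": {"required": 0.88, "mandatory": 0.60, "obligatory": 0.65},
}
ranked = rank_candidates(example_scores)
print(ranked)
```

Averaging ranks rather than raw scores avoids having to normalize metrics that live on different scales, at the cost of discarding how large the score gaps between candidates are.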
Original language | English |
---|---|
Title of host publication | Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022) |
Editors | Sanja Štajner, Horacio Saggion, Daniel Ferrés, Matthew Shardlow, Kim Cheng Sheang, Kai North, Marcos Zampieri, Wei Xu |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 225–230 |
ISBN (Print) | 978-1-959429-25-8 |
Publication status | Published - Dec 2022 |
Event | EMNLP Workshop on Text Simplification, Accessibility and Readability - Abu Dhabi National Exhibition Center, Abu Dhabi, United Arab Emirates Duration: 8 Dec 2022 → … |
Conference
Conference | EMNLP Workshop on Text Simplification, Accessibility and Readability |
---|---|
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 8/12/22 → … |