PolyU-CBS at TSAR-2022: A Simple, Rank-Based Method for Complex Word Substitution in Two Steps

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

In this paper, we describe the system we presented at the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022) regarding the shared task on Lexical Simplification for English, Portuguese, and Spanish. We proposed an unsupervised approach in two steps: First, we used a masked language model with word masking for each language to extract possible candidates for the replacement of a difficult word; second, we ranked the candidates according to three different Transformer-based metrics. Finally, we determined our list of candidates based on the lowest average rank across different metrics.
Original languageEnglish
Title of host publication Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022)
EditorsSanja Štajner, Horacio Saggion, Daniel Ferrés, Matthew Shardlow, Kim Cheng Sheang, Kai North, Marcos Zampieri, Wei Xu
PublisherAssociation for Computational Linguistics (ACL)
Pages225–230
ISBN (Print)978-1-959429-25-8
Publication statusPublished - Dec 2022
EventEMNLP Workshop on Workshop on Text Simplification, Accessibility and Readability - Abu Dhabi National Exhibition Center, Abu Dhabi, United Arab Emirates
Duration: 8 Dec 2022 → …

Conference

ConferenceEMNLP Workshop on Workshop on Text Simplification, Accessibility and Readability
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period8/12/22 → …

Fingerprint

Dive into the research topics of 'PolyU-CBS at TSAR-2022: A Simple, Rank-Based Method for Complex Word Substitution in Two Steps'. Together they form a unique fingerprint.

Cite this