Search Results for author: Aviad Rom

Found 2 papers, 0 papers with code

Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space

no code implementations25 Feb 2024 Aviad Rom, Kfir Bar

We train a bilingual Arabic-Hebrew language model using a transliterated version of Arabic texts in Hebrew, to ensure both languages are represented in the same script.

Language Modelling Translation +1

Supporting Undotted Arabic with Pre-trained Language Models

no code implementations ICNLSP 2021 Aviad Rom, Kfir Bar

We observe a recent behaviour on social media, in which users intentionally remove consonantal dots from Arabic letters, in order to bypass content-classification algorithms.

Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.