no code implementations • 3 Aug 2022 • Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Cheyn Shmuel Shmidman, Eli Handel, Moshe Koppel
We present a new pre-trained language model (PLM) for Rabbinic Hebrew, termed Berel (BERT Embeddings for Rabbinic-Encoded Language).
no code implementations • Findings of the Association for Computational Linguistics 2020 • Avi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Moshe Koppel, Reut Tsarfaty
One of the primary tasks of morphological parsers is the disambiguation of homographs.
no code implementations • ACL 2020 • Avi Shmidman, Shaltiel Shmidman, Moshe Koppel, Yoav Goldberg
We present a system for automatic diacritization of Hebrew text.
1 code implementation • 11 Sep 2018 • Yonatan Belinkov, Alexander Magidow, Alberto Barrón-Cedeño, Avi Shmidman, Maxim Romanov
Arabic is a widely-spoken language with a long and rich history, but existing corpora and language technology focus mostly on modern Arabic and its varieties.
no code implementations • WS 2016 • Yonatan Belinkov, Alexander Magidow, Maxim Romanov, Avi Shmidman, Moshe Koppel
Arabic is a widely-spoken language with a rich and long history spanning more than fourteen centuries.
no code implementations • 28 Feb 2016 • Avi Shmidman, Moshe Koppel, Ely Porat
We propose a method for efficiently finding all parallel passages in a large corpus, even if the passages are not quite identical due to rephrasing and orthographic variation.