1 code implementation • EMNLP 2021 • Taelin Karidi, Yichu Zhou, Nathan Schneider, Omri Abend, Vivek Srikumar
We present a method for exploring regions around individual points in a contextualized vector space (particularly, BERT space), as a way to investigate how these regions correspond to word senses.
no code implementations • 10 Apr 2024 • Li Zhou, Taelin Karidi, Nicolas Garneau, Yong Cao, Wanlong Liu, Wenyu Chen, Daniel Hershcovich
Recent studies have highlighted the presence of cultural biases in Large Language Models (LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively.
1 code implementation • 20 Oct 2023 • Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend
Despite the impressive growth of the abilities of multilingual language models, such as XLM-R and mT5, it has been shown that they still face difficulties when tackling typologically-distant languages, particularly in the low-resource setting.
no code implementations • 24 May 2023 • Taelin Karidi, Leshem Choshen, Gal Patel, Omri Abend
For example, nouns and verbs are among the most frequent POS tags.
2 code implementations • 16 Mar 2023 • Alexander Yom Din, Taelin Karidi, Leshem Choshen, Mor Geva
Moreover, in the context of language modeling, our method allows "peeking" into early layer representations of GPT-2 and BERT, showing that often LMs already predict the final output in early layers.
1 code implementation • EMNLP 2021 • Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend
We explore the link between the extent to which syntactic relations are preserved in translation and the ease of correctly constructing a parse tree in a zero-shot setting.
1 code implementation • 23 Sep 2021 • Taelin Karidi, Yichu Zhou, Nathan Schneider, Omri Abend, Vivek Srikumar
We present a method for exploring regions around individual points in a contextualized vector space (particularly, BERT space), as a way to investigate how these regions correspond to word senses.
1 code implementation • ACL 2020 • Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend
The patterns in which the syntax of different languages converges and diverges are often used to inform work on cross-lingual transfer.
no code implementations • 12 Mar 2019 • Alexander Port, Taelin Karidi, Matilde Marcolli
We use the persistent homology method of topological data analysis and dimensional analysis techniques to study data of syntactic structures of world languages.