no code implementations • 7 Jan 2025 • Alexis Matzopoulos, Charl Hendriks, Hishaam Mahomed, Francois Meyer
The BabyLM challenge called on participants to develop sample-efficient language models.
no code implementations • 29 Mar 2024 • Francois Meyer, Jan Buys
Multilingual modelling can improve machine translation for low-resource languages, partly through shared subword representations.
2 code implementations • 12 Mar 2024 • Francois Meyer, Jan Buys
In this paper we tackle data-to-text for isiXhosa, which is low-resource and agglutinative.
1 code implementation • 11 May 2023 • Francois Meyer, Jan Buys
We propose a departure from this paradigm, called subword segmental machine translation (SSMT).
no code implementations • 21 Oct 2022 • Khalid N. Elmadani, Francois Meyer, Jan Buys
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 Shared Task: Large-Scale Machine Translation Evaluation for African Languages.
1 code implementation • 12 Oct 2022 • Francois Meyer, Jan Buys
We also train our model as a word-level sequence model, resulting in an unsupervised morphological segmenter that outperforms existing methods by a large margin for all 4 languages.
1 code implementation • NAACL 2021 • Yvette Oortwijn, Jelke Bloem, Pia Sommerauer, Francois Meyer, Wei Zhou, Antske Fokkens
We investigate the possibilities and limitations of using distributional semantic models for analyzing philosophical data by means of a realistic use-case.
no code implementations • CONLL 2020 • Francois Meyer, Martha Lewis
Words can have multiple senses.
no code implementations • NeurIPS 2007 • Francois Meyer, Greg Stephens
Here, we use data from the Experience Based Cognition competition to compare global and local methods of prediction applying both linear and nonlinear techniques of dimensionality reduction.