Search Results for author: Fabian David Schmidt

Found 5 papers, 3 papers with code

Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

no code implementations30 Apr 2024 Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

We further find that KD yields larger gains over pretraining from scratch when the data must be repeated under the fixed computation budget.

One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer

1 code implementation16 Oct 2023 Fabian David Schmidt, Ivan Vulić, Goran Glavaš

Because of this, model selection based on source-language validation is unreliable: it picks model snapshots with suboptimal target-language performance.

Model Selection NER +3

Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging

1 code implementation26 May 2023 Fabian David Schmidt, Ivan Vulić, Goran Glavaš

The results indicate that averaging model checkpoints yields systematic and consistent performance gains across diverse target languages in all tasks.

Cross-Lingual Transfer Model Selection +4

SLICER: Sliced Fine-Tuning for Low-Resource Cross-Lingual Transfer for Named Entity Recognition

1 code implementation Proceedings of the Conference on Empirical Methods in Natural Language Processing 2022 Fabian David Schmidt, Ivan Vulić, Goran Glavaš

Large multilingual language models generally demonstrate impressive results in zero-shot cross-lingual transfer, yet often fail to successfully transfer to low-resource languages, even for token-level prediction tasks like named entity recognition (NER).

Multilingual text classification named-entity-recognition +3

Cannot find the paper you are looking for? You can Submit a new open access paper.