Search Results for author: Fabian David Schmidt

Found 5 papers, 3 papers with code

Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

no code implementations • 30 Apr 2024 • Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

We further find that KD yields larger gains over pretraining from scratch when the data must be repeated under the fixed computation budget.

Paper
Add Code

One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer

1 code implementation • 16 Oct 2023 • Fabian David Schmidt, Ivan Vulić, Goran Glavaš

Because of this, model selection based on source-language validation is unreliable: it picks model snapshots with suboptimal target-language performance.

Model Selection NER +3

Paper
Code

Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging

1 code implementation • 26 May 2023 • Fabian David Schmidt, Ivan Vulić, Goran Glavaš

The results indicate that averaging model checkpoints yields systematic and consistent performance gains across diverse target languages in all tasks.

Cross-Lingual Transfer Model Selection +4

Paper
Code

SLICER: Sliced Fine-Tuning for Low-Resource Cross-Lingual Transfer for Named Entity Recognition

1 code implementation • Proceedings of the Conference on Empirical Methods in Natural Language Processing 2022 • Fabian David Schmidt, Ivan Vulić, Goran Glavaš

Large multilingual language models generally demonstrate impressive results in zero-shot cross-lingual transfer, yet often fail to successfully transfer to low-resource languages, even for token-level prediction tasks like named entity recognition (NER).

Multilingual text classification named-entity-recognition +3

Paper
Code

SEAGLE: A Platform for Comparative Evaluation of Semantic Encoders for Information Retrieval

no code implementations • IJCNLP 2019 • Fabian David Schmidt, Markus Dietsche, Simone Paolo Ponzetto, Goran Glava{\v{s}}

We introduce Seagle, a platform for comparative evaluation of semantic text encoding models on information retrieval (IR) tasks.

Information Retrieval Retrieval +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.