Search Results for author: Simin Fan

Found 4 papers, 2 papers with code

DoGE: Domain Reweighting with Generalization Estimation

no code implementations23 Oct 2023 Simin Fan, Matteo Pagliardini, Martin Jaggi

Moreover, aiming to generalize to out-of-domain target tasks, which is unseen in the pretraining corpus (OOD domain), DoGE can effectively identify inter-domain dependencies, and consistently achieves better test perplexity on the target domain.

Domain Generalization Language Modelling

Irreducible Curriculum for Language Model Pretraining

no code implementations23 Oct 2023 Simin Fan, Martin Jaggi

Automatic data selection and curriculum design for training large language models is challenging, with only a few existing methods showing improvements over standard training.

Language Modelling

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs

1 code implementation NAACL 2022 Xu Wang, Simin Fan, Jessica Houghton, Lu Wang

NLP-powered automatic question generation (QG) techniques carry great pedagogical potential of saving educators' time and benefiting student learning.

Misconceptions Question Generation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.