Evaluating the Values of Sources in Transfer Learning

NAACL 2021  ·  Md Rizwan Parvez, Kai-Wei Chang ·

Transfer learning that adapts a model trained on data-rich sources to low-resource targets has been widely applied in natural language processing (NLP). However, when training a transfer model over multiple sources, not every source is equally useful for the target. To better transfer a model, it is essential to understand the values of the sources. In this paper, we develop SEAL-Shap, an efficient source valuation framework for quantifying the usefulness of the sources (e.g., domains/languages) in transfer learning based on the Shapley value method. Experiments and comprehensive analyses on both cross-domain and cross-lingual transfers demonstrate that our framework is not only effective in choosing useful transfer sources but also the source values match the intuitive source-target similarity.

PDF Abstract NAACL 2021 PDF NAACL 2021 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Cross-Lingual POS Tagging Universal Dependency Treebank SEAL-Shap Average POS Tagging Accuracy 83.66 # 1
Accuracy-(EN) 89.47 # 1

Methods


No methods listed for this paper. Add relevant methods here