Transfer Learning

2824 papers with code • 7 benchmarks • 14 datasets

Transfer Learning is a machine learning technique where a model trained on one task is re-purposed and fine-tuned for a related, but different task. The idea behind transfer learning is to leverage the knowledge learned from a pre-trained model to solve a new, but related problem. This can be useful in situations where there is limited data available to train a new model from scratch, or when the new task is similar enough to the original task that the pre-trained model can be adapted to the new problem with only minor modifications.

( Image credit: Subodh Malgonde )

Benchmarks

Add a Result

These leaderboards are used to track progress in Transfer Learning

Dataset	Best Model	Compare
Office-Home	Ours TL-VGG16	See all
Amazon Review Polarity	Random	See all
BanglaLekha Isolated Dataset	Chatterjee, Dutta et al.[1] Transfer Learning on ResNet-50 91.13 % (50 Char) + 98.42% (Numbers)	See all
COCO70	Co-Tuning	See all
100 sleep nights of 8 caregivers	CNN	See all
KITTI Object Tracking Evaluation 2012	Physical Access	See all
Retinal Fundus MultiDisease Image Dataset (RFMiD)	riadd.aucmedi	See all

Libraries

Use these libraries to find Transfer Learning models and implementations

rwightman/pytorch-image-models

8 papers

29,774

thuml/Transfer-Learning-Library

8 papers

3,146

yoshitomo-matsubara/torchdistill

8 papers

1,272

huggingface/transformers

7 papers

125,059

See all 9 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

tensorflow/tpu • • ICML 2019

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available.

133

Paper
Code

Universal Language Model Fine-tuning for Text Classification

fastai/fastai • • ACL 2018

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch.

Paper
Code

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

UKPLab/sentence-transformers • • IJCNLP 2019

However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10, 000 sentences requires about 50 million inference computations (~65 hours) with BERT.

Paper
Code

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

huggingface/transformers • • arXiv 2019

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP).

Paper
Code

High Quality Monocular Depth Estimation via Transfer Learning

ialhashim/DenseDepth • • 31 Dec 2018

Accurate depth estimation from images is a fundamental task in many applications including scene understanding and reconstruction.

Paper
Code

ResNeSt: Split-Attention Networks

zhanghang1989/ResNeSt • • 19 Apr 2020

It is well known that featuremap attention and multi-path representation are important for visual recognition.

Paper
Code

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

huggingface/transformers • • NeurIPS 2019

As Transfer Learning from large-scale pre-trained models becomes more prevalent in Natural Language Processing (NLP), operating these large models in on-the-edge and/or under constrained computational training or inference budgets remains challenging.

Paper
Code

Bag of Tricks for Image Classification with Convolutional Neural Networks

dmlc/gluon-cv • • CVPR 2019

Much of the recent progress made in image classification research can be credited to training procedure refinements, such as changes in data augmentations and optimization methods.

Paper
Code

Universal Sentence Encoder

facebookresearch/InferSent • • 29 Mar 2018

For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance.

Paper
Code

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

facebookresearch/InferSent • • EMNLP 2017

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features.

Paper
Code

Transfer Learning

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result