Pre-trained representations are becoming crucial for many NLP and perception tasks. While representation learning in NLP has transitioned to training on raw text without human annotations, visual and vision-language representations still rely heavily on curated training datasets that are expensive or require expert knowledge... (read more)
PDF Abstract
Ranked #1 on
Image Classification
on VTAB-1k
(using extra training data)
METHOD | TYPE | |
---|---|---|
๐ค No Methods Found | Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet |