UL2: Unifying Language Learning Paradigms

1 code implementation 10 May 2022

Our model also achieves strong results in in-context learning, outperforming 175B GPT-3 on zero-shot SuperGLUE and tripling the performance of T5-XXL on one-shot summarization.

Information Retrieval, Long-range modeling, +4

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning

3 code implementations ICLR 2022

Despite the recent success of multi-task learning and transfer learning for natural language processing (NLP), few works have systematically studied the effect of scaling up the number of tasks during pre-training.

Denoising, Multi-Task Learning

