An implementation of model & data parallel GPT3-like models using the mesh-tensorflow library.
Source: EleutherAI/GPT-Neo
Paper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Language Modelling | 7 | 18.92% |
Code Generation | 5 | 13.51% |
Text Generation | 5 | 13.51% |
Prompt Engineering | 3 | 8.11% |
Large Language Model | 2 | 5.41% |
Question Answering | 2 | 5.41% |
Quantization | 1 | 2.70% |
Program Synthesis | 1 | 2.70% |
Text-to-Code Generation | 1 | 2.70% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |