LAION-COCO is the world’s largest dataset of 600M generated high-quality captions for publicly available web-images. The images are extracted from the english subset of Laion-5B with an ensemble of BLIP L/14 and 2 CLIP versions (L/14 and RN50x64). This dataset allow models to produce high quality captions for images.
Source: LAION COCO: 600M SYNTHETIC CAPTIONS FROM LAION2B-ENPaper | Code | Results | Date | Stars |
---|