The ShareGPT4V dataset is a large-scale resource of 1.2 million highly descriptive image captions. The captions surpass those in existing datasets in diversity and information content, covering world knowledge, object properties, spatial relationships, and aesthetic evaluations². The dataset was introduced to address bottlenecks in large multi-modal models and to improve their performance by providing better captions².

Source: Conversation with Bing, 3/19/2024

(1) ShareGPT4V: Improving Large Multi-Modal Models with Better Captions. https://arxiv.org/pdf/2311.12793.pdf
(2) openchat/openchat_sharegpt4_dataset · Datasets at Hugging Face. https://huggingface.co/datasets/openchat/openchat_sharegpt4_dataset
(3) openchat/openchat_sharegpt_v3 · Datasets at Hugging Face. https://huggingface.co/datasets/openchat/openchat_sharegpt_v3
(4) RyokoAI/ShareGPT52K · Datasets at Hugging Face. https://huggingface.co/datasets/RyokoAI/ShareGPT52K

License


  • Unknown

Languages