This is a corpus of about 500 computer vision datasets, from which the authors sampled 114 dataset publications across different vision tasks and coded for themes through both structured and qualitative content analysis. This work most closely pairs with the following research question: How do dataset developers in CV and NLP research, describe and motivate the decisions that go into their creation?
Paper | Code | Results | Date | Stars |
---|