Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning

7 Jan 2019Baoyuan Wu • Weidong Chen • Yanbo Fan • Yong Zhang • Jinlong Hou • Jie Liu • Junzhou Huang • Wei Liu • Tong Zhang

In existing visual representation learning tasks, deep convolutional neural networks (CNNs) are often trained on images annotated with single tags, such as ImageNet. In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model. We efficiently train the ResNet-101 model with multi-label outputs on Tencent ML-Images, taking 90 hours for 60 epochs, based on a large-scale distributed deep learning framework,i.e.,TFplus.

PDF Abstract

Evaluation


No evaluation results yet. Help compare this paper to other papers by submitting the tasks and evaluation metrics from the paper.