In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting.
#87 best model for Image Classification on ImageNet
We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data.
In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image.
#5 best model for Real-Time Object Detection on PASCAL VOC 2007
Our hypothesis is that the appearance of a person (their pose, clothing, action) is a powerful cue for localizing the objects they are interacting with.
#4 best model for Human-Object Interaction Detection on HICO-DET