Then, an additional penalty term, which is in proportion to the ratio of instance FPR overall FPR, is introduced into the denominator of the softmax-based loss.
We present the full-resolution correspondence learning for cross-domain images, which aids image translation.
To maintain competitive performance with such a light-weight network, we present novel training schemes: Segments of Line segment (SoL) augmentation and geometric learning scheme.
Ranked #4 on Line Segment Detection on wireframe dataset
Learning high-quality sentence representations benefits a wide range of natural language processing tasks.
We introduce 3DB: an extendable, unified framework for testing and debugging vision models using photorealistic simulation.
We introduce AndroidEnv, an open-source platform for Reinforcement Learning (RL) research built on top of the Android ecosystem.
Neural networks have shown great abilities in estimating depth from a single image.
Ranked #1 on Monocular Depth Estimation on Middlebury 2014
In this paper, we propose a new attention mechanism in Transformer termed Cross Attention, which alternates attention inner the image patch instead of the whole image to capture local information and apply attention between image patches which are divided from single-channel feature maps capture global information.