1 code implementation • 27 Jul 2022 • Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Josh Susskind
We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera.
Ranked #1 on Image Generation on ARKitScenes
It is not only the first RGB-D dataset captured with a now widely available depth sensor, but, to the best of our knowledge, it is also the largest indoor scene understanding dataset released.
This work details Sighthound's fully automated license plate detection and recognition system.
This paper describes the details of Sighthound's fully automated age, gender and emotion recognition system.
The backbone of our system is a deep convolutional neural network that is not only computationally inexpensive, but also provides state-of-the-art results on several competitive benchmarks.
In this paper, we propose a tracker that addresses the aforementioned problems and is capable of tracking hundreds of people efficiently.
Data association is the backbone of many multiple object tracking (MOT) methods.
In this paper, we show that multiple object tracking (MOT) can be formulated in a framework where detection and data association are performed simultaneously.
A video captures a sequence and interactions of concepts that can be static, for instance, objects or scenes, or dynamic, such as actions.
Recent years have seen a major push for face recognition technology due to the large expansion of image sharing on social networks.
In general, our method takes detection bounding boxes of a generic detector as input and generates the detection output with higher average precision and precise object regions.