An End-to-End Transformer Model for 3D Object Detection

facebookresearch/3detr 16 Sep 2021

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point clouds.

3D Object Detection

164
1.90 stars / hour

Evaluating Large Language Models Trained on Code

microsoft/PythonProgrammingPuzzles 7 Jul 2021

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities.

Code Generation Language Modelling

501
1.15 stars / hour

Physics-based Deep Learning

thunil/Physics-Based-Deep-Learning 11 Sep 2021

This digital book contains a practical and comprehensive introduction of everything related to deep learning in the context of physical simulations.

Physical Simulations

912
1.09 stars / hour

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

xinntao/Real-ESRGAN 22 Jul 2021

Though many attempts have been made in blind super-resolution to restore low-resolution images with unknown and complex degradations, they are still far from addressing general real-world degraded images.

Video Super-Resolution

2,697
0.92 stars / hour

Robust High-Resolution Video Matting with Temporal Guidance

PeterL1n/RobustVideoMatting 25 Aug 2021

We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance.

Video Matting

1,046
0.91 stars / hour

Image Shape Manipulation from a Single Augmented Training Sample

eliahuhorwitz/DeepSIM 13 Sep 2021

In this paper, we present DeepSIM, a generative model for conditional image manipulation based on a single image.

Image Manipulation

228
0.65 stars / hour

RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

princeton-vl/raft-stereo 15 Sep 2021

We introduce RAFT-Stereo, a new deep architecture for rectified stereo based on the optical flow network RAFT.

Optical Flow Estimation Stereo Matching

50
0.46 stars / hour

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

RainbowRui/Landmark-Driven-Facial-Expression-Recognition CVPR 2020

Annotating a qualitative large-scale facial expression dataset is extremely difficult due to the uncertainties caused by ambiguous facial expressions, low-quality facial images, and the subjectiveness of annotators.

Facial Expression Recognition

57
0.46 stars / hour

Recurrent Multi-view Alignment Network for Unsupervised Surface Registration

WanquanF/RMA-Net CVPR 2021

Learning non-rigid registration in an end-to-end manner is challenging due to the inherent high degrees of freedom and the lack of labeled training data.

Deformable Object Manipulation Neural Rendering +1

69
0.40 stars / hour

PnP-DETR: Towards Efficient Visual Analysis with Transformers

twangnh/pnp-detr 15 Sep 2021

Recently, DETR pioneered the solution of vision tasks with transformers, it directly translates the image feature map into the object detection result.

Object Detection Panoptic Segmentation

40
0.39 stars / hour