Trending Research

ResNeSt: Split-Attention Networks

19 Apr 2020zhanghang1989/ResNeSt

While image classification models have recently continued to advance, most downstream applications such as object detection and semantic segmentation still employ ResNet variants as the backbone network due to their simple and modular structure.

IMAGE CLASSIFICATION INSTANCE SEGMENTATION OBJECT DETECTION SEMANTIC SEGMENTATION

907
2.58 stars / hour

3D Photography using Context-aware Layered Depth Inpainting

9 Apr 2020vt-vl-lab/3d-photo-inpainting

We propose a method for converting a single RGB-D input image into a 3D photo - a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view.

NOVEL VIEW SYNTHESIS

2,072
1.14 stars / hour

EfficientDet: Scalable and Efficient Object Detection

20 Nov 2019zylo117/Yet-Another-EfficientDet-Pytorch

Model efficiency has become increasingly important in computer vision.

AUTOML OBJECT DETECTION

2,476
1.06 stars / hour

Learning to See in the Dark

CVPR 2018 cchen156/Learning-to-See-in-the-Dark

Imaging in low light is challenging due to low photon count and low SNR.

DEBLURRING DENOISING

4,459
0.76 stars / hour

MPNet: Masked and Permuted Pre-training for Language Understanding

20 Apr 2020microsoft/MPNet

Since BERT neglects dependency among predicted tokens, XLNet introduces permuted language modeling (PLM) for pre-training to address this problem.

LANGUAGE MODELLING

20
0.75 stars / hour

A Simple Baseline for Multi-Object Tracking

4 Apr 2020ifzhang/FairMOT

There has been remarkable progress on object detection and re-identification in recent years which are the core components for multi-object tracking.

 SOTA for Multi-Object Tracking on MOT16 (using extra training data)

MULTI-OBJECT TRACKING OBJECT DETECTION

897
0.77 stars / hour

Longformer: The Long-Document Transformer

10 Apr 2020allenai/longformer

To address this limitation, we introduce the Longformer with an attention mechanism that scales linearly with sequence length, making it easy to process documents of thousands of tokens or longer.

LANGUAGE MODELLING

340
0.73 stars / hour

Semantically Multi-modal Image Synthesis

28 Mar 2020Seanseattle/SMIS

Experiments on several challenging datasets demonstrate the superiority of GroupDNet on performing the SMIS task.

IMAGE GENERATION

76
0.61 stars / hour

Few-Shot NLG with Pre-Trained Language Model

21 Apr 2019czyssrs/Few-Shot-NLG

Neural-based end-to-end approaches to natural language generation (NLG) from structured data or knowledge are data-hungry, making their adoption for real-world applications difficult with limited data.

FEW-SHOT LEARNING LANGUAGE MODELLING TEXT GENERATION

70
0.54 stars / hour