A ConvNet for the 2020s

facebookresearch/ConvNeXt 10 Jan 2022

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.

 Ranked #1 on Domain Generalization on ImageNet-Sketch (using extra training data)

Domain Generalization Image Classification +2

2,286
2.74 stars / hour

Masked Autoencoders Are Scalable Vision Learners

facebookresearch/mae 11 Nov 2021

Our MAE approach is simple: we mask random patches of the input image and reconstruct the missing pixels.

Domain Generalization Object Detection +3

2,119
1.28 stars / hour

Extracting Triangular 3D Models, Materials, and Lighting From Images

nvlabs/tiny-cuda-nn 24 Nov 2021

We present an efficient method for joint optimization of topology, materials and lighting from multi-view image observations.

582
1.10 stars / hour

Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI

BoltzmannEntropy/interviews.ai 30 Dec 2021

The second edition of Deep Learning Interviews is home to hundreds of fully-solved problems, from a wide range of key topics in AI.

3,063
1.00 stars / hour

Detecting Twenty-thousand Classes using Image-level Supervision

facebookresearch/Detic 7 Jan 2022

For the first time, we train a detector with all the twenty-one-thousand classes of the ImageNet dataset and show that it generalizes to new datasets without fine-tuning.

Image Classification

651
0.93 stars / hour

A 1D CNN for high accuracy classification and transfer learning in motor imagery EEG-based brain-computer interface

Kubasinska/MI-EEG-1D-CNN Journal of Neural Engineering 2022

In addition, we present a transfer learning method used to extract critical features from the EEG group dataset and then to customize the model to the single individual by training its late layers with only 12-min individual-related data.

Data Augmentation EEG +1

20
0.57 stars / hour

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer

xmu-xiaoma666/External-Attention-pytorch 5 Oct 2021

In this paper, we ask the following question: is it possible to combine the strengths of CNNs and ViTs to build a light-weight and low latency network for mobile vision tasks?

Image Classification Object Detection

3,628
0.55 stars / hour

CoAtNet: Marrying Convolution and Attention for All Data Sizes

xmu-xiaoma666/External-Attention-pytorch NeurIPS 2021

Transformers have attracted increasing interests in computer vision, but they still fall behind state-of-the-art convolutional networks.

Image Classification

3,641
0.55 stars / hour

Layered Neural Atlases for Consistent Video Editing

ykasten/layered-neural-atlases 23 Sep 2021

We present a method that decomposes, or "unwraps", an input video into a set of layered 2D atlases, each providing a unified representation of the appearance of an object (or background) over the video.

Style Transfer Video Editing +2

214
0.47 stars / hour

Feature Selection Methods for Uplift Modeling

uber/causalml 5 May 2020

To address this problem, we introduce a set of feature selection methods designed specifically for uplift modeling, including both filter methods and embedded methods.

Feature Selection

2,690
0.41 stars / hour