Instant Neural Graphics Primitives with a Multiresolution Hash Encoding

nvlabs/instant-ngp 16 Jan 2022

Neural graphics primitives, parameterized by fully connected neural networks, can be costly to train and evaluate.

2,006 stars
1.95 stars / hour

A ConvNet for the 2020s

facebookresearch/ConvNeXt 10 Jan 2022

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification model.

Ranked #1 on Domain Generalization on ImageNet-Sketch (using extra training data)

Domain Generalization, Image Classification, +2

2,530 stars
1.79 stars / hour

Masked Autoencoders Are Scalable Vision Learners

facebookresearch/mae 11 Nov 2021

Our MAE approach is simple: we mask random patches of the input image and reconstruct the missing pixels.

Domain Generalization, Object Detection, +3

2,228 stars
1.42 stars / hour

Extracting Triangular 3D Models, Materials, and Lighting From Images

nvlabs/tiny-cuda-nn 24 Nov 2021

We present an efficient method for joint optimization of topology, materials and lighting from multi-view image observations.

674 stars
1.20 stars / hour

Detecting Twenty-thousand Classes using Image-level Supervision

facebookresearch/Detic 7 Jan 2022

For the first time, we train a detector with all the twenty-one-thousand classes of the ImageNet dataset and show that it generalizes to new datasets without fine-tuning.

Image Classification

703 stars
0.78 stars / hour

The effect of information controls on developers in China: An analysis of censorship in Chinese open source projects

citizenlab/chat-censorship COLING 2018

Censorship of Internet content in China is understood to operate through a system of intermediary liability whereby service providers are liable for the content on their platforms.

464 stars
0.70 stars / hour

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

hkunlp/unifiedskg 16 Jan 2022

Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases.

Few-Shot Learning, Question Answering, +2

80 stars
0.68 stars / hour

Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI

BoltzmannEntropy/interviews.ai 30 Dec 2021

The second edition of Deep Learning Interviews is home to hundreds of fully-solved problems, from a wide range of key topics in AI.

3,126 stars
0.62 stars / hour

Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training

hpcaitech/colossalai 28 Oct 2021

The Transformer architecture has improved the performance of deep learning models in domains such as Computer Vision and Natural Language Processing.

592 stars
0.56 stars / hour

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer

xmu-xiaoma666/External-Attention-pytorch 5 Oct 2021

In this paper, we ask the following question: is it possible to combine the strengths of CNNs and ViTs to build a light-weight and low latency network for mobile vision tasks?

Image Classification, Object Detection

3,703 stars
0.50 stars / hour