SHERF: Generalizable Human NeRF from a Single Image

skhu101/sherf 22 Mar 2023

To this end, we propose a bank of 3D-aware hierarchical features, including global, point-level, and pixel-aligned features, to facilitate informative encoding.

3D Human Reconstruction

69
1.24 stars / hour

NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping

junyuandeng/nerf-loam 19 Mar 2023

To bridge this gap, in this paper, we propose a novel NeRF-based LiDAR odometry and mapping approach, NeRF-LOAM, consisting of three modules neural odometry, neural mapping, and mesh reconstruction.

129
1.19 stars / hour

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

winfredy/sadtalker 22 Nov 2022

We present SadTalker, which generates 3D motion coefficients (head pose, expression) of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation.

Talking Head Generation

584
1.18 stars / hour

LoRA: Low-Rank Adaptation of Large Language Models

microsoft/LoRA ICLR 2022

We propose Low-Rank Adaptation, or LoRA, which freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks.

Language Modelling

1,303
1.17 stars / hour

Learning Context-aware Classifier for Semantic Segmentation

Pointcept/Pointcept 21 Mar 2023

Semantic segmentation is still a challenging task for parsing diverse contexts in different scenes, thus the fixed classifier might not be able to well address varying feature distributions during testing.

Semantic Segmentation

127
1.07 stars / hour

SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot

ist-daslab/sparsegpt 2 Jan 2023

We show for the first time that large-scale generative pretrained transformer (GPT) family models can be pruned to at least 50% sparsity in one-shot, without any retraining, at minimal loss of accuracy.

 Ranked #1 on Language Modelling on WikiText-2 (using extra training data)

Common Sense Reasoning Language Modelling +2

52
1.00 stars / hour

Neural Preset for Color Style Transfer

ZHKKKe/NeuralPreset 23 Mar 2023

In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed.

Image Dehazing Image Harmonization +2

27
0.96 stars / hour

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

chenyangqiqi/fatezero 16 Mar 2023

We also have a better zero-shot shape-aware editing ability based on the text-to-video model.

Video Editing

388
0.90 stars / hour

Planning-oriented Autonomous Driving

opendrivelab/uniad 20 Dec 2022

Oriented at this, we revisit the key components within perception and prediction, and prioritize the tasks such that all these tasks contribute to planning.

Autonomous Driving Philosophy

311
0.88 stars / hour

Ablating Concepts in Text-to-Image Diffusion Models

nupurkmr9/concept-ablation 23 Mar 2023

To achieve this goal, we propose an efficient method of ablating concepts in the pretrained model, i. e., preventing the generation of a target concept.

23
0.85 stars / hour