Multiview Compressive Coding for 3D Reconstruction

facebookresearch/mcc 19 Jan 2023

We introduce a simple framework that operates on 3D points of single objects or whole scenes coupled with category-agnostic large-scale training from diverse RGB-D videos.

3D Reconstruction Self-Supervised Learning +1

181
0.28 stars / hour

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

BlinkDL/RWKV-LM 18 Nov 2022

We propose SmoothQuant, a training-free, accuracy-preserving, and general-purpose post-training quantization (PTQ) solution to enable 8-bit weight, 8-bit activation (W8A8) quantization for LLMs that can be implemented efficiently.

Quantization

1,106
0.27 stars / hour

GLM-130B: An Open Bilingual Pre-trained Model

thudm/glm-130b 5 Oct 2022

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters.

Language Modelling Multi-task Language Understanding +1

1,688
0.27 stars / hour

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

automatic1111/stable-diffusion-webui 12 Nov 2022

In this work, we present a conceptually simple and effective method to train a strong bilingual/multilingual multimodal representation model.

Contrastive Learning Cross-Modal Retrieval +11

29,774
0.25 stars / hour

Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations

openai/triton MAPL 2019

The validation and deployment of novel research ideas in the field of Deep Learning is often limited by the availability of efficient compute kernels for certain basic primitives.

5,038
0.24 stars / hour

Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers

microsoft/lmops 20 Dec 2022

In order to better understand how ICL works, this paper explains language models as meta-optimizers and understands ICL as a kind of implicit finetuning.

Pretrained Language Models

507
0.24 stars / hour

DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching

XT-1997/DeepMatcher 8 Jan 2023

In this work, we propose DeepMatcher, a deep Transformer-based network built upon our investigation of local feature matching in detector-free methods.

142
0.23 stars / hour

BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

microsoft/biogpt 19 Oct 2022

Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain.

Document Classification Language Modelling +3

132
0.22 stars / hour

A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction

hustvl/saunet 24 Jan 2023

We present a simple, efficient, and scalable unfolding network, SAUNet, to simplify the network design with an adaptive alternate optimization framework for hyperspectral image (HSI) reconstruction.

Image Reconstruction

16
0.22 stars / hour

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

showlab/Tune-A-Video 22 Dec 2022

To reproduce the success of text-to-image (T2I) generation, recent works in text-to-video (T2V) generation employ large-scale text-video dataset for fine-tuning.

Style Transfer Text-to-Video Generation +1

147
0.22 stars / hour