LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

timdettmers/bitsandbytes 15 Aug 2022

We develop a procedure for Int8 matrix multiplication for feed-forward and attention projection layers in transformers, which cut the memory needed for inference by half while retaining full precision performance.

Language Modelling Linguistic Acceptability +4

1,237
0.38 stars / hour

HFT: Lifting Perspective Representations via Hybrid Feature Transformation

jiayuzou2020/hft 11 Apr 2022

In order to reap the benefits and avoid the drawbacks of CBFT and CFFT, we propose a novel framework with a Hybrid Feature Transformation module (HFT).

Autonomous Driving Decision Making +1

102
0.38 stars / hour

Image as Set of Points

ma-xu/context-cluster 2 Mar 2023

Context clusters (CoCs) view an image as a set of unorganized points and extract features via simplified clustering algorithm.

309
0.38 stars / hour

A Simple Framework for Open-Vocabulary Segmentation and Detection

idea-research/openseed 14 Mar 2023

We present \ourmodel{}, a simple Open-vocabulary Segmentation and Detection framework that jointly learns from different segmentation and detection datasets.

 Ranked #1 on Instance Segmentation on ADE20K val (using extra training data)

Instance Segmentation Panoptic Segmentation

59
0.37 stars / hour

Robust Speech Recognition via Large-Scale Weak Supervision

openai/whisper Preprint 2022

We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet.

Robust Speech Recognition speech-recognition

28,877
0.37 stars / hour

MaskSketch: Unpaired Structure-guided Masked Image Generation

lllyasviel/controlnet 10 Feb 2023

We show that intermediate self-attention maps of a masked generative transformer encode important structural information of the input image, such as scene layout and object shape, and we propose a novel sampling method based on this observation to enable structure-guided generation.

Conditional Image Generation Image-to-Image Translation +2

14,374
0.37 stars / hour

DeepMIM: Deep Supervision for Masked Image Modeling

oliverrensu/deepmim 15 Mar 2023

Deep supervision, which involves extra supervisions to the intermediate features of a neural network, was widely used in image classification in the early deep learning era since it significantly reduces the training difficulty and eases the optimization like avoiding gradient vanish over the vanilla training.

Image Classification object-detection +2

32
0.37 stars / hour

scenic

google-research/scenic 30 Jan 2023

Scenic: A Jax Library for Computer Vision Research and Beyond

Inductive Bias

1,942
0.37 stars / hour

The Intel Neuromorphic DNS Challenge

intellabs/intelneuromorphicdnschallenge 16 Mar 2023

A critical enabler for progress in neuromorphic computing research is the ability to transparently evaluate different neuromorphic solutions on important tasks and to compare them to state-of-the-art conventional solutions.

Audio Denoising Denoising

24
0.36 stars / hour

DAMO-YOLO : A Report on Real-Time Object Detection Design

tinyvision/damo-yolo 23 Nov 2022

In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.

Neural Architecture Search object-detection +1

2,301
0.36 stars / hour