Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular Data

salesforce/causalai 25 Jan 2023

We introduce the Salesforce CausalAI Library, an open-source library for causal analysis using observational data.

Causal Discovery Causal Inference +1

LAION-5B: An open large-scale dataset for training next generation image-text models

mlfoundations/open_clip NeurIPS 2022 Datasets and Benchmarks 2022

We show successful replication and fine-tuning of foundational models like CLIP, GLIDE and Stable Diffusion using the dataset, and discuss further experiments enabled with an openly available dataset of this scale.

Image Generation Zero-Shot Learning

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

air-discover/toist 19 Oct 2022

We study the challenging problem of task-oriented detection, which aims to find the objects that best afford an action indicated by verbs like "sit comfortably on".

Instance Segmentation Referring Expression +2

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

automatic1111/stable-diffusion-webui 12 Nov 2022

In this work, we present a conceptually simple and effective method to train a strong bilingual/multilingual multimodal representation model.

Contrastive Learning Cross-Modal Retrieval +11

SEANet: A Multi-modal Speech Enhancement Network

google-research/seanet 4 Sep 2020

We explore the possibility of leveraging accelerometer data to perform speech enhancement in very noisy conditions.

Speech Enhancement

Hungry Hungry Hippos: Towards Language Modeling with State Space Models

hazyresearch/h3 28 Dec 2022

We use synthetic language modeling tasks to understand the gap between SSMs and attention.

Few-Shot Learning Language Modelling

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

hello-simpleai/chatgpt-comparison-detection 18 Jan 2023

We call the collected dataset the Human ChatGPT Comparison Corpus (HC3).

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

BlinkDL/RWKV-LM 18 Nov 2022

We propose SmoothQuant, a training-free, accuracy-preserving, and general-purpose post-training quantization (PTQ) solution to enable 8-bit weight, 8-bit activation (W8A8) quantization for LLMs that can be implemented efficiently.


Generate rather than Retrieve: Large Language Models are Strong Context Generators

wyu97/GenRead 21 Sep 2022

We call our method generate-then-read (GenRead), which first prompts a large language model to generate contextual documents based on a given question, and then reads the generated documents to produce the final answer.

Language Modelling Open-Domain Question Answering

Image Super-Resolution using Efficient Striped Window Transformer

fried-rice-lab/friedricelab 24 Jan 2023

To further exploit the potential of the transformer, we propose a novel flexible window training strategy.

Image Super-Resolution Single Image Super Resolution

