Search Results for author: Grace Luo

Found 7 papers, 3 papers with code

Readout Guidance: Learning Control from Diffusion Features

no code implementations • 4 Dec 2023 • Grace Luo, Trevor Darrell, Oliver Wang, Dan B Goldman, Aleksander Holynski

We present Readout Guidance, a method for controlling text-to-image diffusion models with learned signals.

Paper
Add Code

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

no code implementations • NeurIPS 2023 • Grace Luo, Lisa Dunlap, Dong Huk Park, Aleksander Holynski, Trevor Darrell

We propose Diffusion Hyperfeatures, a framework for consolidating multi-scale and multi-timestep feature maps into per-pixel feature descriptors that can be used for downstream tasks.

Semantic correspondence

Paper
Add Code

Shape-Guided Diffusion with Inside-Outside Attention

no code implementations • 1 Dec 2022 • Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell

We introduce precise object silhouette as a new form of user control in text-to-image diffusion models, which we dub Shape-Guided Diffusion.

Object

Paper
Add Code

Focus! Relevant and Sufficient Context Selection for News Image Captioning

no code implementations • 1 Dec 2022 • Mingyang Zhou, Grace Luo, Anna Rohrbach, Zhou Yu

In our paper, we first demonstrate that by combining more fine-grained context that captures the key named entities (obtained via an oracle) and the global context that summarizes the news, we can dramatically improve the model's ability to generate accurate news captions.

Image Captioning Relation Extraction +1

Paper
Add Code

G^3: Geolocation via Guidebook Grounding

1 code implementation • 28 Nov 2022 • Grace Luo, Giscard Biamby, Trevor Darrell, Daniel Fried, Anna Rohrbach

We propose the task of Geolocation via Guidebook Grounding that uses a dataset of StreetView images from a diverse set of locations and an associated textual guidebook for GeoGuessr, a popular interactive geolocation game.

Paper
Code

Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation

1 code implementation • NAACL 2022 • Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach

Detecting out-of-context media, such as "mis-captioned" images on Twitter, is a relevant problem, especially in domains of high public significance.

Misinformation

Paper
Code

NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media

1 code implementation • EMNLP 2021 • Grace Luo, Trevor Darrell, Anna Rohrbach

Online misinformation is a prevalent societal issue, with adversaries relying on tools ranging from cheap fakes to sophisticated deep fakes.

Misinformation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.