TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-based Image Editing	PIE-Bench	StyleDiffusion+Prompt-to-Prompt	CLIPSIM	24.78	# 7
Text-based Image Editing	PIE-Bench	StyleDiffusion+Prompt-to-Prompt	Structure Distance	11.65	# 1
Text-based Image Editing	PIE-Bench	StyleDiffusion+Prompt-to-Prompt	Background PSNR	26.05	# 7
Text-based Image Editing	PIE-Bench	StyleDiffusion+Prompt-to-Prompt	Background LPIPS	66.10	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/stylediffusion-prompt-embedding-inversion-for/text-based-image-editing-on-pie-bench)](https://paperswithcode.com/sota/text-based-image-editing-on-pie-bench?p=stylediffusion-prompt-embedding-inversion-for)`

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

28 Mar 2023 · Senmao Li, Joost Van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang ·

A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images. They either finetune the model, or invert the image in the latent space of the pretrained model. However, they suffer from two problems: (1) Unsatisfying results for selected regions, and unexpected changes in nonselected regions. (2) They require careful text prompt editing where the prompt should include all visual objects in the input image. To address this, we propose two improvements: (1) Only optimizing the input of the value linear network in the cross-attention layers, is sufficiently powerful to reconstruct a real image. (2) We propose attention regularization to preserve the object-like attention maps after editing, enabling us to obtain accurate style editing without invoking significant structural changes. We further improve the editing technique which is used for the unconditional branch of classifier-free guidance, as well as the conditional one as used by P2P. Extensive experimental prompt-editing results on a variety of images, demonstrate qualitatively and quantitatively that our method has superior editing capabilities than existing and concurrent works.

PDF Abstract

Code

Add Remove Mark official

sen-mao/StyleDiffusion official

Tasks

Add Remove

Text-based Image Editing

Datasets

MS COCO PIE-Bench

Results from the Paper

Add Remove

Ranked #7 on Text-based Image Editing on PIE-Bench

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text-based Image Editing	PIE-Bench	StyleDiffusion+Prompt-to-Prompt	CLIPSIM	24.78	# 7	Compare
			Structure Distance	11.65	# 1	Compare
			Background PSNR	26.05	# 7	Compare
			Background LPIPS	66.10	# 6	Compare

Methods

Add Remove

Diffusion

Edit Social Preview

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove