Search Results for author: Chen Henry Wu

Found 18 papers, 14 papers with code

Adversarial Attacks on Multimodal Agents

1 code implementation 18 Jun 2024 Chen Henry Wu, Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried, Aditi Raghunathan

Our attacks use adversarial text strings to guide gradient-based perturbation over one trigger image in the environment: (1) our captioner attack attacks white-box captioners if they are used to process images into captions as additional inputs to the VLM; (2) our CLIP attack attacks a set of CLIP models jointly, which can transfer to proprietary VLMs.

Adversarial Text

Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention

no code implementations 6 Dec 2023 Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Haene, Jean-Charles Bazin, Fernando de la Torre

Specifically, we insert parallel attention matrices into each cross-attention module in the denoising network, which attend to features extracted from reference images by an identity encoder.

Denoising, Facial Inpainting

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

1 code implementation 20 Sep 2023 Tianbao Xie, Siheng Zhao, Chen Henry Wu, Yitao Liu, Qian Luo, Victor Zhong, Yanchao Yang, Tao Yu

Unlike inverse RL and recent work that uses LLMs to write sparse reward codes or unshaped dense rewards with a constant function across timesteps, Text2Reward produces interpretable, free-form dense reward codes that cover a wide range of tasks, utilize existing packages, and allow iterative refinement with human feedback.

Reinforcement Learning (RL)

PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face Inpainting

2 code implementations ICCV 2023 Saman Motamed, Jianjin Xu, Chen Henry Wu, Fernando de la Torre

By using ~40 reference images, PATMAT creates anchor points in MAT's style module, and tunes the model using the fixed anchors to adapt the model to a new face identity.

Facial Inpainting

Zero-shot Model Diagnosis

no code implementations CVPR 2023 Jinqi Luo, Zhaoning Wang, Chen Henry Wu, Dong Huang, Fernando de la Torre

Extensive experiments demonstrate that our method is capable of producing counterfactual images and offering sensitivity analysis for model diagnosis without the need for a test set.

Counterfactual, Fairness

Semantic Image Attack for Visual Model Diagnosis

no code implementations 23 Mar 2023 Jinqi Luo, Zhaoning Wang, Chen Henry Wu, Dong Huang, Fernando de la Torre

Rather than relying on a carefully designed test set to assess ML models' failures, fairness, or robustness, this paper proposes Semantic Image Attack (SIA), a method based on the adversarial attack that provides semantic adversarial images to allow model diagnosis, interpretability, and robustness.

Adversarial Attack, Attribute

A Latent Space of Stochastic Diffusion Models for Zero-Shot Image Editing and Guidance

1 code implementation ICCV 2023 Chen Henry Wu, Fernando de la Torre

We demonstrate that this latent space of stochastic diffusion models can be used in the same way as that of deterministic diffusion models in two applications.

Denoising

Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance

4 code implementations 11 Oct 2022 Chen Henry Wu, Fernando de la Torre

The commonly adopted formulation of the latent code of diffusion models is a sequence of gradually denoised samples, as opposed to the simpler (e.g., Gaussian) latent space of GANs, VAEs, and normalizing flows.

Image-to-Image Translation

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

1 code implementation 14 Sep 2022 Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando de la Torre

Our experiments demonstrate how PromptGen can efficiently sample from several unconditional generative models (e.g., StyleGAN2, StyleNeRF, diffusion autoencoder, NVAE) in a controlled and/or de-biased manner using various off-the-shelf models: (1) with the CLIP model as control, PromptGen can sample images guided by text; (2) with image classifiers as control, PromptGen can de-bias generative models across a set of attributes or attribute combinations; and (3) with inverse graphics models as control, PromptGen can sample images of the same identity in different poses.

Attribute

Selective Annotation Makes Language Models Better Few-Shot Learners

1 code implementation 5 Sep 2022 Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

Departing from recent in-context learning methods, we formulate an annotation-efficient, two-step framework: selective annotation that chooses a pool of examples to annotate from unlabeled data in advance, followed by prompt retrieval that retrieves task examples from the annotated pool at test time.

Code Generation, In-Context Learning

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

2 code implementations 3 Aug 2021 Hao Zhou, Pei Ke, Zheng Zhang, Yuxian Gu, Yinhe Zheng, Chujie Zheng, Yida Wang, Chen Henry Wu, Hao Sun, Xiaocong Yang, Bosi Wen, Xiaoyan Zhu, Minlie Huang, Jie Tang

Although pre-trained language models have remarkably enhanced the generation ability of dialogue systems, open-domain Chinese dialogue systems are still limited by the dialogue data and the model size compared with English ones.

NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer

1 code implementation Findings (ACL) 2021 Fei Huang, Zikai Chen, Chen Henry Wu, Qihan Guo, Xiaoyan Zhu, Minlie Huang

First, we observe that most words in the transferred sentence can be aligned with related words in the source sentence, so we explicitly model word alignments to suppress irrelevant words.

Sentence Style Transfer
