Search Results for author: Arantxa Casanova

Found 9 papers, 5 papers with code

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

no code implementations • 17 Nov 2023 • Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Evaluation results show our method improves visual quality by 14%, prompt alignment by 16. 2% and scene diversity by 15. 3%, compared to prompt engineering the base Emu model for stickers generation.

Image Generation Prompt Engineering

Paper
Add Code

Controllable Image Generation via Collage Representations

no code implementations • 26 Apr 2023 • Arantxa Casanova, Marlène Careil, Adriana Romero-Soriano, Christopher J. Pal, Jakob Verbeek, Michal Drozdzal

Our experiments on the OI dataset show that M&Ms outperforms baselines in terms of fine-grained scene controllability while being very competitive in terms of image quality and sample diversity.

Attribute Image Generation

Paper
Add Code

Instance-Conditioned GAN Data Augmentation for Representation Learning

no code implementations • 16 Mar 2023 • Pietro Astolfi, Arantxa Casanova, Jakob Verbeek, Pascal Vincent, Adriana Romero-Soriano, Michal Drozdzal

We showcase the benefits of DA_IC-GAN by plugging it out-of-the-box into the supervised training of ResNets and DeiT models on the ImageNet dataset, and achieving accuracy boosts up to between 1%p and 2%p with the highest capacity models.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

Revisiting Hotels-50K and Hotel-ID

1 code implementation • 20 Jul 2022 • Aarash Feizi, Arantxa Casanova, Adriana Romero-Soriano, Reihaneh Rabbany

In this paper, we propose revisited versions for two recent hotel recognition datasets: Hotels50K and Hotel-ID.

Image Retrieval Retrieval

Paper
Code

Instance-Conditioned GAN

1 code implementation • NeurIPS 2021 • Arantxa Casanova, Marlène Careil, Jakob Verbeek, Michal Drozdzal, Adriana Romero-Soriano

Generative Adversarial Networks (GANs) can generate near photo realistic images in narrow domains such as human faces.

Ranked #1 on Conditional Image Generation on ImageNet 64x64

Conditional Image Generation Unconditional Image Generation

529

Paper
Code

Generating unseen complex scenes: are we there yet?

no code implementations • 7 Dec 2020 • Arantxa Casanova, Michal Drozdzal, Adriana Romero-Soriano

In this paper, we propose a methodology to compare complex scene conditional generation models, and provide an in-depth analysis that assesses the ability of each model to (1) fit the training distribution and hence perform well on seen conditionings, (2) to generalize to unseen conditionings composed of seen object combinations, and (3) generalize to unseen conditionings composed of unseen object combinations.

Object

Paper
Add Code

Reinforced active learning for image segmentation

1 code implementation • ICLR 2020 • Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal

Our method proposes a new modification of the deep Q-network (DQN) formulation for active learning, adapting it to the large-scale nature of semantic segmentation problems.

Active Learning Image Segmentation +3

Paper
Code

On the iterative refinement of densely connected representation levels for semantic segmentation

1 code implementation • 30 Apr 2018 • Arantxa Casanova, Guillem Cucurull, Michal Drozdzal, Adriana Romero, Yoshua Bengio

State-of-the-art semantic segmentation approaches increase the receptive field of their models by using either a downsampling path composed of poolings/strided convolutions or successive dilated convolutions.

Image Segmentation Scene Understanding +1

Paper
Code

Graph Attention Networks

90 code implementations • ICLR 2018 • Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, Yoshua Bengio

We present graph attention networks (GATs), novel neural network architectures that operate on graph-structured data, leveraging masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations.

Ranked #1 on Node Classification on Pubmed (Validation metric)

Document Classification Graph Attention +8

47,519

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.