Search Results for author: Albert Pumarola

Found 27 papers, 8 papers with code

Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

no code implementations • 2 Mar 2024 • Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipman

This paper introduces Bespoke Non-Stationary (BNS) Solvers, a solver distillation approach to improve sample efficiency of Diffusion and Flow models.

Audio Generation Conditional Image Generation +1

Paper
Add Code

Animated Stickers: Bringing Stickers to Life with Video Diffusion

no code implementations • 8 Feb 2024 • David Yan, Winnie Zhang, Luxin Zhang, Anmol Kalia, Dingkang Wang, Ankit Ramchandani, Miao Liu, Albert Pumarola, Edgar Schoenfeld, Elliot Blanchard, Krishna Narni, Yaqiao Luo, Lawrence Chen, Guan Pang, Ali Thabet, Peter Vajda, Amy Bearman, Licheng Yu

Our model is built on top of the state-of-the-art Emu text-to-image model, with the addition of temporal layers to model motion.

Paper
Add Code

fMPI: Fast Novel View Synthesis in the Wild with Layered Scene Representations

no code implementations • 26 Dec 2023 • Jonas Kohler, Nicolas Griffiths Sanchez, Luca Cavalli, Catherine Herold, Albert Pumarola, Alberto Garcia Garcia, Ali Thabet

In this study, we propose two novel input processing paradigms for novel view synthesis (NVS) methods based on layered scene representations that significantly improve their runtime without compromising quality.

Novel View Synthesis

Paper
Add Code

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

1 code implementation • 19 Dec 2023 • Angela Castillo, Jonas Kohler, Juan C. Pérez, Juan Pablo Pérez, Albert Pumarola, Bernard Ghanem, Pablo Arbeláez, Ali Thabet

Our findings provide insights into the efficiency of the conditional denoising process that contribute to more practical and swift deployment of text-conditioned diffusion models.

Denoising Neural Architecture Search

196

Paper
Code

Bespoke Solvers for Generative Flow Models

no code implementations • 29 Oct 2023 • Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

For example, a Bespoke solver for a CIFAR10 model produces samples with Fr\'echet Inception Distance (FID) of 2. 73 with 10 NFE, and gets to 1% of the Ground Truth (GT) FID (2. 59) for this model with only 20 NFE.

Paper
Add Code

BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis

no code implementations • 21 Apr 2023 • Angela Castillo, Maria Escobar, Guillaume Jeanneret, Albert Pumarola, Pablo Arbeláez, Ali Thabet, Artsiom Sanakoyeu

To the best of our knowledge, this is the first approach that uses the reverse diffusion process to model full-body tracking as a conditional sequence generation task.

Mixed Reality Motion Synthesis

Paper
Add Code

Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model

1 code implementation • CVPR 2023 • Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali Thabet, Artsiom Sanakoyeu

A particular challenge is that only a sparse tracking signal is available from standalone HMDs (Head Mounted Devices), often limited to tracking the user's head and wrists.

226

Paper
Code

VisCo Grids: Surface Reconstruction with Viscosity and Coarea Grids

no code implementations • 25 Mar 2023 • Albert Pumarola, Artsiom Sanakoyeu, Lior Yariv, Ali Thabet, Yaron Lipman

Surface reconstruction has been seeing a lot of progress lately by utilizing Implicit Neural Representations (INRs).

Inductive Bias Surface Reconstruction

Paper
Add Code

3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion

no code implementations • 21 Mar 2023 • Yu-Jhe Li, Tao Xu, Ji Hou, Bichen Wu, Xiaoliang Dai, Albert Pumarola, Peizhao Zhang, Peter Vajda, Kris Kitani

We note that the novelty of our model lies in that we introduce contrastive learning during training the diffusion prior which enables the generation of the valid view-invariant latent code.

Contrastive Learning Text to 3D

Paper
Add Code

Re-ReND: Real-time Rendering of NeRFs across Devices

1 code implementation • ICCV 2023 • Sara Rojas, Jesus Zarzar, Juan Camilo Perez, Artsiom Sanakoyeu, Ali Thabet, Albert Pumarola, Bernard Ghanem

Re-ReND is designed to achieve real-time performance by converting the NeRF into a representation that can be efficiently processed by standard graphics pipelines.

Paper
Code

INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors

no code implementations • 5 Dec 2022 • Chaojian Li, Bichen Wu, Albert Pumarola, Peizhao Zhang, Yingyan Lin, Peter Vajda

We present a method that accelerates reconstruction of 3D scenes and objects, aiming to enable instant reconstruction on edge devices such as mobile phones and AR/VR headsets.

Novel View Synthesis

Paper
Add Code

3D-Aware Encoding for Style-based Neural Radiance Fields

no code implementations • 12 Nov 2022 • Yu-Jhe Li, Tao Xu, Bichen Wu, Ningyuan Zheng, Xiaoliang Dai, Albert Pumarola, Peizhao Zhang, Peter Vajda, Kris Kitani

In the first stage, we introduce a base encoder that converts the input image to a latent code.

Contrastive Learning Image Reconstruction

Paper
Add Code

Single-view 3D Body and Cloth Reconstruction under Complex Poses

no code implementations • 9 May 2022 • Nicolas Ugrinovic, Albert Pumarola, Alberto Sanfeliu, Francesc Moreno-Noguer

We, therefore, propose a coarse-to-fine approach in which we first learn an implicit function that maps the input image to a 3D body shape with a low level of detail, but which correctly fits the underlying human pose, despite its complexity.

Ranked #1 on 3D Reconstruction on 3DPeople

3D Reconstruction

Paper
Add Code

Enhancing Egocentric 3D Pose Estimation with Third Person Views

1 code implementation • 6 Jan 2022 • Ameya Dhamanaskar, Mariella Dimiccoli, Enric Corona, Albert Pumarola, Francesc Moreno-Noguer

In this paper, we propose a novel approach to enhance the 3D body pose estimation of a person computed from videos captured from a single wearable camera.

3D Pose Estimation Domain Adaptation

Paper
Code

PhysXNet: A Customizable Approach for LearningCloth Dynamics on Dressed People

no code implementations • 13 Nov 2021 • Jordi Sanchez-Riera, Albert Pumarola, Francesc Moreno-Noguer

We introduce PhysXNet, a learning-based approach to predict the dynamics of deformable clothes given 3D skeleton motion sequences of humans wearing these clothes.

Paper
Add Code

FaceDet3D: Facial Expressions with 3D Geometric Detail Hallucination

no code implementations • 29 Sep 2021 • ShahRukh Athar, Albert Pumarola, Francesc Moreno-Noguer, Dimitris Samaras

Facial Expressions induce a variety of high-level details on the 3D face geometry.

Hallucination

Paper
Add Code

H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

1 code implementation • ICCV 2021 • Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer

In this paper, we tackle these limitations for the specific problem of few-shot full 3D head reconstruction, by endowing coordinate-based representations with a probabilistic shape prior that enables faster convergence and better generalization when using few input images (down to three).

3D Reconstruction Multi-View 3D Reconstruction +1

118

Paper
Code

SMPLicit: Topology-aware Generative Model for Clothed People

1 code implementation • CVPR 2021 • Enric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce SMPLicit, a novel generative model to jointly represent body pose, shape and clothing geometry.

3D Reconstruction

273

Paper
Code

FaceDet3D: Facial Expressions with 3D Geometric Detail Prediction

no code implementations • 14 Dec 2020 • ShahRukh Athar, Albert Pumarola, Francesc Moreno-Noguer, Dimitris Samaras

The facial details are represented as a vertex displacement map and used then by a Neural Renderer to photo-realistically render novel images of any single image in any desired expression and view.

Paper
Add Code

D-NeRF: Neural Radiance Fields for Dynamic Scenes

1 code implementation • CVPR 2021 • Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce D-NeRF, a method that extends neural radiance fields to a dynamic domain, allowing to reconstruct and render novel images of objects under rigid and non-rigid motions from a \emph{single} camera moving around the scene.

Neural Rendering

500

Paper
Code

C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds

no code implementations • CVPR 2020 • Albert Pumarola, Stefan Popov, Francesc Moreno-Noguer, Vittorio Ferrari

Flow-based generative models have highly desirable properties like exact log-likelihood evaluation and exact latent-variable inference, however they are still in their infancy and have not received as much attention as alternative generative models.

3D Reconstruction Image Manipulation +1

Paper
Add Code

3DPeople: Modeling the Geometry of Dressed Humans

no code implementations • ICCV 2019 • Albert Pumarola, Jordi Sanchez, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer

Finally, we design a multi-resolution deep generative network that, given an input image of a dressed human, predicts his/her geometry image (and thus the clothed body shape) in an end-to-end manner.

3D Human Shape Estimation Optical Flow Estimation

Paper
Add Code

Context-aware Human Motion Prediction

no code implementations • CVPR 2020 • Enric Corona, Albert Pumarola, Guillem Alenyà, Francesc Moreno-Noguer

The problem of predicting human motion given a sequence of past observations is at the core of many applications in robotics and computer vision.

Graph Attention Human motion prediction +1

Paper
Add Code

Fast video object segmentation with Spatio-Temporal GANs

no code implementations • 28 Mar 2019 • Sergi Caelles, Albert Pumarola, Francesc Moreno-Noguer, Alberto Sanfeliu, Luc van Gool

To achieve this, we concentrate all the heavy computational load to the training phase with two critics that enforce spatial and temporal mask consistency over the last K frames.

Descriptive Object +4

Paper
Add Code

Geometry-Aware Network for Non-Rigid Shape Prediction from a Single View

no code implementations • CVPR 2018 • Albert Pumarola, Antonio Agudo, Lorenzo Porzi, Alberto Sanfeliu, Vincent Lepetit, Francesc Moreno-Noguer

We propose a method for predicting the 3D shape of a deformable surface from a single view.

Paper
Add Code

Unsupervised Person Image Synthesis in Arbitrary Poses

no code implementations • CVPR 2018 • Albert Pumarola, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

Given an input image of a person and a desired pose represented by a 2D skeleton, our model renders the image of the same person under the new pose, synthesizing novel views of the parts visible in the input image and hallucinating those that are not seen.

Image Generation

Paper
Add Code

GANimation: Anatomically-aware Facial Animation from a Single Image

7 code implementations • ECCV 2018 • Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer

Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for task of facial expression synthesis.

Conditional Image Generation Face Generation +1

1,928

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.