Search Results for author: Francesc Moreno-Noguer

Found 76 papers, 26 papers with code

Morphological Symmetries in Robotics

2 code implementations • 23 Feb 2024 • Daniel Ordoñez-Apraez, Giulio Turrisi, Vladimir Kostic, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer, Massimiliano Pontil, Claudio Semini, Carlos Mastalli

We present a comprehensive framework for studying and leveraging morphological symmetries in robotic systems.

Data Augmentation

Paper
Code

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

no code implementations • 13 Dec 2023 • Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno-Noguer

Instead of predicting body model parameters or 3D vertex coordinates, our focus is on forecasting the proposed discrete latent representation, which can be decoded into a registered human mesh.

Paper
Add Code

Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields

no code implementations • 3 Nov 2023 • Jianxiong Shen, Ruijie Ren, Adria Ruiz, Francesc Moreno-Noguer

To quantify the uncertainty on the learned surface, we model a stochastic radiance field.

Paper
Add Code

Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction

1 code implementation • 12 Oct 2023 • Pol Caselles, Eduard Ramon, Jaime Garcia, Gil Triginer, Francesc Moreno-Noguer

Recent advancements in learning techniques that employ coordinate-based neural representations have yielded remarkable results in multi-view 3D reconstruction tasks.

3D Reconstruction Multi-View 3D Reconstruction

117

Paper
Code

PoseFix: Correcting 3D Human Poses with Natural Language

no code implementations • ICCV 2023 • Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez

Automatically producing instructions to modify one's posture could open the door to endless applications, such as personalized coaching and in-home physical therapy.

Text Generation

Paper
Add Code

InstantAvatar: Efficient 3D Head Reconstruction via Surface Rendering

no code implementations • 9 Aug 2023 • Antonio Canela, Pol Caselles, Ibrar Malik, Eduard Ramon, Jaime García, Jordi Sánchez-Riera, Gil Triginer, Francesc Moreno-Noguer

In order to speed up the reconstruction process, we propose a system that combines, for the first time, a voxel-grid neural field representation with a surface renderer.

valid

Paper
Add Code

On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis

2 code implementations • 21 Feb 2023 • Daniel Ordonez-Apraez, Mario Martin, Antonio Agudo, Francesc Moreno-Noguer

We present a comprehensive study on discrete morphological symmetries of dynamical systems, which are commonly observed in biological and artificial locomoting systems, such as legged, swimming, and flying animals/robots/virtual characters.

Contact Detection Data Augmentation +1

Paper
Code

Visual Semantic Relatedness Dataset for Image Captioning

1 code implementation • 20 Jan 2023 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

Modern image captioning system relies heavily on extracting knowledge from images to capture the concept of a static story.

Image Captioning text similarity

Paper
Code

NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid

no code implementations • CVPR 2023 • Fernando Rivas-Manzaneque, Jorge Sierra-Acosta, Adrian Penate-Sanchez, Francesc Moreno-Noguer, Angela Ribeiro

While original Neural Radiance Fields (NeRF) have shown impressive results in modeling the appearance of a scene with compact MLP architectures, they are not able to achieve real-time rendering.

Paper
Add Code

PoseScript: Linking 3D Human Poses and Natural Language

1 code implementation • 21 Oct 2022 • Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc Moreno-Noguer, Grégory Rogez

Thirdly, we present a learned process for generating pose descriptions.

Cross-Modal Retrieval Image Captioning +3

Paper
Code

Belief Revision based Caption Re-ranker with Visual Semantic Information

1 code implementation • COLING 2022 • Ahmed Sabir, Francesc Moreno-Noguer, Pranava Madhyastha, Lluís Padró

In this work, we focus on improving the captions generated by image-caption generation systems.

Caption Generation Image Captioning +4

Paper
Code

SIRA: Relightable Avatars from a Single Image

no code implementations • 7 Sep 2022 • Pol Caselles, Eduard Ramon, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer, Gil Triginer

Our key ingredients are two data-driven statistical models based on neural fields that resolve the ambiguities of single-view 3D surface reconstruction and appearance factorization.

Surface Reconstruction

Paper
Add Code

Topic Detection in Continuous Sign Language Videos

1 code implementation • 1 Sep 2022 • Alvaro Budria, Laia Tarres, Gerard I. Gallego, Francesc Moreno-Noguer, Jordi Torres, Xavier Giro-i-Nieto

Significant progress has been made recently on challenging tasks in automatic sign language understanding, such as sign language recognition, translation and production.

Sign Language Recognition Translation

Paper
Code

Back to MLP: A Simple Baseline for Human Motion Prediction

1 code implementation • 4 Jul 2022 • Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer

This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences.

Ranked #3 on Multi-Person Pose forecasting on Expi - unseen actions split

Human motion prediction Human Pose Forecasting +2

110

Paper
Code

Learned Vertex Descent: A New Direction for 3D Human Model Fitting

no code implementations • 12 May 2022 • Enric Corona, Gerard Pons-Moll, Guillem Alenyà, Francesc Moreno-Noguer

An exhaustive evaluation demonstrates that our approach is able to capture the underlying body of clothed people with very different body shapes, achieving a significant improvement compared to state-of-the-art.

Paper
Add Code

Single-view 3D Body and Cloth Reconstruction under Complex Poses

no code implementations • 9 May 2022 • Nicolas Ugrinovic, Albert Pumarola, Alberto Sanfeliu, Francesc Moreno-Noguer

We, therefore, propose a coarse-to-fine approach in which we first learn an implicit function that maps the input image to a 3D body shape with a low level of detail, but which correctly fits the underlying human pose, despite its complexity.

Ranked #1 on 3D Reconstruction on 3DPeople

3D Reconstruction

Paper
Add Code

Permutation-Invariant Relational Network for Multi-person 3D Pose Estimation

no code implementations • 11 Apr 2022 • Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

For this purpose, we build a residual-like permutation-invariant network that successfully refines potentially corrupted initial 3D poses estimated by an off-the-shelf detector.

Ranked #4 on 3D Multi-Person Pose Estimation (absolute) on MuPoTS-3D

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +1

Paper
Add Code

LISA: Learning Implicit Shape and Appearance of Hands

no code implementations • CVPR 2022 • Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe, Lingni Ma

This paper proposes a do-it-all neural model of human hands, named LISA.

Paper
Add Code

HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE

no code implementations • 4 Apr 2022 • Xiaoyu Bie, Wen Guo, Simon Leglaive, Lauren Girin, Francesc Moreno-Noguer, Xavier Alameda-Pineda

Studies on the automatic processing of 3D human pose data have flourished in the recent past.

motion prediction

Paper
Add Code

Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification

1 code implementation • 18 Mar 2022 • Jianxiong Shen, Antonio Agudo, Francesc Moreno-Noguer, Adria Ruiz

For this purpose, our method learns a distribution over all possible radiance fields modelling which is used to quantify the uncertainty associated with the modelled scene.

Autonomous Driving Decision Making +2

Paper
Code

Enhancing Egocentric 3D Pose Estimation with Third Person Views

1 code implementation • 6 Jan 2022 • Ameya Dhamanaskar, Mariella Dimiccoli, Enric Corona, Albert Pumarola, Francesc Moreno-Noguer

In this paper, we propose a novel approach to enhance the 3D body pose estimation of a person computed from videos captured from a single wearable camera.

3D Pose Estimation Domain Adaptation

Paper
Code

PhysXNet: A Customizable Approach for LearningCloth Dynamics on Dressed People

no code implementations • 13 Nov 2021 • Jordi Sanchez-Riera, Albert Pumarola, Francesc Moreno-Noguer

We introduce PhysXNet, a learning-based approach to predict the dynamics of deformable clothes given 3D skeleton motion sequences of humans wearing these clothes.

Paper
Add Code

Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images

1 code implementation • 2 Nov 2021 • Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

We address the problem of multi-person 3D body pose and shape estimation from a single image.

3D Human Shape Estimation 3D Multi-Person Mesh Recovery +1

Paper
Code

An Adaptable Approach to Learn Realistic Legged Locomotion without Examples

no code implementations • 28 Oct 2021 • Daniel Ordonez-Apraez, Antonio Agudo, Francesc Moreno-Noguer, Mario Martin

We present experimental results showing that even in a model-free setup and with a simple reactive control architecture, the learned policies can generate realistic and energy-efficient locomotion gaits for a bipedal and a quadrupedal robot.

Reinforcement Learning (RL)

Paper
Add Code

Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision

no code implementations • 6 Oct 2021 • Ruijie Ren, Mohit Gurnani Rajesh, Jordi Sanchez-Riera, Fan Zhang, Yurun Tian, Antonio Agudo, Yiannis Demiris, Krystian Mikolajczyk, Francesc Moreno-Noguer

We show that training our network solely with synthetic data and the proposed DA yields results competitive with models trained on real data.

Domain Adaptation

Paper
Add Code

FaceDet3D: Facial Expressions with 3D Geometric Detail Hallucination

no code implementations • 29 Sep 2021 • ShahRukh Athar, Albert Pumarola, Francesc Moreno-Noguer, Dimitris Samaras

Facial Expressions induce a variety of high-level details on the 3D face geometry.

Hallucination

Paper
Add Code

Stochastic Neural Radiance Fields: Quantifying Uncertainty in Implicit 3D Representations

no code implementations • 5 Sep 2021 • Jianxiong Shen, Adria Ruiz, Antonio Agudo, Francesc Moreno-Noguer

In this context, we propose Stochastic Neural Radiance Fields (S-NeRF), a generalization of standard NeRF that learns a probability distribution over all the possible radiance fields modeling the scene.

Novel View Synthesis Uncertainty Quantification +1

Paper
Add Code

SIDER: Single-Image Neural Optimization for Facial Geometric Detail Recovery

no code implementations • 11 Aug 2021 • Aggelina Chatziagapi, ShahRukh Athar, Francesc Moreno-Noguer, Dimitris Samaras

We present SIDER(Single-Image neural optimization for facial geometric DEtail Recovery), a novel photometric optimization method that recovers detailed facial geometry from a single image in an unsupervised manner.

Paper
Add Code

H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

1 code implementation • ICCV 2021 • Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime Garcia, Xavier Giro-i-Nieto, Francesc Moreno-Noguer

In this paper, we tackle these limitations for the specific problem of few-shot full 3D head reconstruction, by endowing coordinate-based representations with a probabilistic shape prior that enables faster convergence and better generalization when using few input images (down to three).

3D Reconstruction Multi-View 3D Reconstruction +1

117

Paper
Code

Uncertainty-Aware Camera Pose Estimation from Points and Lines

no code implementations • CVPR 2021 • Alexander Vakhitov, Luis Ferraz Colomina, Antonio Agudo, Francesc Moreno-Noguer

The new PnP(L) methods outperform the state-of-the-art on real data in isolation, showing an increase in mean translation accuracy by 18% on a representative subset of KITTI, while the new uncertain refinement improves pose accuracy for most of the solvers, e. g. decreasing mean translation error for the EPnP by 16% compared to the standard refinement on the same dataset.

Camera Localization Pose Estimation +2

Paper
Add Code

Multi-Person Extreme Motion Prediction

1 code implementation • CVPR 2022 • Wen Guo, Xiaoyu Bie, Xavier Alameda-Pineda, Francesc Moreno-Noguer

In this paper, we explore this problem when dealing with humans performing collaborative tasks, we seek to predict the future motion of two interacted persons given two sequences of their past skeletons.

Ranked #2 on Multi-Person Pose forecasting on Expi - unseen actions split

Human motion prediction motion prediction +2

Paper
Code

SMPLicit: Topology-aware Generative Model for Clothed People

1 code implementation • CVPR 2021 • Enric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce SMPLicit, a novel generative model to jointly represent body pose, shape and clothing geometry.

3D Reconstruction

273

Paper
Code

3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

2 code implementations • 11 Mar 2021 • Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando de la Torre

Two common approaches to deal with low-resolution images are applying super-resolution techniques to the input, which may result in unpleasant artifacts, or simply training one model for each resolution, which is impractical in many realistic applications.

3D human pose and shape estimation Contrastive Learning +1

216

Paper
Code

Multi-FinGAN: Generative Coarse-To-Fine Sampling of Multi-Finger Grasps

1 code implementation • 17 Dec 2020 • Jens Lundell, Enric Corona, Tran Nguyen Le, Francesco Verdoja, Philippe Weinzaepfel, Gregory Rogez, Francesc Moreno-Noguer, Ville Kyrki

While there exists many methods for manipulating rigid objects with parallel-jaw grippers, grasping with multi-finger robotic hands remains a quite unexplored research topic.

Paper
Code

FaceDet3D: Facial Expressions with 3D Geometric Detail Prediction

no code implementations • 14 Dec 2020 • ShahRukh Athar, Albert Pumarola, Francesc Moreno-Noguer, Dimitris Samaras

The facial details are represented as a vertex displacement map and used then by a Neural Renderer to photo-realistically render novel images of any single image in any desired expression and view.

Paper
Add Code

D-NeRF: Neural Radiance Fields for Dynamic Scenes

1 code implementation • CVPR 2021 • Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer

In this paper we introduce D-NeRF, a method that extends neural radiance fields to a dynamic domain, allowing to reconstruct and render novel images of objects under rigid and non-rigid motions from a \emph{single} camera moving around the scene.

Neural Rendering

499

Paper
Code

PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation

no code implementations • 11 Oct 2020 • Wen Guo, Enric Corona, Francesc Moreno-Noguer, Xavier Alameda-Pineda

Our pose interacting network, or PI-Net, inputs the initial pose estimates of a variable number of interactees into a recurrent architecture used to refine the pose of the person-of-interest.

Ranked #12 on 3D Multi-Person Pose Estimation (root-relative) on MuPoTS-3D

3D Multi-Person Pose Estimation (root-relative) 3D Pose Estimation

Paper
Add Code

3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

2 code implementations • ECCV 2020 • Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando de la Torre

3D human shape and pose estimation from monocular images has been an active area of research in computer vision, having a substantial impact on the development of new applications, from activity recognition to creating virtual avatars.

Ranked #71 on 3D Human Pose Estimation on MPI-INF-3DHP

3D Human Pose Estimation 3D Shape Reconstruction +4

216

Paper
Code

Neural Cellular Automata Manifold

no code implementations • CVPR 2021 • Alejandro Hernandez Ruiz, Armand Vilalta, Francesc Moreno-Noguer

In biological terms, our approach would play the role of the transcription factors, modulating the mapping of genes into specific proteins that drive cellular differentiation, which occurs right before the morphogenesis.

Image Generation

Paper
Add Code

Textual Visual Semantic Dataset for Text Spotting

no code implementations • 21 Apr 2020 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

In this paper, we propose a visual context dataset for Text Spotting in the wild, where the publicly available dataset COCO-text [Veit et al. 2016] has been extended with information about the scene (such as objects and places appearing in the image) to enable researchers to include semantic relations between texts and scene in their Text Spotting systems, and to offer a common framework for such approaches.

text similarity Text Spotting

Paper
Add Code

C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds

no code implementations • CVPR 2020 • Albert Pumarola, Stefan Popov, Francesc Moreno-Noguer, Vittorio Ferrari

Flow-based generative models have highly desirable properties like exact log-likelihood evaluation and exact latent-variable inference, however they are still in their infancy and have not received as much attention as alternative generative models.

3D Reconstruction Image Manipulation +1

Paper
Add Code

Improving Map Re-localization with Deep 'Movable' Objects Segmentation on 3D LiDAR Point Clouds

no code implementations • 8 Oct 2019 • Victor Vaquero, Kai Fischer, Francesc Moreno-Noguer, Alberto Sanfeliu, Stefan Milz

Results show that we are able to accurately re-locate over a filtered map, consistently reducing trajectory errors between an average of 35. 1% with respect to a non-filtered map version and of 47. 9% with respect to a standalone map created on the current session.

Autonomous Vehicles

Paper
Add Code

Semantic Relatedness Based Re-ranker for Text Spotting

3 code implementations • IJCNLP 2019 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

We present a scenario where semantic similarity is not enough, and we devise a neural approach to learn semantic relatedness.

Clustering Dimensionality Reduction +5

Paper
Code

3DPeople: Modeling the Geometry of Dressed Humans

no code implementations • ICCV 2019 • Albert Pumarola, Jordi Sanchez, Gary P. T. Choi, Alberto Sanfeliu, Francesc Moreno-Noguer

Finally, we design a multi-resolution deep generative network that, given an input image of a dressed human, predicts his/her geometry image (and thus the clothed body shape) in an end-to-end manner.

3D Human Shape Estimation Optical Flow Estimation

Paper
Add Code

Context-aware Human Motion Prediction

no code implementations • CVPR 2020 • Enric Corona, Albert Pumarola, Guillem Alenyà, Francesc Moreno-Noguer

The problem of predicting human motion given a sequence of past observations is at the core of many applications in robotics and computer vision.

Graph Attention Human motion prediction +1

Paper
Add Code

Fast video object segmentation with Spatio-Temporal GANs

no code implementations • 28 Mar 2019 • Sergi Caelles, Albert Pumarola, Francesc Moreno-Noguer, Alberto Sanfeliu, Luc van Gool

To achieve this, we concentrate all the heavy computational load to the training phase with two critics that enforce spatial and temporal mask consistency over the last K frames.

Descriptive Object +4

Paper
Add Code

Human Motion Prediction via Spatio-Temporal Inpainting

1 code implementation • 13 Dec 2018 • Alejandro Hernandez Ruiz, Juergen Gall, Francesc Moreno-Noguer

First, we represent the data using a spatio-temporal tensor of 3D skeleton coordinates which allows formulating the prediction problem as an inpainting one, for which GANs work particularly well.

Generative Adversarial Network Human motion prediction +3

Paper
Code

Visual Re-ranking with Natural Language Understanding for Text Spotting

3 code implementations • 29 Oct 2018 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

We propose a post-processing approach to improve scene text recognition accuracy by using occurrence probabilities of words (unigram language model), and the semantic correlation between scene and text.

Language Modelling Natural Language Understanding +3

Paper
Code

Visual Semantic Re-ranker for Text Spotting

1 code implementation • 23 Oct 2018 • Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró

In this paper, we propose a post-processing approach to improve the accuracy of text spotting by using the semantic relation between the text and the scene.

Text Spotting

Paper
Code

Unsupervised Person Image Synthesis in Arbitrary Poses

no code implementations • CVPR 2018 • Albert Pumarola, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer

Given an input image of a person and a desired pose represented by a 2D skeleton, our model renders the image of the same person under the new pose, synthesizing novel views of the parts visible in the input image and hallucinating those that are not seen.

Image Generation

Paper
Add Code

Geometry-Aware Network for Non-Rigid Shape Prediction from a Single View

no code implementations • CVPR 2018 • Albert Pumarola, Antonio Agudo, Lorenzo Porzi, Alberto Sanfeliu, Vincent Lepetit, Francesc Moreno-Noguer

We propose a method for predicting the 3D shape of a deformable surface from a single view.

Paper
Add Code

Hallucinating Dense Optical Flow from Sparse Lidar for Autonomous Vehicles

no code implementations • 30 Aug 2018 • Victor Vaquero, Alberto Sanfeliu, Francesc Moreno-Noguer

In this paper we propose a novel approach to estimate dense optical flow from sparse lidar data acquired on an autonomous vehicle.

Autonomous Vehicles Optical Flow Estimation

Paper
Add Code

Deep Lidar CNN to Understand the Dynamics of Moving Vehicles

no code implementations • 28 Aug 2018 • Victor Vaquero, Alberto Sanfeliu, Francesc Moreno-Noguer

Perception technologies in Autonomous Driving are experiencing their golden age due to the advances in Deep Learning.

Autonomous Driving Optical Flow Estimation

Paper
Add Code

Deconvolutional Networks for Point-Cloud Vehicle Detection and Tracking in Driving Scenarios

no code implementations • 23 Aug 2018 • Victor Vaquero, Ivan del Pino, Francesc Moreno-Noguer, Joan Solà, Alberto Sanfeliu, Juan Andrade-Cetto

The system is thoroughly evaluated on the KITTI tracking dataset, and we show the performance boost provided by our CNN-based vehicle detector over a standard geometric approach.

Autonomous Driving Multi-Object Tracking

Paper
Add Code

Joint Coarse-And-Fine Reasoning for Deep Optical Flow

no code implementations • 22 Aug 2018 • Victor Vaquero, German Ros, Francesc Moreno-Noguer, Antonio M. Lopez, Alberto Sanfeliu

We propose a novel representation for dense pixel-wise estimation tasks using CNNs that boosts accuracy and reduces training time, by explicitly exploiting joint coarse-and-fine reasoning.

Optical Flow Estimation

Paper
Add Code

GANimation: Anatomically-aware Facial Animation from a Single Image

7 code implementations • ECCV 2018 • Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer

Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for task of facial expression synthesis.

Conditional Image Generation Face Generation +1

1,928

Paper
Code

Image Collection Pop-Up: 3D Reconstruction and Clustering of Rigid and Non-Rigid Categories

no code implementations • CVPR 2018 • Antonio Agudo, Melcior Pijoan, Francesc Moreno-Noguer

This paper introduces an approach to simultaneously estimate 3D shape, camera pose, and object and type of deformation clustering, from partial 2D annotations in a multi-instance collection of images.

3D Reconstruction Clustering +1

Paper
Add Code

3D CNNs on Distance Matrices for Human Action Recognition

1 code implementation • MM '17 Proceedings of the 25th ACM international conference on Multimedia 2017 • Alejandro Hernandez Ruiz, Lorenzo Porzi, Samuel Rota Bulò, Francesc Moreno-Noguer

In this paper we are interested in recognizing human actions from sequences of 3D skeleton data.

Ranked #90 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Code

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

no code implementations • CVPR 2017 • Antonio Agudo, Francesc Moreno-Noguer

We present an approach to reconstruct the 3D shape of multiple deforming objects from incomplete 2D trajectories acquired by a single camera.

3D Reconstruction Clustering +1

Paper
Add Code

Multi-Modal Fashion Product Retrieval

no code implementations • WS 2017 • Antonio Rubio Romano, LongLong Yu, Edgar Simo-Serra, Francesc Moreno-Noguer

Finding a product in the fashion world can be a daunting task.

Retrieval

Paper
Add Code

The BreakingNews Dataset

no code implementations • WS 2017 • Arnau Ramisa, Fei Yan, Francesc Moreno-Noguer, Krystian Mikolajczyk

We present BreakingNews, a novel dataset with approximately 100K news articles including images, text and captions, and enriched with heterogeneous meta-data (e. g. GPS coordinates and popularity metrics).

Image Captioning Sentiment Analysis

Paper
Add Code

3D Human Pose Estimation from a Single Image via Distance Matrix Regression

no code implementations • CVPR 2017 • Francesc Moreno-Noguer

We follow a standard two-step pipeline by first detecting the 2D position of the $N$ body joints, and then using these observations to infer 3D pose.

Ranked #19 on 3D Human Pose Estimation on HumanEva-I

3D Human Pose Estimation Position +1

Paper
Add Code

BreakingNews: Article Annotation by Image and Text Processing

no code implementations • 23 Mar 2016 • Arnau Ramisa, Fei Yan, Francesc Moreno-Noguer, Krystian Mikolajczyk

Deep Canonical Correlation Analysis is deployed for article illustration, and a new loss function based on Great Circle Distance is proposed for geolocation.

Image Retrieval Retrieval +1

Paper
Add Code

Discriminative Learning of Deep Convolutional Feature Point Descriptors

1 code implementation • ICCV 2015 • Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer

Deep learning has revolutionalized image-level tasks such as classification, but patch-level tasks, such as correspondence, still rely on hand-crafted features, e. g. SIFT.

Ranked #2 on Satellite Image Classification on SAT-4

Satellite Image Classification

120

Paper
Code

Learning Shape, Motion and Elastic Models in Force Space

no code implementations • ICCV 2015 • Antonio Agudo, Francesc Moreno-Noguer

In this paper, we address the problem of simultaneously recovering the 3D shape and pose of a deformable and potentially elastic object from 2D motion.

Paper
Add Code

Structured Prediction with Output Embeddings for Semantic Image Annotation

no code implementations • NAACL 2016 • Ariadna Quattoni, Arnau Ramisa, Pranava Swaroop Madhyastha, Edgar Simo-Serra, Francesc Moreno-Noguer

We address the task of annotating images with semantic tuples.

Structured Prediction

Paper
Add Code

Combining Geometric, Textual and Visual Features for Predicting Prepositions in Image Descriptions

no code implementations • EMNLP 2015 • Arnau Ramisa, Josiah Wang, Ying Lu, Dell, Emmanuel rea, Francesc Moreno-Noguer, Robert Gaizauskas

Image Retrieval

Paper
Add Code

Semantic Tuples for Evaluation of Image to Sentence Generation

no code implementations • WS 2015 • Lily D. Ellebracht, Arnau Ramisa, Pranava Swaroop Madhyastha, Jose Cordero-Rama, Francesc Moreno-Noguer, Ariadna Quattoni

Image Captioning Machine Translation +1

Paper
Add Code

Simultaneous Pose and Non-Rigid Shape With Particle Dynamics

no code implementations • CVPR 2015 • Antonio Agudo, Francesc Moreno-Noguer

In this paper, we propose a sequential solution to simultaneously estimate camera pose and non-rigid 3D shape from a monocular video.

Paper
Add Code

TED: A Tolerant Edit Distance for Segmentation Evaluation

1 code implementation • 8 Mar 2015 • Jan Funke, Francesc Moreno-Noguer, Albert Cardona, Matthew Cook

This measure, which we call Tolerant Edit Distance (TED), is motivated by two observations: (1) Some errors, like small boundary shifts, are tolerable in practice.

Paper
Code

Neuroaesthetics in Fashion: Modeling the Perception of Fashionability

no code implementations • Conference 2015 • Edgar Simo-Serra, Sanja Fidler, Francesc Moreno-Noguer, Raquel Urtasun

Importantly, our model is able to give rich feedback back to the user, conveying which garments or even scenery she/he should change in order to improve fashionability.

Paper
Add Code

Fracking Deep Convolutional Image Descriptors

no code implementations • 19 Dec 2014 • Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Francesc Moreno-Noguer

In this paper we propose a novel framework for learning local image descriptors in a discriminative manner.

Paper
Add Code

Very Fast Solution to the PnP Problem with Algebraic Outlier Rejection

no code implementations • CVPR 2014 • Luis Ferraz, Xavier Binefa, Francesc Moreno-Noguer

Given a set of 3D-to-2D matches, we formulate pose estimation problem as a low-rank homogeneous sys- tem where the solution lies on its 1D null space.

Pose Estimation

Paper
Add Code

Segmentation-aware Deformable Part Models

no code implementations • CVPR 2014 • Eduard Trulls, Stavros Tsogkas, Iasonas Kokkinos, Alberto Sanfeliu, Francesc Moreno-Noguer

In this work we propose a technique to combine bottom-up segmentation, coming in the form of SLIC superpixels, with sliding window detectors, such as Deformable Part Models (DPMs).

Optical Flow Estimation Segmentation +1

Paper
Add Code

A Joint Model for 2D and 3D Pose Estimation from a Single Image

no code implementations • CVPR 2013 • Edgar Simo-Serra, Ariadna Quattoni, Carme Torras, Francesc Moreno-Noguer

We introduce a novel approach to automatically recover 3D human pose from a single image.

Ranked #25 on 3D Human Pose Estimation on HumanEva-I

3D Human Pose Estimation 3D Pose Estimation +1

Paper
Add Code

Dense Segmentation-Aware Descriptors

no code implementations • CVPR 2013 • Eduard Trulls, Iasonas Kokkinos, Alberto Sanfeliu, Francesc Moreno-Noguer

In this work we exploit segmentation to construct appearance descriptors that can robustly deal with occlusion and background changes.

Motion Estimation Segmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.