Search Results for author: Roberto Cipolla

Found 67 papers, 33 papers with code

NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo

no code implementations • 20 May 2024 • Fotios Logothetis, Ignas Budvytis, Roberto Cipolla

In this work we present a novel multi-view photometric stereo (PS) method.

Paper
Add Code

VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field

1 code implementation • 14 Apr 2024 • Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla

However, in spite of high efficiency, APRs and SCRs have limited accuracy especially in large-scale outdoor scenes; HMs are accurate but need to store a large number of 2D descriptors for matching, resulting in poor efficiency.

Autonomous Driving regression

Paper
Code

PRAM: Place Recognition Anywhere Model for Efficient Visual Localization

no code implementations • 11 Apr 2024 • Fei Xue, Ignas Budvytis, Roberto Cipolla

Humans localize themselves efficiently in known environments by first recognizing landmarks defined on certain objects and their spatial relationships, and then verifying the location by aligning detailed structures of recognized objects with those in the memory.

Landmark Recognition Visual Localization

Paper
Add Code

ReCoRe: Regularized Contrastive Representation Learning of World Model

no code implementations • 14 Dec 2023 • Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla

To address these challenges, we present a world model that learns invariant features using (i) contrastive unsupervised learning and (ii) an intervention-invariant regularizer.

Contrastive Learning Depth Estimation +7

Paper
Add Code

LanGWM: Language Grounded World Model

no code implementations • 29 Nov 2023 • Rudra P. K. Poudel, Harit Pandya, Chao Zhang, Roberto Cipolla

Furthermore, our proposed technique of explicit language-grounded visual representation learning has the potential to improve models for human-robot interaction because our extracted visual features are language grounded.

Model-based Reinforcement Learning Out-of-Distribution Generalization +2

Paper
Add Code

A Neural Height-Map Approach for the Binocular Photometric Stereo Problem

no code implementations • 10 Nov 2023 • Fotios Logothetis, Ignas Budvytis, Roberto Cipolla

As in recent neural multi-view shape estimation frameworks such as NeRF, SIREN and inverse graphics approaches to multi-view photometric stereo (e. g. PS-NeRF) we formulate shape estimation task as learning of a differentiable surface and texture representation by minimising surface normal discrepancy for normals estimated from multiple varying light images for two views as well as discrepancy between rendered surface intensity and observed images.

Paper
Add Code

FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data

1 code implementation • 27 Oct 2023 • Oliver Boyne, Gwangbin Bae, James Charles, Roberto Cipolla

Our FOUND approach tackles this, with 4 main contributions: (i) SynFoot, a synthetic dataset of 50, 000 photorealistic foot images, paired with ground truth surface normals and keypoints; (ii) an uncertainty-aware surface normal predictor trained on our synthetic dataset; (iii) an optimization scheme for fitting a generative foot model to a series of images; and (iv) a benchmark dataset of calibrated images and high resolution ground truth geometry.

Surface Normal Estimation Surface Reconstruction

Paper
Code

Sparse Multi-Object Render-and-Compare

no code implementations • 17 Oct 2023 • Florian Langer, Ignas Budvytis, Roberto Cipolla

Introducing a new network architecture Multi-SPARC we learn to perform CAD model alignments for multiple detected objects jointly.

Object

Paper
Add Code

HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution Estimation

1 code implementation • CVPR 2023 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla

Monocular 3D human pose and shape estimation is an ill-posed problem since multiple 3D solutions can explain a 2D image of a subject.

Ranked #73 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation Normalising Flows

Paper
Code

IMP: Iterative Matching and Pose Estimation with Adaptive Pooling

1 code implementation • CVPR 2023 • Fei Xue, Ignas Budvytis, Roberto Cipolla

Previous methods solve feature matching and pose estimation using a two-stage process by first finding matches and then estimating the pose.

Pose Estimation

130

Paper
Code

SFD2: Semantic-guided Feature Detection and Description

1 code implementation • CVPR 2023 • Fei Xue, Ignas Budvytis, Roberto Cipolla

Visual localization is a fundamental task for various applications including autonomous driving and robotics.

2k 4k +2

101

Paper
Code

FIND: An Unsupervised Implicit 3D Model of Articulated Human Feet

1 code implementation • 21 Oct 2022 • Oliver Boyne, James Charles, Roberto Cipolla

In this paper we present a high fidelity and articulated 3D human foot model.

Disentanglement

Paper
Code

Model-Based Imitation Learning for Urban Driving

1 code implementation • 14 Oct 2022 • Anthony Hu, Gianluca Corrado, Nicolas Griffiths, Zak Murez, Corina Gurau, Hudson Yeo, Alex Kendall, Roberto Cipolla, Jamie Shotton

Our approach is the first camera-only method that models static scene, dynamic scene, and ego-behaviour in an urban driving environment.

Autonomous Driving Bird's-Eye View Semantic Segmentation +3

323

Paper
Code

A CNN Based Approach for the Point-Light Photometric Stereo Problem

no code implementations • 10 Oct 2022 • Fotios Logothetis, Roberto Mecca, Ignas Budvytis, Roberto Cipolla

Reconstructing the 3D shape of an object using several images under different light sources is a very challenging task, especially when realistic assumptions such as light propagation and attenuation, perspective viewing geometry and specular light reflection are considered.

Paper
Add Code

IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty

1 code implementation • 7 Oct 2022 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

The depth of each pixel can be propagated to a query pixel, using the predicted surface normal as guidance.

Ranked #40 on Monocular Depth Estimation on NYU-Depth V2

Monocular Depth Estimation

166

Paper
Code

DigiFace-1M: 1 Million Digital Face Images for Face Recognition

1 code implementation • 5 Oct 2022 • Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien Valentin, Roberto Cipolla, Jingjing Shen

Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet.

Ranked #2 on Synthetic Face Recognition on CPLFW (Accuracy metric)

Attribute Synthetic Data Generation +1

267

Paper
Code

SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB image

1 code implementation • 3 Oct 2022 • Florian Langer, Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

This combined information is the input to a pose prediction network, SPARC-Net which we train to predict a 9 DoF CAD model pose update.

Pose Prediction Retrieval

Paper
Code

Contrastive Unsupervised Learning of World Model with Invariant Causal Features

no code implementations • 29 Sep 2022 • Rudra P. K. Poudel, Harit Pandya, Roberto Cipolla

In particular, we use contrastive unsupervised learning to learn the invariant causal features, which enforces invariance across augmentations of irrelevant parts or styles of the observation.

Data Augmentation Depth Estimation +5

Paper
Add Code

Efficient Large-Scale Localization by Global Instance Recognition

no code implementations • CVPR 2022 • Fei Xue, Ignas Budvytis, Daniel Olmeda Reino, Roberto Cipolla

Hierarchical frameworks consisting of both coarse and fine localization are often used as the standard pipeline for large-scale visual localization.

Visual Localization

Paper
Add Code

Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

1 code implementation • CVPR 2022 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

To this end, we propose MaGNet, a novel framework for fusing single-view depth probability with multi-view geometry, to improve the accuracy, robustness and efficiency of multi-view depth estimation.

Depth Estimation

203

Paper
Code

Discrete neural representations for explainable anomaly detection

no code implementations • 10 Dec 2021 • Stanislaw Szymanowicz, James Charles, Roberto Cipolla

The aim of this work is to detect and automatically generate high-level explanations of anomalous events in video.

Anomaly Detection Object +1

Paper
Add Code

Probabilistic Estimation of 3D Human Shape and Pose with a Semantic Local Parametric Model

no code implementations • 30 Nov 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla

This paper addresses the problem of 3D human body shape and pose estimation from RGB images.

Pose Estimation

Paper
Add Code

Leveraging Geometry for Shape Estimation from a Single RGB Image

1 code implementation • 10 Nov 2021 • Florian Langer, Ignas Budvytis, Roberto Cipolla

In this work we demonstrate how cross-domain keypoint matches from an RGB image to a rendered CAD model allow for more precise object pose predictions compared to ones obtained through direct predictions.

Object Retrieval

Paper
Code

Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild

1 code implementation • ICCV 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla

Thus, it is desirable to estimate a distribution over 3D body shape and pose conditioned on the input image instead of a single 3D reconstruction.

Ranked #1 on 3D Human Shape Estimation on SSP-3D

3D Human Pose Estimation 3D Human Shape Estimation +1

186

Paper
Code

Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

1 code implementation • ICCV 2021 • Gwangbin Bae, Ignas Budvytis, Roberto Cipolla

Experimental results show that the proposed method outperforms the state-of-the-art in ScanNet and NYUv2, and that the estimated uncertainty correlates well with the prediction error.

Ranked #1 on Surface Normal Estimation on NYU-Depth V2

Decoder Scene Understanding +2

209

Paper
Code

X-MAN: Explaining multiple sources of anomalies in video

no code implementations • 16 Jun 2021 • Stanislaw Szymanowicz, James Charles, Roberto Cipolla

In an effort to tackle this problem we make the following contributions: (1) we show how to build interpretable feature representations suitable for detecting anomalies with state of the art performance, (2) we propose an interpretable probabilistic anomaly detector which can describe the reason behind it's response using high level concepts, (3) we are the first to directly consider object interactions for anomaly detection and (4) we propose a new task of explaining anomalies and release a large dataset for evaluating methods on this task.

Anomaly Detection Decision Making

Paper
Add Code

LUCES: A Dataset for Near-Field Point Light Source Photometric Stereo

no code implementations • 27 Apr 2021 • Roberto Mecca, Fotios Logothetis, Ignas Budvytis, Roberto Cipolla

In order to fill the gap in evaluating near-field photometric stereo methods, we introduce LUCES the first real-world 'dataset for near-fieLd point light soUrCe photomEtric Stereo' of 14 objects of a varying of materials.

Paper
Add Code

FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras

1 code implementation • ICCV 2021 • Anthony Hu, Zak Murez, Nikhil Mohan, Sofía Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall

We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras.

Ranked #1 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU veh - 224x480 - No vis filter - 100x50 at 0.25 metric)

Autonomous Driving Bird's-Eye View Semantic Segmentation +4

543

Paper
Code

Probabilistic 3D Human Shape and Pose Estimation from Multiple Unconstrained Images in the Wild

no code implementations • CVPR 2021 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla

In contrast, we propose a new task: shape and pose estimation from a group of multiple images of a human subject, without constraints on subject pose, camera viewpoint or background conditions between images in the group.

Ranked #3 on 3D Human Shape Estimation on SSP-3D

3D Human Shape Estimation Pose Prediction

Paper
Add Code

Synthetic Training for Accurate 3D Human Pose and Shape Estimation in the Wild

1 code implementation • 21 Sep 2020 • Akash Sengupta, Ignas Budvytis, Roberto Cipolla

Thus, we propose STRAPS (Synthetic Training for Real Accurate Pose and Shape), a system that utilises proxy representations, such as silhouettes and 2D joints, as inputs to a shape and pose regression neural network, which is trained with synthetic training data (generated on-the-fly during training using the SMPL statistical body model) to overcome data scarcity.

Ranked #1 on 3D Human Shape Estimation on MoVi

3D human pose and shape estimation 3D Human Shape Estimation +3

163

Paper
Code

A CNN Based Approach for the Near-Field Photometric Stereo Problem

no code implementations • 12 Sep 2020 • Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla

Secondly, we compute the depth by integrating the normal field in order to iteratively estimate light directions and attenuation which is used to compensate the input images to compute reflectance samples for the next iteration.

Paper
Add Code

Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments

no code implementations • 11 Sep 2020 • Steven D. Morad, Roberto Mecca, Rudra P. K. Poudel, Stephan Liwicki, Roberto Cipolla

We present NavACL, a method of automatic curriculum learning tailored to the navigation task.

Navigate reinforcement-learning +2

Paper
Add Code

PX-NET: Simple and Efficient Pixel-Wise Training of Photometric Stereo Networks

no code implementations • ICCV 2021 • Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla

We show that global physical effects can be approximated on the observation map domain and this simplifies and speeds up the data creation procedure.

Data Augmentation

Paper
Add Code

Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop

2 code implementations • ECCV 2020 • Benjamin Biggs, Oliver Boyne, James Charles, Andrew Fitzgibbon, Roberto Cipolla

We introduce an automatic, end-to-end method for recovering the 3D pose and shape of dogs from monocular internet images.

Paper
Code

Predicting Semantic Map Representations from Images using Pyramid Occupancy Networks

1 code implementation • CVPR 2020 • Thomas Roddick, Roberto Cipolla

Autonomous vehicles commonly rely on highly detailed birds-eye-view maps of their environment, which capture both static elements of the scene such as road layout as well as dynamic elements such as other cars and pedestrians.

3D Object Detection Autonomous Vehicles +2

299

Paper
Code

Learning a Spatio-Temporal Embedding for Video Instance Segmentation

1 code implementation • 19 Dec 2019 • Anthony Hu, Alex Kendall, Roberto Cipolla

We present a novel embedding approach for video instance segmentation.

Instance Segmentation Semantic Segmentation +1

Paper
Code

Large Scale Joint Semantic Re-Localisation and Scene Understanding via Globally Unique Instance Coordinate Regression

no code implementations • 23 Sep 2019 • Ignas Budvytis, Marvin Teichmann, Tomas Vojir, Roberto Cipolla

We obtain smaller mean distance and angular errors than state-of-the-art 6-DoF pose estimation algorithms based on direct pose regression and pose estimation from scene coordinates on all datasets.

Autonomous Driving Pose Estimation +2

Paper
Add Code

Orientation-aware Semantic Segmentation on Icosahedron Spheres

1 code implementation • ICCV 2019 • Chao Zhang, Stephan Liwicki, William Smith, Roberto Cipolla

For the spherical domain, several methods recently adopt an icosahedron mesh, but systems are typically rotation invariant or require significant memory and parameters, thus enabling execution only at very low resolutions.

Ranked #21 on Semantic Segmentation on Stanford2D3D Panoramic

Autonomous Driving Semantic Segmentation

Paper
Code

Fast-SCNN: Fast Semantic Segmentation Network

25 code implementations • 12 Feb 2019 • Rudra P. K. Poudel, Stephan Liwicki, Roberto Cipolla

The encoder-decoder framework is state-of-the-art for offline semantic image segmentation.

Ranked #7 on Semantic Segmentation on SynPASS

Decoder Image Segmentation +3

8,339

Paper
Code

Orthographic Feature Transform for Monocular 3D Object Detection

1 code implementation • 20 Nov 2018 • Thomas Roddick, Alex Kendall, Roberto Cipolla

This allows us to reason holistically about the spatial configuration of the scene in a domain where scale is consistent and distances between objects are meaningful.

3D Object Detection From Monocular Images Monocular 3D Object Detection +2

197

Paper
Code

Creatures great and SMAL: Recovering the shape and motion of animals from video

no code implementations • 14 Nov 2018 • Benjamin Biggs, Thomas Roddick, Andrew Fitzgibbon, Roberto Cipolla

We present a system to recover the 3D shape and motion of a wide variety of quadrupeds from video.

Video Segmentation Video Semantic Segmentation

Paper
Add Code

A Differential Volumetric Approach to Multi-View Photometric Stereo

no code implementations • ICCV 2019 • Fotios Logothetis, Roberto Mecca, Roberto Cipolla

In this work, we present a volumetric approach to the multi-view photometric stereo problem.

3D Volumetric Reconstruction

Paper
Add Code

Convolutional CRFs for Semantic Segmentation

1 code implementation • ICLR 2019 • Marvin T. T. Teichmann, Roberto Cipolla

For the challenging semantic image segmentation task the most efficient models have traditionally combined the structured modelling capabilities of Conditional Random Fields (CRFs) with the feature extraction power of CNNs.

Image Segmentation Segmentation +1

562

Paper
Code

Semi-Calibrated Near Field Photometric Stereo

no code implementations • CVPR 2017 • Fotios Logothetis, Roberto Mecca, Roberto Cipolla

3D reconstruction from shading information through Photometric Stereo is considered a very challenging problem in Computer Vision.

3D Reconstruction

Paper
Add Code

Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics

16 code implementations • CVPR 2018 • Alex Kendall, Yarin Gal, Roberto Cipolla

Numerous deep learning applications benefit from multi-task learning with multiple regression and classification objectives.

General Classification Instance Segmentation +3

834

Paper
Code

Geometric Loss Functions for Camera Pose Regression with Deep Learning

1 code implementation • CVPR 2017 • Alex Kendall, Roberto Cipolla

Deep learning has shown to be effective for robust and real-time monocular image relocalisation.

Position regression

Paper
Code

MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving

16 code implementations • 22 Dec 2016 • Marvin Teichmann, Michael Weber, Marius Zoellner, Roberto Cipolla, Raquel Urtasun

While most approaches to semantic reasoning have focused on improving performance, in this paper we argue that computational times are very important in order to enable real time applications such as autonomous driving.

Autonomous Driving General Classification +2

1,842

Paper
Code

Understanding Real World Indoor Scenes With Synthetic Data

no code implementations • CVPR 2016 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla

Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.

Scene Understanding

Paper
Add Code

Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

no code implementations • CVPR 2017 • Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root.

Paper
Add Code

Refining Architectures of Deep Convolutional Neural Networks

no code implementations • CVPR 2016 • Sukrit Shankar, Duncan Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla

Deep Convolutional Neural Networks (CNNs) have recently evinced immense success for various image recognition tasks.

Paper
Add Code

SceneNet: Understanding Real World Indoor Scenes With Synthetic Data

1 code implementation • 22 Nov 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla

Scene understanding is a prerequisite to many high level tasks for any automated intelligent machine operating in real world environments.

Scene Understanding

135

Paper
Code

Training CNNs with Low-Rank Filters for Efficient Image Classification

no code implementations • 20 Nov 2015 • Yani Ioannou, Duncan Robertson, Jamie Shotton, Roberto Cipolla, Antonio Criminisi

Applying our method to a near state-of-the-art network for CIFAR, we achieved comparable accuracy with 46% less compute and 55% fewer parameters.

Classification General Classification +1

Paper
Add Code

Spatio-temporal video autoencoder with differentiable memory

1 code implementation • 19 Nov 2015 • Viorica Patraucean, Ankur Handa, Roberto Cipolla

At each time step, the system receives as input a video frame, predicts the optical flow based on the current observation and the LSTM memory state as a dense transformation map, and applies it to the current frame to generate the next frame.

Decoder Motion Estimation +3

288

Paper
Code

TemplateNet for Depth-Based Object Instance Recognition

no code implementations • 10 Nov 2015 • Ujwal Bonde, Vijay Badrinarayanan, Roberto Cipolla, Minh-Tri Pham

We present a novel deep architecture termed templateNet for depth based object instance recognition.

Object

Paper
Add Code

Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding

22 code implementations • 9 Nov 2015 • Alex Kendall, Vijay Badrinarayanan, Roberto Cipolla

Semantic segmentation is an important tool for visual scene understanding and a meaningful measure of uncertainty is essential for decision making.

Decision Making Decoder +3

1,079

Paper
Code

Symmetry-invariant optimization in deep networks

no code implementations • 5 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla

Recent works have highlighted scale invariance or symmetry that is present in the weight space of a typical deep network and the adverse effect that it has on the Euclidean gradient based stochastic gradient descent optimization.

Computational Efficiency Image Segmentation +1

Paper
Add Code

Understanding symmetries in deep networks

no code implementations • 3 Nov 2015 • Vijay Badrinarayanan, Bamdev Mishra, Roberto Cipolla

Consequently, training the network boils down to using stochastic gradient descent updates on the unit-norm manifold.

Computational Efficiency

Paper
Add Code

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

74 code implementations • 2 Nov 2015 • Vijay Badrinarayanan, Alex Kendall, Roberto Cipolla

We show that SegNet provides good performance with competitive inference time and more efficient inference memory-wise as compared to other architectures.

Ranked #3 on Medical Image Segmentation on RITE

Crowd Counting Decoder +7

8,339

Paper
Code

Modelling Uncertainty in Deep Learning for Camera Relocalization

1 code implementation • 19 Sep 2015 • Alex Kendall, Roberto Cipolla

Using a Bayesian convolutional neural network implementation we obtain an estimate of the model's relocalization uncertainty and improve state of the art localization accuracy on a large scale outdoor dataset.

Camera Relocalization

493

Paper
Code

PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization

6 code implementations • ICCV 2015 • Alex Kendall, Matthew Grimes, Roberto Cipolla

We present a robust and real-time monocular six degree of freedom relocalization system.

Camera Relocalization Transfer Learning

493

Paper
Code

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling

5 code implementations • 27 May 2015 • Vijay Badrinarayanan, Ankur Handa, Roberto Cipolla

These methods lack a mechanism to map deep layer feature maps to input dimensions.

Decoder Object Categorization

101

Paper
Code

SynthCam3D: Semantic Understanding With Synthetic Indoor Scenes

no code implementations • 1 May 2015 • Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla

We are interested in automatic scene understanding from geometric cues.

Scene Understanding Segmentation +1

Paper
Add Code

DEEP-CARVING: Discovering Visual Attributes by Carving Deep Neural Nets

no code implementations • CVPR 2015 • Sukrit Shankar, Vikas K. Garg, Roberto Cipolla

To ameliorate this limitation, we propose Deep-Carving, a novel training procedure with CNNs, that helps the net efficiently carve itself for the task of multiple attribute prediction.

Attribute Image Retrieval

Paper
Add Code

Bi-label Propagation for Generic Multiple Object Tracking

no code implementations • CVPR 2014 • Wenhan Luo, Tae-Kyun Kim, Bjorn Stenger, Xiaowei Zhao, Roberto Cipolla

In this paper, we propose a label propagation framework to handle the multiple object tracking (MOT) problem for a generic object type (cf.

Multiple Object Tracking Object

Paper
Add Code

Expressive Visual Text-to-Speech Using Active Appearance Models

no code implementations • CVPR 2013 • Robert Anderson, Bjorn Stenger, Vincent Wan, Roberto Cipolla

This paper presents a complete system for expressive visual text-to-speech (VTTS), which is capable of producing expressive output, in the form of a 'talking head', given an input text and a set of continuous expression weights.

Paper
Add Code

Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest

no code implementations • CVPR 2013 • Tsz-Ho Yu, Tae-Kyun Kim, Roberto Cipolla

This work addresses the challenging problem of unconstrained 3D human pose estimation (HPE) from a novel perspective.

2D Pose Estimation Action Detection +2

Paper
Add Code

MCBoost: Multiple Classifier Boosting for Perceptual Co-clustering of Images and Visual Features

no code implementations • NeurIPS 2008 • Tae-Kyun Kim, Roberto Cipolla

We present a new co-clustering problem of images and visual features.

Clustering Object +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.