Search Results for author: Mathieu Salzmann

Found 180 papers, 68 papers with code

DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses

1 code implementation 20 Mar 2024 Chen Zhao, Tong Zhang, Zheng Dang, Mathieu Salzmann

Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation.

Object Pose Estimation

CLOAF: CoLlisiOn-Aware Human Flow

no code implementations 14 Mar 2024 Andrey Davydov, Martin Engilberge, Mathieu Salzmann, Pascal Fua

Even the best current algorithms for estimating body 3D shape and pose yield results that include body self-intersections.

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

1 code implementation 26 Feb 2024 Haozhe Qi, Chen Zhao, Mathieu Salzmann, Alexander Mathis

These representations are typically explicit, such as 3D point clouds or meshes, and thus provide information in the direct surroundings of the intermediate hand pose estimate.

 Ranked #1 on hand-object pose on HO-3D (using extra training data)

hand-object pose Object +1

Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes

no code implementations 5 Feb 2024 Andrey Davydov, Alexey Sidnev, Artsiom Sanakoyeu, Yuhua Chen, Mathieu Salzmann, Pascal Fua

When enough annotated training data is available, supervised deep-learning algorithms excel at estimating human body pose and shape using a single camera.

Optical Flow Estimation

DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration

no code implementations 5 Dec 2023 Zhi Chen, Yufan Ren, Tong Zhang, Zheng Dang, Wenbing Tao, Sabine Süsstrunk, Mathieu Salzmann

We propose formulating PCR as a denoising diffusion probabilistic process, mapping noisy transformations to the ground truth.

Denoising Point Cloud Registration

GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence

1 code implementation 23 Nov 2023 Van Nguyen Nguyen, Thibault Groueix, Mathieu Salzmann, Vincent Lepetit

We present GigaPose, a fast, robust, and accurate method for CAD-based novel object pose estimation in RGB images.

3D Reconstruction Pose Estimation

TIC-TAC: A Framework To Learn And Evaluate Your Covariance

1 code implementation 29 Oct 2023 Megh Shukla, Mathieu Salzmann, Alexandre Alahi

We study the problem of unsupervised heteroscedastic covariance estimation, where the goal is to learn the multivariate target distribution $\mathcal{N}(y, \Sigma_y | x )$ given an observation $x$.

Pose Estimation

3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation

no code implementations 5 Oct 2023 Chen Zhao, Tong Zhang, Mathieu Salzmann

Our goal then is to estimate the relative object pose between this reference view and a query image that depicts the object in a different pose.

Object Pose Estimation

Understanding Pose and Appearance Disentanglement in 3D Human Pose Estimation

no code implementations 20 Sep 2023 Krishna Kanth Nakka, Mathieu Salzmann

In this paper, we carry out in-depth analysis to understand to what degree the state-of-the-art disentangled representation learning methods truly separate the appearance information from the pose one.

3D Human Pose Estimation Adversarial Attack +2

AutoSynth: Learning to Generate 3D Training Data for Object Point Cloud Registration

no code implementations ICCV 2023 Zheng Dang, Mathieu Salzmann

Specifically, AutoSynth automatically curates an optimal dataset by exploring a search space encompassing millions of potential datasets with diverse 3D shapes at a low cost. To achieve this, we generate synthetic 3D datasets by assembling shape primitives, and develop a meta-learning strategy to search for the best training data for 3D registration on real point clouds.

Meta-Learning Point Cloud Registration

Vision Transformer Adapters for Generalizable Multitask Learning

no code implementations ICCV 2023 Deblina Bhattacharjee, Sabine Süsstrunk, Mathieu Salzmann

We introduce the first multitasking vision transformer adapters that learn generalizable task affinities which can be applied to novel tasks and domains.

Unsupervised Domain Adaptation

Dense Multitask Learning to Reconfigure Comics

no code implementations 16 Jul 2023 Deblina Bhattacharjee, Sabine Süsstrunk, Mathieu Salzmann

In this paper, we develop a MultiTask Learning (MTL) model to achieve dense predictions for comics panels to, in turn, facilitate the transfer of comics from one publication channel to another by assisting authors in the task of reconfiguring their narratives.

Unsupervised Image-To-Image Translation

LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields

1 code implementation 20 Apr 2023 Tang Tao, Longfei Gao, Guangrun Wang, Yixing Lao, Peng Chen, Hengshuang Zhao, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu

We address this challenge by formulating, to the best of our knowledge, the first differentiable end-to-end LiDAR rendering framework, LiDAR-NeRF, leveraging a neural radiance field (NeRF) to facilitate the joint learning of geometry and the attributes of 3D points.

3D Reconstruction Novel View Synthesis +1

Robust Outlier Rejection for 3D Registration with Variational Bayes

1 code implementation CVPR 2023 Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann

Embedded with the inlier/outlier label, the posterior feature distribution is label-dependent and discriminative.

Bayesian Inference

De-coupling and De-positioning Dense Self-supervised Learning

no code implementations 29 Mar 2023 Congpei Qiu, Tong Zhang, Wei Ke, Mathieu Salzmann, Sabine Süsstrunk

Dense Self-Supervised Learning (SSL) methods address the limitations of using image-level feature representations when handling images with multiple objects.

Data Augmentation Object +5

NOPE: Novel Object Pose Estimation from a Single Image

1 code implementation 23 Mar 2023 Van Nguyen Nguyen, Thibault Groueix, Yinlin Hu, Mathieu Salzmann, Vincent Lepetit

The practicality of 3D object pose estimation remains limited for many applications due to the need for prior knowledge of a 3D model and a training period for new objects.

Object Pose Estimation

Rigidity-Aware Detection for 6D Object Pose Estimation

2 code implementations CVPR 2023 Yang Hai, Rui Song, Jiaojiao Li, Mathieu Salzmann, Yinlin Hu

To address this, we propose a rigidity-aware detection method exploiting the fact that, in 6D pose estimation, the target objects are rigid.

6D Pose Estimation 6D Pose Estimation using RGB +3

Linear-Covariance Loss for End-to-End Learning of 6D Pose Estimation

1 code implementation ICCV 2023 Fulin Liu, Yinlin Hu, Mathieu Salzmann

Here, we argue that this conflicts with the averaging nature of the PnP problem, leading to gradients that may encourage the network to degrade the accuracy of individual correspondences.

6D Pose Estimation 6D Pose Estimation using RGB

MixCycle: Mixup Assisted Semi-Supervised 3D Single Object Tracking with Cycle Consistency

1 code implementation ICCV 2023 Qiao Wu, Jiaqi Yang, Kun Sun, Chu'ai Zhang, Yanning Zhang, Mathieu Salzmann

Specifically, we introduce two cycle-consistency strategies for supervision: 1) Self tracking cycles, which leverage labels to help the model converge better in the early stages of training; 2) forward-backward cycles, which strengthen the tracker's robustness to motion variations and the template noise caused by the template update strategy.

3D Single Object Tracking Data Augmentation +1

Modular Quantization-Aware Training: Increasing Accuracy by Decreasing Precision in 6D Object Pose Estimation

no code implementations 12 Mar 2023 Saqib Javed, Chengkun Li, Andrew Price, Yinlin Hu, Mathieu Salzmann

Edge applications, such as collaborative robotics and spacecraft rendezvous, demand efficient 6D object pose estimation on resource-constrained embedded platforms.

6D Pose Estimation 6D Pose Estimation using RGB +1

CLIP the Gap: A Single Domain Generalization Approach for Object Detection

1 code implementation CVPR 2023 Vidit Vidit, Martin Engilberge, Mathieu Salzmann

Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain.

Domain Generalization Image Classification +5

TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction

no code implementations 5 Jan 2023 Bahar Aydemir, Ludo Hoffstetter, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk

Deep saliency prediction algorithms complement object recognition features; they typically rely on additional information, such as scene context, semantic relationships, gaze direction, and object dissimilarity.

Object Object Recognition +1

TempSAL - Uncovering Temporal Information for Deep Saliency Prediction

no code implementations CVPR 2023 Bahar Aydemir, Ludo Hoffstetter, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk

Deep saliency prediction algorithms complement object recognition features; they typically rely on additional information such as scene context, semantic relationships, gaze direction, and object dissimilarity.

Object Object Recognition +1

Center-Based Decoupled Point-cloud Registration for 6D Object Pose Estimation

no code implementations ICCV 2023 Haobo Jiang, Zheng Dang, Shuo Gu, Jin Xie, Mathieu Salzmann, Jian Yang

Our method decouples the translation from the entire transformation by predicting the object center and estimating the rotation in a center-aware manner.

6D Pose Estimation using RGB Object +2

DSI2I: Dense Style for Unpaired Image-to-Image Translation

no code implementations 26 Dec 2022 Baran Ozaydin, Tong Zhang, Sabine Süsstrunk, Mathieu Salzmann

Unpaired exemplar-based image-to-image (UEI2I) translation aims to translate a source image to a target image domain with the style of a target image exemplar, without ground-truth input-translation pairs.

Image-to-Image Translation Translation

LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation

no code implementations 29 Nov 2022 Chen Zhao, Yinlin Hu, Mathieu Salzmann

The prior can be used to initialize the 3D object translation and facilitate 3D object rotation estimation.

6D Pose Estimation 6D Pose Estimation using RGB +4

Unsupervised 3D Keypoint Discovery with Multi-View Geometry

no code implementations 23 Nov 2022 Sina Honari, Chen Zhao, Mathieu Salzmann, Pascal Fua

Analyzing and training 3D body posture models depend heavily on the availability of joint labels that are commonly acquired through laborious manual annotation of body joints or via marker-based joint localization using carefully curated markers and capturing systems.

3D Human Pose Estimation Keypoint Estimation +1

DrapeNet: Garment Generation and Self-Supervised Draping

1 code implementation CVPR 2023 Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua

Recent approaches to drape garments quickly over arbitrary human bodies leverage self-supervision to eliminate the need for large training sets.

Contact-aware Human Motion Forecasting

1 code implementation 8 Oct 2022 Wei Mao, Miaomiao Liu, Richard Hartley, Mathieu Salzmann

In this paper, we tackle the task of scene-aware 3D human motion forecasting, which consists of predicting future human poses given a 3D scene and a past human motion.

Human Pose Forecasting Motion Forecasting

Perspective Aware Road Obstacle Detection

1 code implementation 4 Oct 2022 Krzysztof Lis, Sina Honari, Pascal Fua, Mathieu Salzmann

While road obstacle detection techniques have become increasingly effective, they typically ignore the fact that, in practice, the apparent size of the obstacles decreases as their distance to the vehicle increases.

3D Pose Based Feedback for Physical Exercises

1 code implementation 5 Aug 2022 Ziyi Zhao, Sena Kiciroglu, Hugues Vinzant, Yuan Cheng, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua

To evaluate our approach, we introduce a dataset with 3 different physical exercises.

Fast Adversarial Training with Adaptive Step Size

no code implementations 6 Jun 2022 Zhichao Huang, Yanbo Fan, Chen Liu, Weizhong Zhang, Yong Zhang, Mathieu Salzmann, Sabine Süsstrunk, Jue Wang

While adversarial training and its variants have been shown to be the most effective algorithms to defend against adversarial attacks, their extremely slow training process makes it hard to scale to large datasets like ImageNet.

Weakly-supervised Action Transition Learning for Stochastic Human Motion Prediction

1 code implementation CVPR 2022 Wei Mao, Miaomiao Liu, Mathieu Salzmann

We introduce the task of action-driven stochastic human motion prediction, which aims to predict multiple plausible future motions given a sequence of action labels and a short motion history.

motion prediction Stochastic Human Motion Prediction

Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions

2 code implementations CVPR 2022 Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit

It relies on a small set of training objects to learn local object representations, which allow us to locally match the input image to a set of "templates", rendered images of the CAD models for the new objects.

6D Pose Estimation 6D Pose Estimation using RGB +1

Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy

no code implementations CVPR 2022 Tong Zhang, Congpei Qiu, Wei Ke, Sabine Süsstrunk, Mathieu Salzmann

In essence, this strategy ignores the fact that two crops may truly contain different image information, e.g., background and small objects, and thus tends to restrain the diversity of the learned representations.

Self-Supervised Learning Transfer Learning

MatchNorm: Learning-based Point Cloud Registration for 6D Object Pose Estimation in the Real World

no code implementations 29 Mar 2022 Zheng Dang, Lizhou Wang, Yu Guo, Mathieu Salzmann

Our two contributions are general and can be applied to many existing learning-based 3D object registration frameworks, which we illustrate by implementing them in two such frameworks, DCP and IDAM.

6D Pose Estimation using RGB Object +1

Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation

1 code implementation 18 Mar 2022 Yinlin Hu, Pascal Fua, Mathieu Salzmann

Given a rough pose estimate obtained from a first network, it uses a second network to predict a dense 2D correspondence field between the image rendered using the rough pose and the real image and infers the required pose correction.

6D Pose Estimation using RGB

Fusing Local Similarities for Retrieval-based 3D Orientation Estimation of Unseen Objects

no code implementations 16 Mar 2022 Chen Zhao, Yinlin Hu, Mathieu Salzmann

In this paper, we tackle the task of estimating the 3D orientation of previously-unseen objects from monocular images.

Retrieval

Robust Binary Models by Pruning Randomly-initialized Networks

1 code implementation 3 Feb 2022 Chen Liu, Ziqi Zhao, Sabine Süsstrunk, Mathieu Salzmann

In this paper, we introduce an approach to obtain robust yet compact models by pruning randomly-initialized binary networks.

On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training

no code implementations 14 Dec 2021 Chen Liu, Zhichao Huang, Mathieu Salzmann, Tong Zhang, Sabine Süsstrunk

This lets us show that the decay in generalization performance of adversarial training is a result of the model's attempt to fit hard adversarial instances.

Adversarial Parametric Pose Prior

no code implementations CVPR 2022 Andrey Davydov, Anastasia Remizova, Victor Constantin, Sina Honari, Mathieu Salzmann, Pascal Fua

The Skinned Multi-Person Linear (SMPL) model can represent a human body by mapping pose and shape parameters to body meshes.

3D Reconstruction

Learning Transferable Adversarial Perturbations

1 code implementation NeurIPS 2021 Krishna Kanth Nakka, Mathieu Salzmann

While effective, deep neural networks (DNNs) are vulnerable to adversarial attacks.

Dyadic Human Motion Prediction

no code implementations 1 Dec 2021 Isinsu Katircioglu, Costa Georgantas, Mathieu Salzmann, Pascal Fua

To evaluate this, and because no existing motion prediction datasets depict two closely-interacting subjects, we introduce the LindyHop600K dance dataset.

Human motion prediction Motion Forecasting +1

What Stops Learning-based 3D Registration from Working in the Real World?

no code implementations 19 Nov 2021 Zheng Dang, Lizhou Wang, Junning Qiu, Minglei Lu, Mathieu Salzmann

We summarise our findings into a set of guidelines and demonstrate their effectiveness by applying them to different baseline methods, DCP and IDAM.

Point Cloud Registration

Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases

1 code implementation 12 Nov 2021 Jan Bednarik, Noam Aigerman, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua

The key to making these correspondences semantically meaningful is to guarantee that the metric tensors computed at corresponding points are as similar as possible.

Surface Reconstruction

Estimating Image Depth in the Comics Domain

1 code implementation 7 Oct 2021 Deblina Bhattacharjee, Martin Everaert, Mathieu Salzmann, Sabine Süsstrunk

Estimating the depth of comics images is challenging as such images a) are monocular; b) lack ground-truth depth annotations; c) differ across different artistic styles; d) are sparse and noisy.

Depth Estimation Depth Prediction +2

An Analysis of Super-Net Heuristics in Weight-Sharing NAS

no code implementations 4 Oct 2021 Kaicheng Yu, René Ranftl, Mathieu Salzmann

Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware.

Neural Architecture Search

Improving Adversarial Defense with Self-supervised Test-time Fine-tuning

no code implementations 29 Sep 2021 Zhichao Huang, Chen Liu, Mathieu Salzmann, Sabine Süsstrunk, Tong Zhang

Although adversarial training and its variants currently constitute the most effective way to achieve robustness against adversarial attacks, their poor generalization limits their performance on the test samples.

Adversarial Defense

Generating Smooth Pose Sequences for Diverse Human Motion Prediction

1 code implementation ICCV 2021 Wei Mao, Miaomiao Liu, Mathieu Salzmann

Recent progress in stochastic motion prediction, i.e., predicting multiple possible future human motions given a single past pose sequence, has led to producing truly diverse future motions and even providing control over the motion of some body parts.

Ranked #2 on Human Pose Forecasting on AMASS (ADE metric)

Human motion prediction Human Pose Forecasting +2

Multi-level Motion Attention for Human Motion Prediction

1 code implementation 17 Jun 2021 Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li

Whether based on recurrent or feed-forward neural networks, existing learning based methods fail to model the observation that human motion tends to repeat itself, even for complex sports actions and cooking activities.

Human motion prediction motion prediction

Attention-based Domain Adaptation for Single Stage Detectors

no code implementations 14 Jun 2021 Vidit Vidit, Mathieu Salzmann

While domain adaptation has been used to improve the performance of object detectors when the training and test data follow different distributions, previous work has mostly focused on two-stage detectors.

Domain Adaptation

Distilling Image Classifiers in Object Detectors

1 code implementation NeurIPS 2021 Shuxuan Guo, Jose M. Alvarez, Mathieu Salzmann

Knowledge distillation constitutes a simple yet effective way to improve the performance of a compact student network by exploiting the knowledge of a more powerful teacher.

Knowledge Distillation Object +3

DAAIN: Detection of Anomalous and Adversarial Input using Normalizing Flows

1 code implementation 30 May 2021 Samuel von Baußnern, Johannes Otterbach, Adrian Loy, Mathieu Salzmann, Thomas Wollmann

We demonstrate the effectiveness of our approach using an ESPNet trained on the Cityscapes dataset as segmentation model, an affine Normalizing Flow as density estimator and use blue noise to ensure homogeneous sampling.

Image Segmentation Segmentation +1

SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation

2 code implementations 30 Apr 2021 Robin Chan, Krzysztof Lis, Svenja Uhlemeyer, Hermann Blum, Sina Honari, Roland Siegwart, Pascal Fua, Mathieu Salzmann, Matthias Rottmann

State-of-the-art semantic or instance segmentation deep neural networks (DNNs) are usually trained on a closed set of semantic classes.

Instance Segmentation Object +2

Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases

1 code implementation ICCV 2021 Jan Bednarik, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua, Noam Aigerman

We propose a method for the unsupervised reconstruction of a temporally-coherent sequence of surfaces from a sequence of time-evolving point clouds, yielding dense, semantically meaningful correspondences between all keyframes.

Surface Reconstruction

Landmark Regularization: Ranking Guided Super-Net Training in Neural Architecture Search

1 code implementation CVPR 2021 Kaicheng Yu, Rene Ranftl, Mathieu Salzmann

Weight sharing has become a de facto standard in neural architecture search because it enables the search to be done on commodity hardware.

Neural Architecture Search

Modeling Object Dissimilarity for Deep Saliency Prediction

1 code implementation 8 Apr 2021 Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang, Seungryong Kim, Mathieu Salzmann, Sabine Süsstrunk

Saliency prediction has made great strides over the past two decades, with current techniques modeling low-level information, such as color, intensity and size contrasts, and high-level ones, such as attention and gaze direction for entire objects.

Object Saliency Prediction

Robust Differentiable SVD

2 code implementations 8 Apr 2021 Wei Wang, Zheng Dang, Yinlin Hu, Pascal Fua, Mathieu Salzmann

Eigendecomposition of symmetric matrices is at the heart of many computer vision algorithms.

Image Classification Style Transfer

Progressive Correspondence Pruning by Consensus Learning

1 code implementation ICCV 2021 Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann

Correspondence selection aims to correctly select the consistent matches (inliers) from an initial set of putative correspondences.

Denoising Pose Estimation +1

Learning to Generate the Unknowns for Open-set Domain Adaptation

no code implementations 1 Jan 2021 Mahsa Baktashmotlagh, Tianle Chen, Mathieu Salzmann

In this setting, existing techniques focus on the challenging task of isolating the unknown target samples, so as to avoid the negative transfer resulting from aligning the source feature distributions with the broader target one that encompasses the additional unknown classes.

Domain Adaptation

Temporally-Transferable Perturbations: Efficient, One-Shot Adversarial Attacks for Online Visual Object Trackers

no code implementations 30 Dec 2020 Krishna Kanth Nakka, Mathieu Salzmann

While these methods were shown to be vulnerable to adversarial attacks, as most deep networks for visual recognition tasks, the existing attacks for VOT trackers all require perturbing the search region of every input frame to be effective, which comes at a non-negligible cost, considering that VOT is a real-time task.

Visual Object Tracking

Detecting Road Obstacles by Erasing Them

no code implementations 25 Dec 2020 Krzysztof Lis, Sina Honari, Pascal Fua, Mathieu Salzmann

Vehicles can encounter a myriad of obstacles on the road, and it is impossible to record them all beforehand to train a detector.

Unsupervised Domain Adaptation with Temporal-Consistent Self-Training for 3D Hand-Object Joint Reconstruction

no code implementations 21 Dec 2020 Mengshi Qi, Edoardo Remelli, Mathieu Salzmann, Pascal Fua

Deep learning solutions for hand-object 3D pose and shape estimation are now very effective when an annotated dataset is available to train them to handle the scenarios and lighting conditions they will encounter at test time.

Generative Adversarial Network Unsupervised Domain Adaptation

Long Term Motion Prediction Using Keyposes

1 code implementation 8 Dec 2020 Sena Kiciroglu, Wei Wang, Mathieu Salzmann, Pascal Fua

Long term human motion prediction is essential in safety-critical applications such as human-robot interaction and autonomous driving.

Autonomous Driving Human motion prediction +1

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking

no code implementations CVPR 2021 Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould

Despite the recent advances in multiple object tracking (MOT), achieved by joint detection and tracking, dealing with long occlusions remains a challenge.

Multiple Object Tracking

Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation

no code implementations 2 Dec 2020 Sina Honari, Victor Constantin, Helge Rhodin, Mathieu Salzmann, Pascal Fua

In this paper, we propose an unsupervised feature extraction method to capture temporal information on monocular videos, where we detect and encode the subject of interest in each frame and leverage contrastive self-supervised (CSS) learning to extract rich latent vectors.

3D Human Pose Estimation 3D Pose Estimation +1

Counting People by Estimating People Flows

1 code implementation 1 Dec 2020 Weizhe Liu, Mathieu Salzmann, Pascal Fua

Modern methods for counting people in crowded scenes rely on deep networks to estimate people densities in individual images.

Active Learning Crowd Counting +1

PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers

1 code implementation CVPR 2021 Frank Yu, Mathieu Salzmann, Pascal Fua, Helge Rhodin

Our conclusion is that it is important to utilize camera calibration information when available, for classical and deep-learning-based computer vision alike.

3D Reconstruction Camera Calibration

A Closed-Form Solution to Local Non-Rigid Structure-from-Motion

no code implementations 23 Nov 2020 Shaifali Parashar, Yuxuan Long, Mathieu Salzmann, Pascal Fua

A recent trend in Non-Rigid Structure-from-Motion (NRSfM) is to express local, differential constraints between pairs of images, from which the surface normal at any point can be obtained by solving a system of polynomial equations.

3D Registration for Self-Occluded Objects in Context

no code implementations 23 Nov 2020 Zheng Dang, Fei Wang, Mathieu Salzmann

While much progress has been made on the task of 3D point cloud registration, there still exists no learning-based method able to estimate the 6D pose of an object observed by a 2.5D sensor in a scene.

Instance Segmentation Point Cloud Registration +2

Self-supervised Segmentation via Background Inpainting

no code implementations 11 Nov 2020 Isinsu Katircioglu, Helge Rhodin, Victor Constantin, Jörg Spörri, Mathieu Salzmann, Pascal Fua

While supervised object detection and segmentation methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on.

Human Detection Object +4

Better Patch Stitching for Parametric Surface Reconstruction

1 code implementation 14 Oct 2020 Zhantao Deng, Jan Bednařík, Mathieu Salzmann, Pascal Fua

We introduce an approach that explicitly encourages global consistency of the local mappings.

Surface Reconstruction

Motion Prediction Using Temporal Inception Module

1 code implementation 6 Oct 2020 Tim Lebailly, Sena Kiciroglu, Mathieu Salzmann, Pascal Fua, Wei Wang

We argue that the diverse temporal scales are important as they allow us to look at the past frames with different receptive fields, which can lead to better predictions.

Autonomous Driving Human motion prediction +1

Robust RGB-based 6-DoF Pose Estimation without Real Pose Annotations

no code implementations 19 Aug 2020 Zhigang Li, Yinlin Hu, Mathieu Salzmann, Xiangyang Ji

We achieve state-of-the-art performance on LINEMOD and Occluded-LINEMOD in the setting without real pose annotations, even outperforming methods that rely on real annotations during training on Occluded-LINEMOD.

Pose Estimation

GarNet++: Improving Fast and Accurate Static 3D Cloth Draping by Curvature Loss

no code implementations 20 Jul 2020 Erhan Gundogdu, Victor Constantin, Shaifali Parashar, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua

We introduce a two-stream deep network model that produces a visually plausible draping of a template cloth on virtual 3D bodies by extracting features from both the body and garment shapes.

Volumetric Transformer Networks

no code implementations ECCV 2020 Seungryong Kim, Sabine Süsstrunk, Mathieu Salzmann

We design our VTN as an encoder-decoder network, with modules dedicated to letting the information flow across the feature channels, to account for the dependencies between the semantic parts.

Fine-Grained Image Recognition Image Retrieval +1

Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features

no code implementations 10 Jun 2020 Krishna Kanth Nakka, Mathieu Salzmann

In this paper, we identify the proximity of the latent representations of different classes in fine-grained recognition networks as a key factor to the success of adversarial attacks.

General Classification

Learning 3D-3D Correspondences for One-shot Partial-to-partial Registration

no code implementations 8 Jun 2020 Zheng Dang, Fei Wang, Mathieu Salzmann

While 3D-3D registration is traditionally tackled by optimization-based methods, recent work has shown that learning-based techniques could achieve faster and more robust results.

Domain Adaptive Multibranch Networks

no code implementations ICLR 2020 Róger Bermúdez-Chacón, Mathieu Salzmann, Pascal Fua

We tackle unsupervised domain adaptation by accounting for the fact that different domains may need to be processed differently to arrive at a common feature representation effective for recognition.

Unsupervised Domain Adaptation

ArTIST: Autoregressive Trajectory Inpainting and Scoring for Tracking

no code implementations 16 Apr 2020 Fatemeh Saleh, Sadegh Aliakbarian, Mathieu Salzmann, Stephen Gould

One of the core components in online multiple object tracking (MOT) frameworks is associating new detections with existing tracklets, typically done via a scoring function.

Human motion prediction motion prediction +1

Eigendecomposition-Free Training of Deep Networks for Linear Least-Square Problems

no code implementations 15 Apr 2020 Zheng Dang, Kwang Moo Yi, Yinlin Hu, Fei Wang, Pascal Fua, Mathieu Salzmann

In this paper, we introduce an eigendecomposition-free approach to training a deep network whose loss depends on the eigenvector corresponding to a zero eigenvalue of a matrix predicted by the network.

Denoising Pose Estimation

How to Train Your Super-Net: An Analysis of Training Heuristics in Weight-Sharing NAS

no code implementations 9 Mar 2020 Kaicheng Yu, Rene Ranftl, Mathieu Salzmann

Weight sharing promises to make neural architecture search (NAS) tractable even on commodity hardware.

Neural Architecture Search

Contextually Plausible and Diverse 3D Human Motion Prediction

no code implementations ICCV 2021 Sadegh Aliakbarian, Fatemeh Sadat Saleh, Lars Petersson, Stephen Gould, Mathieu Salzmann

We tackle the task of diverse 3D human motion prediction, that is, forecasting multiple plausible future 3D poses given a sequence of observed 3D poses.

Human motion prediction Image Captioning +1

Training Provably Robust Models by Polyhedral Envelope Regularization

1 code implementation 10 Dec 2019 Chen Liu, Mathieu Salzmann, Sabine Süsstrunk

Training certifiable neural networks enables one to obtain models with robustness guarantees against adversarial attacks.

Indirect Local Attacks for Context-aware Semantic Segmentation Networks

1 code implementation ECCV 2020 Krishna Kanth Nakka, Mathieu Salzmann

Recently, deep networks have achieved impressive semantic segmentation performance, in particular thanks to their use of larger contextual information.

Segmentation Semantic Segmentation

Using Depth for Pixel-Wise Detection of Adversarial Attacks in Crowd Counting

no code implementations 26 Nov 2019 Weizhe Liu, Mathieu Salzmann, Pascal Fua

State-of-the-art methods for counting people in crowded scenes rely on deep networks to estimate crowd density.

Crowd Counting Density Estimation

Estimating People Flows to Better Count Them in Crowded Scenes

1 code implementation ECCV 2020 Weizhe Liu, Mathieu Salzmann, Pascal Fua

In this paper, we advocate estimating people flows across image locations between consecutive images and inferring the people densities from these flows instead of directly regressing.

Optical Flow Estimation

Shape Reconstruction by Learning Differentiable Surface Representations

1 code implementation CVPR 2020 Jan Bednarik, Shaifali Parashar, Erhan Gundogdu, Mathieu Salzmann, Pascal Fua

Generative models that produce point clouds have emerged as a powerful tool to represent 3D surfaces, and the best current ones rely on learning an ensemble of parametric representations.

Single-Stage 6D Object Pose Estimation

1 code implementation CVPR 2020 Yinlin Hu, Pascal Fua, Wei Wang, Mathieu Salzmann

Second, training the deep network relies on a surrogate loss that does not directly reflect the final 6D pose estimation task.

6D Pose Estimation 6D Pose Estimation using RGB +1

Learning Trajectory Dependencies for Human Motion Prediction

5 code implementations ICCV 2019 Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li

In this paper, we propose a simple feed-forward deep network for motion prediction, which takes into account both temporal smoothness and spatial dependencies among human body joints.

Human motion prediction Human Pose Forecasting +2

Learning Variations in Human Motion via Mix-and-Match Perturbation

no code implementations 2 Aug 2019 Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Lars Petersson, Stephen Gould, Amirhossein Habibian

In this paper, we introduce an approach to stochastically combine the root of variations with previous pose information, which forces the model to take the noise into account.

Human motion prediction motion prediction

Self-supervised Training of Proposal-based Segmentation via Background Prediction

no code implementations 18 Jul 2019 Isinsu Katircioglu, Helge Rhodin, Victor Constantin, Jörg Spörri, Mathieu Salzmann, Pascal Fua

While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on.

Object object-detection +2

Recurrent U-Net for Resource-Constrained Segmentation

no code implementations ICCV 2019 Wei Wang, Kaicheng Yu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann

State-of-the-art segmentation methods rely on very deep networks that are not always easy to train without very large training datasets and tend to be relatively slow to run on standard GPUs.

Hand Segmentation Road Segmentation +1

Detecting the Unexpected via Image Resynthesis

3 code implementations ICCV 2019 Krzysztof Lis, Krishna Nakka, Pascal Fua, Mathieu Salzmann

In this paper, we tackle the more realistic scenario where unexpected objects of unknown classes can appear at test time.

Resynthesis Semantic Segmentation

Neural Scene Decomposition for Multi-Person Motion Capture

1 code implementation CVPR 2019 Helge Rhodin, Victor Constantin, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua

To this end, we introduce a self-supervised approach to learning what we call a neural scene decomposition (NSD) that can be exploited for 3D pose estimation.

3D Pose Estimation Instance Segmentation +1

Overcoming Multi-Model Forgetting

no code implementations ICLR 2019 Yassine Benyahia, Kaicheng Yu, Kamil Bennani-Smires, Martin Jaggi, Anthony Davison, Mathieu Salzmann, Claudiu Musat

We identify a phenomenon, which we refer to as multi-model forgetting, that occurs when sequentially training multiple deep networks with partially-shared parameters; the performance of previously-trained models degrades as one optimizes a subsequent one, due to the overwriting of shared parameters.

Neural Architecture Search

Interpretable BoW Networks for Adversarial Example Detection

no code implementations 8 Jan 2019 Krishna Kanth Nakka, Mathieu Salzmann

The reason behind the prediction for a new sample can then be interpreted by looking at the visual representation of the most highly activated codeword.

Generative Adversarial Network

Segmentation-driven 6D Object Pose Estimation

5 code implementations CVPR 2019 Yinlin Hu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann

The most recent trend in estimating the 6D pose of rigid objects has been to train deep networks to either directly regress the pose from the image or to predict the 2D locations of 3D keypoints, from which the pose can be obtained using a PnP algorithm.

6D Pose Estimation 6D Pose Estimation using RGB +3

Beyond One Glance: Gated Recurrent Architecture for Hand Segmentation

no code implementations 27 Nov 2018 Wei Wang, Kaicheng Yu, Joachim Hugonot, Pascal Fua, Mathieu Salzmann

As evidenced by our results on standard hand segmentation benchmarks and on our own dataset, our approach outperforms these other, simpler recurrent segmentation techniques, as well as the state-of-the-art hand segmentation one.

Hand Segmentation Mixed Reality +2

ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks

no code implementations NeurIPS 2020 Shuxuan Guo, Jose M. Alvarez, Mathieu Salzmann

As evidenced by our experiments, our approach outperforms both training the compact network from scratch and performing knowledge distillation from a teacher.

General Classification Image Classification +5

Tracing in 2D to Reduce the Annotation Effort for 3D Deep Delineation

1 code implementation 26 Nov 2018 Mateusz Koziński, Agata Mosinska, Mathieu Salzmann, Pascal Fua

The difficulty of obtaining annotations to build training databases still slows down the adoption of recent deep learning approaches for biomedical image analysis.

Context-Aware Crowd Counting

3 code implementations CVPR 2019 Weizhe Liu, Mathieu Salzmann, Pascal Fua

State-of-the-art methods for counting people in crowded scenes rely on deep networks to estimate crowd density.

Crowd Counting

Effective Use of Synthetic Data for Urban Scene Semantic Segmentation

no code implementations ECCV 2018 Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez

Our approach builds on the observation that foreground and background classes are not affected in the same manner by the domain shift, and thus should be treated differently.

Domain Adaptation Semantic Segmentation

Learning Factorized Representations for Open-set Domain Adaptation

no code implementations ICLR 2019 Mahsa Baktashmotlagh, Masoud Faraki, Tom Drummond, Mathieu Salzmann

To this end, we rely on the intuition that the source and target samples depicting the known classes can be generated by a shared subspace, whereas the target samples from unknown classes come from a different, private subspace.

Domain Adaptation

Deep Attentional Structured Representation Learning for Visual Recognition

no code implementations 14 May 2018 Krishna Kanth Nakka, Mathieu Salzmann

Structured representations, such as Bags of Words, VLAD and Fisher Vectors, have proven highly effective to tackle complex visual recognition tasks.

Representation Learning Scene Recognition

Geometry-aware Deep Network for Single-Image Novel View Synthesis

no code implementations CVPR 2018 Miaomiao Liu, Xuming He, Mathieu Salzmann

By contrast, in this paper, we propose to exploit the 3D geometry of the scene to synthesize a novel view.

Novel View Synthesis

Learning to Reconstruct Texture-less Deformable Surfaces from a Single View

1 code implementation 23 Mar 2018 Jan Bednařík, Pascal Fua, Mathieu Salzmann

Recent years have seen the development of mature solutions for reconstructing deformable surfaces from a single image, provided that they are relatively well-textured.

3D Reconstruction

Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based Losses

no code implementations ECCV 2018 Zheng Dang, Kwang Moo Yi, Yinlin Hu, Fei Wang, Pascal Fua, Mathieu Salzmann

Many classical Computer Vision problems, such as essential matrix computation and pose estimation from 3D to 2D correspondences, can be solved by finding the eigenvector corresponding to the smallest, or zero, eigenvalue of a matrix representing a linear system.

3D Pose Estimation

Statistically Motivated Second Order Pooling

1 code implementation 23 Jan 2018 Kaicheng Yu, Mathieu Salzmann

Our approach is motivated by a statistical analysis of the network's activations, relying on operations that lead to a Gaussian-distributed final representation, as inherently used by first-order deep networks.

Residual Parameter Transfer for Deep Domain Adaptation

no code implementations CVPR 2018 Artem Rozantsev, Mathieu Salzmann, Pascal Fua

The goal of Deep Domain Adaptation is to make it possible to use Deep Nets trained in one domain, where there is enough annotated training data, in another domain where there is little or none.

Domain Adaptation

Learning to Find Good Correspondences

3 code implementations CVPR 2018 Kwang Moo Yi, Eduard Trulls, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua

We develop a deep architecture to learn to find good correspondences for wide-baseline stereo.

Compression-aware Training of Deep Networks

no code implementations NeurIPS 2017 Jose M. Alvarez, Mathieu Salzmann

In recent years, great progress has been made in a variety of application domains thanks to the development of increasingly deeper neural networks.

Soft Correspondences in Multimodal Scene Parsing

no code implementations 28 Sep 2017 Sarah Taghavi Namin, Mohammad Najafi, Mathieu Salzmann, Lars Petersson

We propose to address this issue, by formulating multimodal semantic labeling as inference in a CRF and introducing latent nodes to explicitly model inconsistencies between two modalities.

Scene Parsing

Bringing Background into the Foreground: Making All Classes Equal in Weakly-supervised Video Semantic Segmentation

no code implementations ICCV 2017 Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez

Our experiments demonstrate the benefits of our classifier heatmaps and of our two-stream architecture on challenging urban scene datasets and on the YouTube-Objects benchmark, where we obtain state-of-the-art results.

Autonomous Navigation Segmentation +3

Adaptive Low-Rank Kernel Subspace Clustering

1 code implementation 17 Jul 2017 Pan Ji, Ian Reid, Ravi Garg, Hongdong Li, Mathieu Salzmann

In this paper, we present a kernel subspace clustering method that can handle non-linear models.

Clustering Image Clustering +1

Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference

no code implementations CVPR 2017 Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu

In particular, while some of them aim at segmenting the image into regions, such as object or surface instances, others aim at inferring the semantic labels of given regions, or their support relationships.

Instance Segmentation Scene Parsing +1

Imposing Hard Constraints on Deep Networks: Promises and Limitations

no code implementations 7 Jun 2017 Pablo Márquez-Neila, Mathieu Salzmann, Pascal Fua

Imposing constraints on the output of a Deep Neural Net is one way to improve the quality of its predictions while loosening the requirements for labeled training data.

Incorporating Network Built-in Priors in Weakly-supervised Semantic Segmentation

no code implementations 6 Jun 2017 Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez, Stephen Gould

We then show how to obtain multi-class masks by the fusion of foreground/background ones with information extracted from a weakly-supervised localization network.

Object Recognition Segmentation +3

Encouraging LSTMs to Anticipate Actions Very Early

1 code implementation ICCV 2017 Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Basura Fernando, Lars Petersson, Lars Andersson

In contrast to the widely studied problem of recognizing an action given a complete sequence, action anticipation aims to identify the action from only partially available videos.

Action Anticipation Autonomous Navigation

Second-order Convolutional Neural Networks

no code implementations 20 Mar 2017 Kaicheng Yu, Mathieu Salzmann

By performing linear combinations and element-wise nonlinear operations, these networks can be thought of as extracting solely first-order information from an input image.

Image Classification

Memory Efficient Max Flow for Multi-label Submodular MRFs

1 code implementation CVPR 2016 Thalaiyasingam Ajanthan, Richard Hartley, Mathieu Salzmann

Multi-label submodular Markov Random Fields (MRFs) have been shown to be solvable using max-flow based on an encoding of the labels proposed by Ishikawa, in which each variable $X_i$ is represented by $\ell$ nodes (where $\ell$ is the number of labels) arranged in a column.

Boundary-aware Instance Segmentation

no code implementations CVPR 2017 Zeeshan Hayder, Xuming He, Mathieu Salzmann

In this context, existing methods typically propose candidate objects, usually as bounding boxes, and directly predict a binary mask within each such proposal.

Instance Segmentation Object +3

Efficient Linear Programming for Dense CRFs

no code implementations CVPR 2017 Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar

To this end, we develop a proximal minimization framework, where the dual of each proximal problem is optimized via block coordinate descent.

Semantic Segmentation

Learning the Number of Neurons in Deep Networks

no code implementations NeurIPS 2016 Jose M. Alvarez, Mathieu Salzmann

In this paper, we introduce an approach to automatically determining the number of neurons in each layer of a deep network during learning.

Learning to Co-Generate Object Proposals With a Deep Structured Network

no code implementations CVPR 2016 Zeeshan Hayder, Xuming He, Mathieu Salzmann

In particular, we introduce a deep structured network that jointly predicts the objectness scores and the bounding box locations of multiple object candidates.

Object object-detection +2

Semantic-Aware Depth Super-Resolution in Outdoor Scenes

no code implementations 31 May 2016 Miaomiao Liu, Mathieu Salzmann, Xuming He

Despite much progress, state-of-the-art techniques suffer from two drawbacks: (i) they rely on the assumption that intensity edges coincide with depth discontinuities, which, unfortunately, is only true in controlled environments; and (ii) they typically exploit the availability of high-resolution training depth maps, which can often not be acquired in practice due to the sensors' limitations.

Super-Resolution

Dimensionality Reduction on SPD Manifolds: The Emergence of Geometry-Aware Methods

no code implementations 20 May 2016 Mehrtash Harandi, Mathieu Salzmann, Richard Hartley

This lets us formulate dimensionality reduction as the problem of finding a projection that yields a low-dimensional manifold either with maximum discriminative power in the supervised scenario, or with maximum variance of the data in the unsupervised one.

Dimensionality Reduction

Beyond Sharing Weights for Deep Domain Adaptation

no code implementations 21 Mar 2016 Artem Rozantsev, Mathieu Salzmann, Pascal Fua

To this end, we introduce a two-stream architecture, where one operates in the source domain and the other in the target domain.

Domain Adaptation Object Recognition

Robust Multi-body Feature Tracker: A Segmentation-free Approach

no code implementations CVPR 2016 Pan Ji, Hongdong Li, Mathieu Salzmann, Yiran Zhong

Feature tracking is a fundamental problem in computer vision, with applications in many computer vision tasks, such as visual SLAM and action recognition.

Action Recognition Motion Segmentation +2

Cutting Edge: Soft Correspondences in Multimodal Scene Parsing

no code implementations ICCV 2015 Sarah Taghavi Namin, Mohammad Najafi, Mathieu Salzmann, Lars Petersson

In this paper, we address the problem of data misalignment and label inconsistencies, e.g., due to moving objects, in semantic labeling, which violate the assumption of existing techniques.

Scene Parsing

Structural Kernel Learning for Large Scale Multiclass Object Co-Detection

no code implementations ICCV 2015 Zeeshan Hayder, Xuming He, Mathieu Salzmann

To exploit the correlations between objects, we build a fully-connected CRF on the candidates, which explicitly incorporates both geometric layout relations across object classes and similarity relations across multiple images.

Object object-detection +1

Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering

no code implementations CVPR 2016 Mohammad Najafi, Sarah Taghavi Namin, Mathieu Salzmann, Lars Petersson

By contrast, nonparametric approaches, which bypass any learning phase and directly transfer the labels from the training data to the query images, can readily exploit new labeled samples as they become available.

Scene Parsing Superpixels

Shape Interaction Matrix Revisited and Robustified: Efficient Subspace Clustering with Corrupted and Incomplete Data

1 code implementation ICCV 2015 Pan Ji, Mathieu Salzmann, Hongdong Li

The Shape Interaction Matrix (SIM) is one of the earliest approaches to performing subspace clustering (i.e., separating points drawn from a union of subspaces).

Clustering Face Clustering +1

When VLAD met Hilbert

no code implementations CVPR 2016 Mehrtash Harandi, Mathieu Salzmann, Fatih Porikli

Vectors of Locally Aggregated Descriptors (VLAD) have emerged as powerful image/video representations that compete with or even outperform state-of-the-art approaches on many challenging visual recognition tasks.

General Classification

Indoor Scene Structure Analysis for Single Image Depth Estimation

no code implementations CVPR 2015 Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu

We tackle the problem of single image depth estimation, which, without additional knowledge, suffers from many ambiguities.

Depth Estimation

Riemannian Coding and Dictionary Learning: Kernels to the Rescue

no code implementations CVPR 2015 Mehrtash Harandi, Mathieu Salzmann

While sparse coding on non-flat Riemannian manifolds has recently become increasingly popular, existing solutions either are dedicated to specific manifolds, or rely on optimization problems that are difficult to solve, especially when it comes to dictionary learning.

Dictionary Learning

Kernel Methods on the Riemannian Manifold of Symmetric Positive Definite Matrices

no code implementations CVPR 2013 Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

To encode the geometry of the manifold in the mapping, we introduce a family of provably positive definite kernels on the Riemannian manifold of SPD matrices.

Motion Segmentation Pedestrian Detection +2

Optimizing Over Radial Kernels on Compact Manifolds

no code implementations CVPR 2014 Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

We tackle the problem of optimizing over all possible positive definite radial kernels on Riemannian manifolds for classification.

General Classification

A Framework for Shape Analysis via Hilbert Space Embedding

no code implementations 13 Dec 2014 Sadeep Jayasumana, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

We propose a framework for 2D shape analysis using positive definite kernels defined on Kendall's shape manifold.

Clustering General Classification +1

Kernel Methods on Riemannian Manifolds with Gaussian RBF Kernels

no code implementations 30 Nov 2014 Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

We then use the proposed framework to identify positive definite kernels on two specific manifolds commonly encountered in computer vision: the Riemannian manifold of symmetric positive definite matrices and the Grassmann manifold, i.e., the Riemannian manifold of linear subspaces of a Euclidean space.

Iteratively Reweighted Graph Cut for Multi-label MRFs with Non-convex Priors

no code implementations CVPR 2015 Thalaiyasingam Ajanthan, Richard Hartley, Mathieu Salzmann, Hongdong Li

While widely acknowledged as highly effective in computer vision, multi-label MRFs with non-convex priors are difficult to optimize.

Kernel Coding: General Formulation and Special Cases

no code implementations 30 Aug 2014 Mehrtash Harandi, Mathieu Salzmann

In contrast, here, we study the problem of performing coding in a high-dimensional Hilbert space, where the classes are expected to be more easily separable.

Dictionary Learning

Expanding the Family of Grassmannian Kernels: An Embedding Perspective

no code implementations 4 Jul 2014 Mehrtash T. Harandi, Mathieu Salzmann, Sadeep Jayasumana, Richard Hartley, Hongdong Li

Modeling videos and image-sets as linear subspaces has proven beneficial for many visual recognition tasks.

Clustering

From Manifold to Manifold: Geometry-Aware Dimensionality Reduction for SPD Matrices

no code implementations 4 Jul 2014 Mehrtash T. Harandi, Mathieu Salzmann, Richard Hartley

In particular, we search for a projection that yields a low-dimensional manifold with maximum discriminative power encoded via an affinity-weighted similarity measure based on metrics on the manifold.

Dimensionality Reduction

Domain Adaptation on the Statistical Manifold

no code implementations CVPR 2014 Mahsa Baktashmotlagh, Mehrtash T. Harandi, Brian C. Lovell, Mathieu Salzmann

Here, we propose to make better use of the structure of this manifold and rely on the distance on the manifold to compare the source and target distributions.

Object Recognition Unsupervised Domain Adaptation

Continuous Inference in Graphical Models with Polynomial Energies

no code implementations CVPR 2013 Mathieu Salzmann

In this paper, we tackle the problem of performing inference in graphical models whose energy is a polynomial function of continuous variables.

Mirror Surface Reconstruction from a Single Image

no code implementations CVPR 2013 Miaomiao Liu, Richard Hartley, Mathieu Salzmann

In such conditions, our differential geometry analysis provides a theoretical proof that the shape of the mirror surface can be uniquely recovered if the pose of the reference target is known.

Surface Reconstruction

Factorized Latent Spaces with Structured Sparsity

no code implementations NeurIPS 2010 Yangqing Jia, Mathieu Salzmann, Trevor Darrell

Recent approaches to multi-view learning have shown that factorizing the information into parts that are shared across all views and parts that are private to each view could effectively account for the dependencies and independencies between the different input modalities.

MULTI-VIEW LEARNING Pose Estimation
