Search Results for author: Laura Leal-Taixé

Found 64 papers, 35 papers with code

SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow

no code implementations17 Apr 2024 Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

Increasing the annotation efficiency of trajectory annotations from videos has the potential to enable the next generation of data-hungry tracking algorithms to thrive on large-scale datasets.

Better Call SAL: Towards Learning to Segment Anything in Lidar

no code implementations19 Mar 2024 Aljoša Ošep, Tim Meinhardt, Francesco Ferroni, Neehar Peri, Deva Ramanan, Laura Leal-Taixé

We propose $\texttt{SAL}$ ($\texttt{S}$egment $\texttt{A}$nything in $\texttt{L}$idar) method consisting of a text-promptable zero-shot model for segmenting and classifying any object in Lidar, and a pseudo-labeling engine that facilitates model training without manual supervision.

Panoptic Segmentation

The NeRFect Match: Exploring NeRF Features for Visual Localization

no code implementations14 Mar 2024 Qunjie Zhou, Maxim Maximov, Or Litany, Laura Leal-Taixé

Significantly, we introduce NeRFMatch, an advanced 2D-3D matching function that capitalizes on the internal knowledge of NeRF learned via view synthesis.

regression Visual Localization

SeMoLi: What Moves Together Belongs Together

no code implementations29 Feb 2024 Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni, Simon Lucey, Laura Leal-Taixé

Recent results suggest that heuristic-based clustering methods in conjunction with object trackers can be used to pseudo-label instances of moving objects and use these as supervisory signals to train 3D object detectors in Lidar data without manual supervision.

Clustering Object +5

Staged Contact-Aware Global Human Motion Forecasting

1 code implementation16 Sep 2023 Luca Scofano, Alessio Sampieri, Elisabeth Schiele, Edoardo De Matteis, Laura Leal-Taixé, Fabio Galasso

So far, only Mao et al. NeurIPS'22 have addressed scene-aware global motion, cascading the prediction of future scene contact points and the global motion estimation.

Human Pose Forecasting Motion Estimation +2

G-MSM: Unsupervised Multi-Shape Matching with Graph-based Affinity Priors

no code implementations CVPR 2023 Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers

We present G-MSM (Graph-based Multi-Shape Matching), a novel unsupervised learning approach for non-rigid shape correspondence.

Unifying Short and Long-Term Tracking with Graph Hierarchies

1 code implementation CVPR 2023 Orcun Cetintas, Guillem Brasó, Laura Leal-Taixé

Tracking objects over long videos effectively means solving a spectrum of problems, from short-term association for un-occluded objects to long-term association for objects that are occluded and then reappear in the scene.

Multiple Object Tracking

Soft Augmentation for Image Classification

1 code implementation CVPR 2023 Yang Liu, Shen Yan, Laura Leal-Taixé, James Hays, Deva Ramanan

We draw inspiration from human visual classification studies and propose generalizing augmentation with invariant transforms to soft augmentation where the learning target softens non-linearly as a function of the degree of the transform applied to the sample: e. g., more aggressive image crop augmentations produce less confident learning targets.

Classification Data Augmentation +1

Learning to Discover and Detect Objects

1 code implementation19 Oct 2022 Vladimir Fomenko, Ismail Elezi, Deva Ramanan, Laura Leal-Taixé, Aljoša Ošep

We then train our network to learn to classify each RoI, either as one of the known classes, seen in the source dataset, or one of the novel classes, with a long-tail distribution constraint on the class assignments, reflecting the natural frequency of classes in the real world.

Novel Class Discovery Novel Object Detection +3

Quo Vadis: Is Trajectory Forecasting the Key Towards Long-Term Multi-Object Tracking?

1 code implementation14 Oct 2022 Patrick Dendorfer, Vladimir Yugay, Aljoša Ošep, Laura Leal-Taixé

While we have significantly advanced short-term tracking performance, bridging longer occlusion gaps remains elusive: state-of-the-art object trackers only bridge less than 10% of occlusions longer than three seconds.

Multi-Object Tracking Trajectory Forecasting

PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking?

no code implementations3 Aug 2022 Aleksandr Kim, Guillem Brasó, Aljoša Ošep, Laura Leal-Taixé

This allows our graph neural network to learn to effectively encode temporal and spatial interactions and fully leverage contextual and motion cues to obtain final scene interpretation by posing data association as edge classification.

3D Multi-Object Tracking Edge Classification

DeVIS: Making Deformable Transformers Work for Video Instance Segmentation

1 code implementation22 Jul 2022 Adrià Caelles, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

To reason about all VIS subtasks jointly over multiple frames, we present temporal multi-scale deformable attention with instance-aware object queries.

Instance Segmentation object-detection +4

A Unified Framework for Implicit Sinkhorn Differentiation

1 code implementation CVPR 2022 Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Florian Bernard, Daniel Cremers

The Sinkhorn operator has recently experienced a surge of popularity in computer vision and related fields.

Is Geometry Enough for Matching in Visual Localization?

1 code implementation24 Mar 2022 Qunjie Zhou, Sérgio Agostinho, Aljosa Osep, Laura Leal-Taixé

In this paper, we propose to go beyond the well-established approach to vision-based localization that relies on visual descriptor matching between a query image and a 3D point cloud.

Visual Localization

Opening Up Open World Tracking

no code implementations CVPR 2022 Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé

A benchmark that would allow us to perform an apple-to-apple comparison of existing efforts is a crucial first step towards advancing this important research field.

Ranked #3 on Open-World Video Segmentation on BURST-val (using extra training data)

Multi-Object Tracking Object +1

The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

1 code implementation ICCV 2021 Guillem Brasó, Nikita Kister, Laura Leal-Taixé

We introduce CenterGroup, an attention-based framework to estimate human poses from a set of identity-agnostic keypoints and person center predictions in an image.

Clustering Multi-Person Pose Estimation

Spatial Context Awareness for Unsupervised Change Detection in Optical Satellite Images

no code implementations5 Oct 2021 Lukas Kondmann, Aysim Toker, Sudipan Saha, Bernhard Schölkopf, Laura Leal-Taixé, Xiao Xiang Zhu

It uses this model to analyze differences in the pixel and its spatial context-based predictions in subsequent time periods for change detection.

Change Detection Earth Observation

Scalable Sinkhorn Backpropagation

no code implementations29 Sep 2021 Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Florian Bernard, Daniel Cremers

Our main contribution is deriving a simple and efficient algorithm that performs this backward pass in closed form.

Rolling Shutter Correction

(Just) A Spoonful of Refinements Helps the Registration Error Go Down

1 code implementation ICCV 2021 Sérgio Agostinho, Aljoša Ošep, Alessio Del Bue, Laura Leal-Taixé

However, given the initial rotation estimate supplied by Kabsch, we show we can improve point correspondence learning during model training by extending the original optimization problem.

Point Cloud Registration

EagerMOT: 3D Multi-Object Tracking via Sensor Fusion

3 code implementations29 Apr 2021 Aleksandr Kim, Aljoša Ošep, Laura Leal-Taixé

Multi-object tracking (MOT) enables mobile robots to perform well-informed motion planning and navigation by localizing surrounding objects in 3D space and time.

3D Multi-Object Tracking Motion Planning +4

Opening up Open-World Tracking

no code implementations22 Apr 2021 Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé

We hope to open a new front in multi-object tracking research that will hopefully bring us a step closer to intelligent systems that can operate safely in the real world.

Multi-Object Tracking Object

Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization

1 code implementation CVPR 2021 Aysim Toker, Qunjie Zhou, Maxim Maximov, Laura Leal-Taixé

The goal of cross-view image based geo-localization is to determine the location of a given street view image by matching it against a collection of geo-tagged satellite images.

Image Generation Retrieval

4D Panoptic LiDAR Segmentation

1 code implementation CVPR 2021 Mehmet Aygün, Aljoša Ošep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixé

In this paper, we propose 4D panoptic LiDAR segmentation to assign a semantic class and a temporally-consistent instance ID to a sequence of 3D points.

4D Panoptic Segmentation Benchmarking +4

Learning Intra-Batch Connections for Deep Metric Learning

2 code implementations15 Feb 2021 Jenny Seidenschwarz, Ismail Elezi, Laura Leal-Taixé

To this end, we propose an approach based on message passing networks that takes all the relations in a mini-batch into account.

Clustering Image Retrieval +2

Deep Shells: Unsupervised Shape Correspondence with Optimal Transport

1 code implementation NeurIPS 2020 Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers

We propose a novel unsupervised learning approach to 3D shape correspondence that builds a multiscale matching pipeline into a deep neural network.

MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking

no code implementations15 Oct 2020 Patrick Dendorfer, Aljoša Ošep, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth, Laura Leal-Taixé

We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data, and create a framework for the standardized evaluation of multiple object tracking methods.

Multiple Object Tracking Multiple People Tracking +3

Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation

2 code implementations2 Oct 2020 Patrick Dendorfer, Aljoša Ošep, Laura Leal-Taixé

Inspired by human navigation, we model the task of trajectory prediction as an intuitive two-stage process: (i) goal estimation, which predicts the most likely target positions of the agent, followed by a (ii) routing module which estimates a set of plausible trajectories that route towards the estimated goal.

Position Trajectory Prediction

Making a Case for 3D Convolutions for Object Segmentation in Videos

1 code implementation26 Aug 2020 Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe

On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involving dense per-pixel interpretation of videos compared to their 2D convolutional counterparts and lag behind the aforementioned networks in terms of performance.

Segmentation Semantic Segmentation +5

Focus on defocus: bridging the synthetic to real domain gap for depth estimation

1 code implementation CVPR 2020 Maxim Maximov, Kevin Galim, Laura Leal-Taixé

We are able to train our model completely on synthetic data and directly apply it to a wide range of real-world images.

 Ranked #1 on Depth Estimation on NYU-Depth V2 (RMSE metric)

Depth Estimation Depth Prediction

CIAGAN: Conditional Identity Anonymization Generative Adversarial Networks

1 code implementation CVPR 2020 Maxim Maximov, Ismail Elezi, Laura Leal-Taixé

In many real-world scenarios like people tracking or action recognition, it is important to be able to process the data while taking careful consideration in protecting people's identity.

Action Recognition De-identification +1

Planning from Images with Deep Latent Gaussian Process Dynamics

1 code implementation L4DC 2020 Nathanael Bosch, Jan Achterhold, Laura Leal-Taixé, Jörg Stückler

We propose to learn a deep latent Gaussian process dynamics (DLGPD) model that learns low-dimensional system dynamics from environment interactions with visual observations.

Gaussian Processes Transfer Learning

MOT20: A benchmark for multi object tracking in crowded scenes

1 code implementation19 Mar 2020 Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of multiple object tracking methods.

Multi-Object Tracking Multiple Object Tracking with Transformer +2

Learn to Predict Sets Using Feed-Forward Neural Networks

no code implementations30 Jan 2020 Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

In our formulation we define a likelihood for a set distribution represented by a) two discrete distributions defining the set cardinally and permutation variables, and b) a joint distribution over set elements with a fixed cardinality.

Multi-Label Image Classification object-detection +1

Learning a Neural Solver for Multiple Object Tracking

2 code implementations16 Dec 2019 Guillem Brasó, Laura Leal-Taixé

Graphs offer a natural way to formulate Multiple Object Tracking (MOT) within the tracking-by-detection paradigm.

Multi-Object Tracking Multiple Object Tracking +1

HistoNet: Predicting size histograms of object instances

1 code implementation11 Dec 2019 Kishan Sharma, Moritz Gold, Christian Zurbruegg, Laura Leal-Taixé, Jan Dirk Wegner

Our method results in an overall improvement in the count and size distribution prediction as compared to state-of-the-art instance segmentation method Mask R-CNN.

Instance Segmentation Object +2

Towards Generalizing Sensorimotor Control Across Weather Conditions

no code implementations25 Jul 2019 Qadeer Khan, Patrick Wenzel, Daniel Cremers, Laura Leal-Taixé

The ability of deep learning models to generalize well across different scenarios depends primarily on the quality and quantity of annotated data.

Image-to-Image Translation Translation

Learning Temporal Coherence via Self-Supervision for GAN-based Video Generation

13 code implementations23 Nov 2018 Mengyu Chu, You Xie, Jonas Mayer, Laura Leal-Taixé, Nils Thuerey

Additionally, we propose a first set of metrics to quantitatively evaluate the accuracy as well as the perceptual quality of the temporal evolution.

Image Super-Resolution Motion Compensation +3

Modular Vehicle Control for Transferring Semantic Information Between Weather Conditions Using GANs

1 code implementation3 Jul 2018 Patrick Wenzel, Qadeer Khan, Daniel Cremers, Laura Leal-Taixé

To this end, we propose to divide the task of vehicle control into two independent modules: a control module which is only trained on one weather condition for which labeled steering data is available, and a perception module which is used as an interface between new weather conditions and the fixed control module.

Generative Adversarial Network Self-Driving Cars

Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks

no code implementations ICLR 2019 S. Hamid Rezatofighi, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Daniel Cremers, Laura Leal-Taixé, Ian Reid

We demonstrate the validity of this new formulation on two relevant vision problems: object detection, for which our formulation outperforms state-of-the-art detectors such as Faster R-CNN and YOLO, and a complex CAPTCHA test, where we observe that, surprisingly, our set based network acquired the ability of mimicking arithmetics without any rules being coded.

object-detection Object Detection

Deep Appearance Maps

no code implementations ICCV 2019 Maxim Maximov, Laura Leal-Taixé, Mario Fritz, Tobias Ritschel

Second, we demonstrate how another network can be used to map from an image or video frames to a DAM network to reproduce this appearance, without using a lengthy optimization such as stochastic gradient descent (learning-to-learn).

Video Object Segmentation Without Temporal Information

no code implementations18 Sep 2017 Kevis-Kokitsi Maninis, Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames.

Foreground Segmentation Object +5

Fusion of Head and Full-Body Detectors for Multi-Object Tracking

no code implementations23 May 2017 Roberto Henschel, Laura Leal-Taixé, Daniel Cremers, Bodo Rosenhahn

In order to track all persons in a scene, the tracking-by-detection paradigm has proven to be a very effective approach.

Multi-Object Tracking

Deep Depth From Focus

5 code implementations4 Apr 2017 Caner Hazirbas, Sebastian Georg Soyer, Maximilian Christian Staab, Laura Leal-Taixé, Daniel Cremers

Depth from focus (DFF) is one of the classical ill-posed inverse problems in computer vision.

Depth Estimation

One-Shot Video Object Segmentation

8 code implementations CVPR 2017 Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

This paper tackles the task of semi-supervised video object segmentation, i. e., the separation of an object from the background in a video, given the mask of the first frame.

Foreground Segmentation Object +4

Tracking with multi-level features

no code implementations25 Jul 2016 Roberto Henschel, Laura Leal-Taixé, Bodo Rosenhahn, Konrad Schindler

We present a novel formulation of the multiple object tracking problem which integrates low and mid-level features.

Clustering Multiple Object Tracking

Learning by tracking: Siamese CNN for robust target association

no code implementations26 Apr 2016 Laura Leal-Taixé, Cristian Canton Ferrer, Konrad Schindler

This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections.

Multiple People Tracking Optical Flow Estimation +1

MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking

2 code implementations8 Apr 2015 Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, Konrad Schindler

We discuss the challenges of creating such a framework, collecting existing and new data, gathering state-of-the-art methods to be tested on the datasets, and finally creating a unified evaluation system.

3D Reconstruction Multiple Object Tracking +3

Multiple object tracking with context awareness

no code implementations24 Nov 2014 Laura Leal-Taixé

Multiple people tracking is a key problem for many applications such as surveillance, animation or car navigation, and a key input for tasks such as activity recognition.

Activity Recognition Multiple Object Tracking +2

Cannot find the paper you are looking for? You can Submit a new open access paper.