Search Results for author: Igor Gilitschenski

Found 37 papers, 12 papers with code

RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

no code implementations16 Apr 2024 Ashkan Mirzaei, Riccardo de Lutio, Seung Wook Kim, David Acuna, Jonathan Kelly, Sanja Fidler, Igor Gilitschenski, Zan Gojcic

In this work, we propose an approach for 3D scene inpainting -- the task of coherently replacing parts of the reconstructed scene with desired content.

3D Inpainting Image Inpainting

Producing and Leveraging Online Map Uncertainty in Trajectory Prediction

1 code implementation25 Mar 2024 Xunjiang Gu, Guanyu Song, Igor Gilitschenski, Marco Pavone, Boris Ivanovic

High-definition (HD) maps have played an integral role in the development of modern autonomous vehicle (AV) stacks, albeit with high associated labeling and maintenance costs.

Trajectory Forecasting

Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence

no code implementations9 Mar 2024 Marcel Hussing, Claas Voelcker, Igor Gilitschenski, Amir-Massoud Farahmand, Eric Eaton

We show that deep reinforcement learning can maintain its ability to learn without resetting network parameters in settings where the number of gradient updates greatly exceeds the number of environment samples.

Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations

no code implementations27 Oct 2023 Tristan Aumentado-Armstrong, Ashkan Mirzaei, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

The resulting latent-space NeRF can produce novel views with higher quality than standard colour-space NeRFs, as the AE can correct certain visual artifacts, while rendering over three times faster.

Continual Learning Novel View Synthesis

Watch Your Steps: Local Image and Scene Editing by Text Instructions

no code implementations17 Aug 2023 Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski

A field is trained on relevance maps of training views, denoted as the relevance field, defining the 3D region within which modifications should be made.

Denoising Image Generation

trajdata: A Unified Interface to Multiple Human Trajectory Datasets

2 code implementations NeurIPS 2023 Boris Ivanovic, Guanyu Song, Igor Gilitschenski, Marco Pavone

The field of trajectory forecasting has grown significantly in recent years, partially owing to the release of numerous large-scale, real-world human trajectory datasets for autonomous vehicles (AVs) and pedestrian motion tracking.

Autonomous Vehicles Motion Forecasting +1

$λ$-models: Effective Decision-Aware Reinforcement Learning with Latent Models

no code implementations30 Jun 2023 Claas A Voelcker, Arash Ahmadian, Romina Abachi, Igor Gilitschenski, Amir-Massoud Farahmand

The idea of decision-aware model learning, that models should be accurate where it matters for decision-making, has gained prominence in model-based reinforcement learning.

Continuous Control Decision Making +2

EventCLIP: Adapting CLIP for Event-based Object Recognition

1 code implementation10 Jun 2023 Ziyi Wu, Xudong Liu, Igor Gilitschenski

Recent advances in zero-shot and few-shot classification heavily rely on the success of pre-trained vision-language models (VLMs) such as CLIP.

Few-Shot Learning Object +3

SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models

no code implementations NeurIPS 2023 Ziyi Wu, Jingyu Hu, Wuyue Lu, Igor Gilitschenski, Animesh Garg

Finally, we demonstrate the scalability of SlotDiffusion to unconstrained real-world datasets such as PASCAL VOC and COCO, when integrated with self-supervised pre-trained image encoders.

Image Generation Object +5

CAMM: Building Category-Agnostic and Animatable 3D Models from Monocular Videos

1 code implementation14 Apr 2023 Tianshu Kuai, Akash Karthikeyan, Yash Kant, Ashkan Mirzaei, Igor Gilitschenski

Animating an object in 3D often requires an articulated structure, e. g. a kinematic chain or skeleton of the manipulated object with proper skinning weights, to obtain smooth movements and surface deformations.

Object Surface Reconstruction

Invertible Neural Skinning

no code implementations CVPR 2023 Yash Kant, Aliaksandr Siarohin, Riza Alp Guler, Menglei Chai, Jian Ren, Sergey Tulyakov, Igor Gilitschenski

Next, we combine PIN with a differentiable LBS module to build an expressive and end-to-end Invertible Neural Skinning (INS) pipeline.

Building Scalable Video Understanding Benchmarks through Sports

no code implementations17 Jan 2023 Aniket Agarwal, Alex Zhang, Karthik Narasimhan, Igor Gilitschenski, Vishvak Murahari, Yash Kant

Our human studies indicate that ASAP can align videos and annotations with high fidelity, precision, and speed.

Video Understanding

SparsePose: Sparse-View Camera Pose Regression and Refinement

no code implementations CVPR 2023 Samarth Sinha, Jason Y. Zhang, Andrea Tagliasacchi, Igor Gilitschenski, David B. Lindell

Camera pose estimation is a key step in standard 3D reconstruction pipelines that operate on a dense set of images of a single object or scene.

3D Reconstruction Pose Estimation +1

Solving Continuous Control via Q-learning

1 code implementation22 Oct 2022 Tim Seyde, Peter Werner, Wilko Schwarting, Igor Gilitschenski, Martin Riedmiller, Daniela Rus, Markus Wulfmeier

While there has been substantial success for solving continuous control with actor-critic methods, simpler critic-only methods such as Q-learning find limited application in the associated high-dimensional action spaces.

Continuous Control Multi-agent Reinforcement Learning +1

LaTeRF: Label and Text Driven Object Radiance Fields

no code implementations4 Jul 2022 Ashkan Mirzaei, Yash Kant, Jonathan Kelly, Igor Gilitschenski

In this paper we introduce LaTeRF, a method for extracting an object of interest from a scene given 2D images of the entire scene, known camera poses, a natural language description of the object, and a set of point-labels of object and non-object points in the input images.

Object

Housekeep: Tidying Virtual Households using Commonsense Reasoning

1 code implementation22 May 2022 Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal

Instead, the agent must learn from and is evaluated against human preferences of which objects belong where in a tidy house.

Language Modelling Large Language Model

Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space

1 code implementation19 Feb 2021 Wilko Schwarting, Tim Seyde, Igor Gilitschenski, Lucas Liebenwein, Ryan Sander, Sertac Karaman, Daniela Rus

We demonstrate the effectiveness of our algorithm in learning competitive behaviors on a novel multi-agent racing benchmark that requires planning from image observations.

Reinforcement Learning (RL)

Deep Orientation Uncertainty Learning based on a Bingham Loss

1 code implementation ICLR 2020 Igor Gilitschenski, Roshni Sahoo, Wilko Schwarting, Alexander Amini, Sertac Karaman, Daniela Rus

Reasoning about uncertain orientations is one of the core problems in many perception tasks such as object pose estimation or motion estimation.

Motion Estimation Pose Estimation

SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks

2 code implementations11 Oct 2019 Cenk Baykal, Lucas Liebenwein, Igor Gilitschenski, Dan Feldman, Daniela Rus

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy.

LandmarkBoost: Efficient Visual Context Classifiers for Robust Localization

no code implementations12 Jul 2018 Marcin Dymczyk, Igor Gilitschenski, Juan Nieto, Simon Lynen, Bernhard Zeisl, Roland Siegwart

We propose LandmarkBoost - an approach that, in contrast to the conventional 2D-3D matching methods, casts the search problem as a landmark classification task.

Pose Retrieval Retrieval

Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds

no code implementations ICLR 2019 Cenk Baykal, Lucas Liebenwein, Igor Gilitschenski, Dan Feldman, Daniela Rus

We present an efficient coresets-based neural network compression algorithm that sparsifies the parameters of a trained fully-connected neural network in a manner that provably approximates the network's output.

Generalization Bounds Neural Network Compression

Directional Statistics and Filtering Using libDirectional

no code implementations28 Dec 2017 Gerhard Kurz, Igor Gilitschenski, Florian Pfaff, Lukas Drude, Uwe D. Hanebeck, Reinhold Haeb-Umbach, Roland Y. Siegwart

In this paper, we present libDirectional, a MATLAB library for directional statistics and directional estimation.

maplab: An Open Framework for Research in Visual-inertial Mapping and Localization

1 code implementation28 Nov 2017 Thomas Schneider, Marcin Dymczyk, Marius Fehr, Kevin Egger, Simon Lynen, Igor Gilitschenski, Roland Siegwart

On the other hand, maplab provides the research community with a collection of multisession mapping tools that include map merging, visual-inertial batch optimization, and loop closure.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.