Search Results for author: Martin Danelljan

Found 99 papers, 68 papers with code

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

1 code implementation17 Sep 2024 Siyuan Li, Lei Ke, Yung-Hsu Yang, Luigi Piccinelli, Mattia Segù, Martin Danelljan, Luc van Gool

Due to the complexity of motion patterns in the large-vocabulary scenarios and unstable classification of the novel objects, the motion and semantics cues are either ignored or applied based on heuristics in the final matching steps by existing methods.

Multiple Object Tracking

Matching Anything by Segmenting Anything

1 code implementation CVPR 2024 Siyuan Li, Lei Ke, Martin Danelljan, Luigi Piccinelli, Mattia Segu, Luc van Gool, Fisher Yu

The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT).

Domain Generalization Multiple Object Tracking +2

Analyzing Local Representations of Self-supervised Vision Transformers

no code implementations31 Dec 2023 Ani Vanyan, Alvard Barseghyan, Hakob Tamazyan, Vahan Huroyan, Hrant Khachatrian, Martin Danelljan

In this paper, we present a comparative analysis of various self-supervised Vision Transformers (ViTs), focusing on their local representative power.

Contrastive Learning Few-Shot Semantic Segmentation +2

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

1 code implementation1 Dec 2023 Mingqiao Ye, Martin Danelljan, Fisher Yu, Lei Ke

To address this issue, we propose Gaussian Grouping, which extends Gaussian Splatting to jointly reconstruct and segment anything in open-world 3D scenes.

Colorization Novel View Synthesis +3

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

1 code implementation6 Aug 2023 Chunming He, Kai Li, Yachao Zhang, Yulun Zhang, Zhenhua Guo, Xiu Li, Martin Danelljan, Fisher Yu

On the prey side, we propose an adversarial training framework, Camouflageator, which introduces an auxiliary generator to generate more camouflaged objects that are harder for a COD method to detect.

object-detection Object Detection

Cascade-DETR: Delving into High-Quality Universal Object Detection

1 code implementation ICCV 2023 Mingqiao Ye, Lei Ke, Siyuan Li, Yu-Wing Tai, Chi-Keung Tang, Martin Danelljan, Fisher Yu

While dominating on the COCO benchmark, recent Transformer-based detection methods are not competitive in diverse domains.

Decoder Object +3

Prompting Diffusion Representations for Cross-Domain Semantic Segmentation

no code implementations5 Jul 2023 Rui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Luc van Gool

Intrigued by this result, we set out to explore how well diffusion-pretrained representations generalize to new domains, a crucial ability for any representation.

Domain Generalization Image Generation +2

Segment Anything Meets Point Tracking

1 code implementation3 Jul 2023 Frano Rajič, Lei Ke, Yu-Wing Tai, Chi-Keung Tang, Martin Danelljan, Fisher Yu

The Segment Anything Model (SAM) has established itself as a powerful zero-shot image segmentation model, enabled by efficient point-centric annotation and prompt-based models.

Interactive Video Object Segmentation Object +5

StyleGenes: Discrete and Efficient Latent Distributions for GANs

no code implementations30 Apr 2023 Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool

Thus, by independently sampling a variant for each gene and combining them into the final latent vector, our approach can represent a vast number of unique latent samples from a compact set of learnable parameters.

Disentanglement Diversity

NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions

1 code implementation22 Mar 2023 Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc van Gool

Pose-conditioned convolutional generative models struggle with high-quality 3D-consistent image generation from single-view datasets, due to their lack of sufficient 3D priors.

Image Generation Inductive Bias

How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

1 code implementation7 Feb 2023 Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

We then employ our benchmark to evaluate many of the most common uncertainty estimation methods, as well as two state-of-the-art uncertainty scores from the task of out-of-distribution detection.

Out-of-Distribution Detection regression

Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation With Implicit Neural Representations

no code implementations CVPR 2023 Rui Gong, Qin Wang, Martin Danelljan, Dengxin Dai, Luc van Gool

Unsupervised domain adaptation (UDA) for semantic segmentation aims at improving the model performance on the unlabeled target domain by leveraging a labeled source domain.

Pseudo Label Semantic Segmentation +1

Beyond SOT: Tracking Multiple Generic Objects at Once

1 code implementation22 Dec 2022 Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc van Gool, Alina Kuznetsova

Our approach achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark.

Attribute Object +1

Fast Hierarchical Learning for Few-Shot Object Detection

no code implementations10 Oct 2022 Yihang She, Goutam Bhat, Martin Danelljan, Fisher Yu

These approaches however suffer from ``catastrophic forgetting'' issue due to finetuning of base detector, leading to sub-optimal performance on the base classes.

Few-Shot Object Detection Object +2

ManiFlow: Implicitly Representing Manifolds with Normalizing Flows

no code implementations18 Aug 2022 Janis Postels, Martin Danelljan, Luc van Gool, Federico Tombari

In contrast to prior work, we approach this problem by generating samples from the original data distribution given full knowledge about the perturbed distribution and the noise model.

Surface Reconstruction

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility

1 code implementation14 Aug 2022 Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Akshay Dudhane, Martin Danelljan, Hisham Cholakkal, Salman Khan, Luc van Gool, Fahad Shahbaz Khan

While being greatly benefiting to the tracking research, existing benchmarks do not pose the same difficulty as before with recent trackers achieving higher performance mainly due to (i) the introduction of more sophisticated transformers-based methods and (ii) the lack of diverse scenarios with adverse visibility such as, severe weather conditions, camouflage and imaging effects.

Visual Object Tracking Visual Tracking

Video Mask Transfiner for High-Quality Video Instance Segmentation

1 code implementation28 Jul 2022 Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

While Video Instance Segmentation (VIS) has seen rapid progress, current approaches struggle to predict high-quality masks with accurate boundary details.

Instance Segmentation Semantic Segmentation +2

Tracking Every Thing in the Wild

1 code implementation26 Jul 2022 Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu

Our experiments show that TETA evaluates trackers more comprehensively, and TETer achieves significant improvements on the challenging large-scale datasets BDD100K and TAO compared to the state-of-the-art.

Benchmarking Classification +2

Arbitrary-Scale Image Synthesis

1 code implementation CVPR 2022 Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool

Positional encodings have enabled recent works to train a single adversarial network that can generate images of different scales.

Image Generation

Transforming Model Prediction for Tracking

1 code implementation CVPR 2022 Christoph Mayer, Martin Danelljan, Goutam Bhat, Matthieu Paul, Danda Pani Paudel, Fisher Yu, Luc van Gool

Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function.

Inductive Bias Visual Object Tracking

Robust Visual Tracking by Segmentation

2 code implementations21 Mar 2022 Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc van Gool

We infer a bounding box from the segmentation mask, validate our tracker on challenging tracking datasets and achieve the new state of the art on LaSOT with a success AUC score of 69. 7%.

Decoder Segmentation +5

Transform your Smartphone into a DSLR Camera: Learning the ISP in the Wild

no code implementations20 Mar 2022 Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc van Gool

We propose a trainable Image Signal Processing (ISP) framework that produces DSLR quality images given RAW images captured by a smartphone.

Motion Estimation

Adiabatic Quantum Computing for Multi Object Tracking

no code implementations CVPR 2022 Jan-Nico Zaech, Alexander Liniger, Martin Danelljan, Dengxin Dai, Luc van Gool

Multi-Object Tracking (MOT) is most often approached in the tracking-by-detection paradigm, where object detections are associated through time.

Multi-Object Tracking Object

Fast Online Video Super-Resolution with Deformable Attention Pyramid

no code implementations3 Feb 2022 Dario Fuoli, Martin Danelljan, Radu Timofte, Luc van Gool

Our DAP aligns and integrates information from the recurrent state into the current frame prediction.

Video Super-Resolution

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

3 code implementations CVPR 2022 Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc van Gool

In this work, we propose RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks.

Denoising Image Inpainting

Collapse by Conditioning: Training Class-conditional GANs with Limited Data

1 code implementation ICLR 2022 Mohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc van Gool

On the contrary, we observe that class-conditioning causes mode collapse in limited data settings, where unconditional learning leads to satisfactory generative ability.

Generative Adversarial Network

Efficient Visual Tracking with Exemplar Transformers

2 code implementations17 Dec 2021 Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc van Gool

E. T. Track, our visual tracker that incorporates Exemplar Transformer modules, runs at 47 FPS on a CPU.

Visual Object Tracking Visual Tracking

Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution

no code implementations5 Nov 2021 Andreas Lugmayr, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte

Super-resolution is an ill-posed problem, where a ground-truth high-resolution image represents only one possibility in the space of plausible solutions.

Super-Resolution

Learning Proposals for Practical Energy-Based Regression

1 code implementation22 Oct 2021 Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression.

regression

Dense Gaussian Processes for Few-Shot Segmentation

1 code implementation7 Oct 2021 Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan

Given the support set, our dense GP learns the mapping from local deep image features to mask values, capable of capturing complex appearance distributions.

Decoder Few-Shot Semantic Segmentation +2

PDC-Net+: Enhanced Probabilistic Dense Correspondence Network

1 code implementation28 Sep 2021 Prune Truong, Martin Danelljan, Radu Timofte, Luc van Gool

In order to apply dense methods to real-world applications, such as pose estimation, image manipulation, or 3D reconstruction, it is therefore crucial to estimate the confidence of the predicted matches.

3D Reconstruction Geometric Matching +6

TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation

1 code implementation10 Sep 2021 Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc van Gool

In many real-world settings, the target domain task requires a different taxonomy than the one imposed by the source domain.

Contrastive Learning Domain Adaptation +1

Deep Reparametrization of Multi-Frame Super-Resolution and Denoising

2 code implementations ICCV 2021 Goutam Bhat, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte

The deep reparametrization allows us to directly model the image formation process in the latent space, and to integrate learned image priors into the prediction.

Burst Image Super-Resolution Denoising +2

Learnable Online Graph Representations for 3D Multi-Object Tracking

no code implementations23 Apr 2021 Jan-Nico Zaech, Dengxin Dai, Alexander Liniger, Martin Danelljan, Luc van Gool

Tracking of objects in 3D is a fundamental task in computer vision that finds use in a wide range of applications such as autonomous driving, robotics or augmented reality.

3D Multi-Object Tracking Autonomous Driving

Warp Consistency for Unsupervised Learning of Dense Correspondences

1 code implementation ICCV 2021 Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool

From our observations and empirical results, we design a general unsupervised objective employing two of the derived constraints.

Dense Pixel Correspondence Estimation Triplet

Learning Target Candidate Association to Keep Track of What Not to Track

1 code implementation ICCV 2021 Christoph Mayer, Martin Danelljan, Danda Pani Paudel, Luc van Gool

To tackle the problem of lacking ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision.

Visual Object Tracking Visual Tracking

Deep Gaussian Processes for Few-Shot Segmentation

no code implementations30 Mar 2021 Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, Fahad Shahbaz Khan

Through the expressivity of the GP, our approach is capable of modeling complex appearance distributions in the deep feature space.

Decoder Gaussian Processes +1

Local Memory Attention for Fast Video Semantic Segmentation

1 code implementation5 Jan 2021 Matthieu Paul, Martin Danelljan, Luc van Gool, Radu Timofte

Our approach aggregates a rich representation of the semantic information in past frames into a memory module.

Decoder Segmentation +2

Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU

1 code implementation ICCV 2021 Shipra Jain, Danda Paudel Pani, Martin Danelljan, Luc van Gool

In this paper, we propose a novel training methodology to train and scale the existing semantic segmentation models for a large number of semantic classes without increasing the memory overhead.

Image Classification object-detection +3

Accurate 3D Object Detection using Energy-Based Models

1 code implementation8 Dec 2020 Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

On the KITTI dataset, our proposed approach consistently outperforms the SA-SSD baseline across all 3DOD metrics, demonstrating the potential of EBM-based regression for highly accurate 3DOD.

3D Object Detection Object +2

Learning Video Instance Segmentation with Recurrent Graph Neural Networks

no code implementations7 Dec 2020 Joakim Johnander, Emil Brissman, Martin Danelljan, Michael Felsberg

Most existing approaches to video instance segmentation comprise multiple modules that are heuristically combined to produce the final output.

Graph Neural Network Instance Segmentation +4

Fast Few-Shot Classification by Few-Iteration Meta-Learning

1 code implementation1 Oct 2020 Ardhendu Shekhar Tripathi, Martin Danelljan, Luc van Gool, Radu Timofte

By employing an efficient initialization module and a Steepest Descent based optimization algorithm, our base learner predicts a powerful classifier within only a few iterations.

Classification General Classification +3

Video Object Segmentation with Episodic Graph Memory Networks

1 code implementation ECCV 2020 Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc van Gool

How to make a segmentation model efficiently adapt to a specific video and to online target appearance variations are fundamentally crucial issues in the field of video object segmentation.

Object Segmentation +4

The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures

1 code implementation CVPR 2021 Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte

Based on that, we articulate the heterogeneity hypothesis: with the same training protocol, there exists a layer-wise differentiated network architecture (LW-DNA) that can outperform the original network with regular channel configurations but with a lower level of model complexity.

Image Classification Image Restoration +1

SRFlow: Learning the Super-Resolution Space with Normalizing Flow

8 code implementations ECCV 2020 Andreas Lugmayr, Martin Danelljan, Luc van Gool, Radu Timofte

SRFlow therefore directly accounts for the ill-posed nature of the problem, and learns to predict diverse photo-realistic high-resolution images.

Ranked #7 on Image Super-Resolution on DIV2K val - 4x upscaling (using extra training data)

Diversity Image Manipulation +1

How to Train Your Energy-Based Model for Regression

1 code implementation4 May 2020 Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schön

While they are commonly employed for generative image modeling, recent work has applied EBMs also for regression tasks, achieving state-of-the-art performance on object detection and visual tracking.

object-detection Object Detection +3

Learning Human-Object Interaction Detection using Interaction Points

1 code implementation CVPR 2020 Tiancai Wang, Tong Yang, Martin Danelljan, Fahad Shahbaz Khan, Xiangyu Zhang, Jian Sun

Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them.

Human-Object Interaction Detection Keypoint Detection +2

Know Your Surroundings: Exploiting Scene Information for Object Tracking

1 code implementation ECCV 2020 Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

Such approaches are however prone to fail in case of e. g. fast appearance changes or presence of distractor objects, where a target appearance model alone is insufficient for robust tracking.

Object Tracking

Learning Fast and Robust Target Models for Video Object Segmentation

2 code implementations CVPR 2020 Andreas Robinson, Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

The target appearance model consists of a light-weight module, which is learned during the inference stage using fast optimization techniques to predict a coarse but robust target segmentation.

One-shot visual object segmentation Segmentation +2

GLU-Net: Global-Local Universal Network for Dense Flow and Correspondences

2 code implementations CVPR 2020 Prune Truong, Martin Danelljan, Radu Timofte

Establishing dense correspondences between a pair of images is an important and general problem, covering geometric matching, optical flow and semantic correspondences.

Dense Pixel Correspondence Estimation Geometric Matching +1

Energy-Based Models for Deep Probabilistic Regression

1 code implementation ECCV 2020 Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön

In our proposed approach, we create an energy-based model of the conditional target density p(y|x), using a deep neural network to predict the un-normalized density from (x, y).

 Ranked #1 on Object Detection on COCO test-dev (Hardware Burden metric)

Head Pose Estimation object-detection +4

Unsupervised Learning for Real-World Super-Resolution

no code implementations20 Sep 2019 Andreas Lugmayr, Martin Danelljan, Radu Timofte

Instead of directly addressing this problem, most works employ the popular bicubic downsampling strategy to artificially generate a corresponding low resolution image.

Image Super-Resolution

Multi-Modal Fusion for End-to-End RGB-T Tracking

1 code implementation30 Aug 2019 Lichao Zhang, Martin Danelljan, Abel Gonzalez-Garcia, Joost Van de Weijer, Fahad Shahbaz Khan

Our tracker is trained in an end-to-end manner, enabling the components to learn how to fuse the information from both modalities.

Image-to-Image Translation Rgb-T Tracking

Learning the Model Update for Siamese Trackers

1 code implementation ICCV 2019 Lichao Zhang, Abel Gonzalez-Garcia, Joost Van de Weijer, Martin Danelljan, Fahad Shahbaz Khan

In general, this template is linearly combined with the accumulated template from the previous frame, resulting in an exponential decay of information over time.

Visual Tracking

Evaluating Scalable Bayesian Deep Learning Methods for Robust Computer Vision

1 code implementation4 Jun 2019 Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

We therefore accept this task and propose a comprehensive evaluation framework for scalable epistemic uncertainty estimation methods in deep learning.

Deep Learning Depth Completion +1

Discriminative Online Learning for Fast Video Object Segmentation

no code implementations18 Apr 2019 Andreas Robinson, Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

We propose a novel approach, based on a dedicated target appearance model that is exclusively learned online to discriminate between the target and background image regions.

Object One-shot visual object segmentation +4

Learning Discriminative Model Prediction for Tracking

2 code implementations ICCV 2019 Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

The current strive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking.

Visual Object Tracking Visual Tracking

ATOM: Accurate Tracking by Overlap Maximization

4 code implementations CVPR 2019 Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

We argue that this approach is fundamentally limited since target estimation is a complex task, requiring high-level knowledge about the object.

General Classification Visual Object Tracking +1

Synthetic data generation for end-to-end thermal infrared tracking

no code implementations4 Jun 2018 Lichao Zhang, Abel Gonzalez-Garcia, Joost Van de Weijer, Martin Danelljan, Fahad Shahbaz Khan

These methods provide us with a large labeled dataset of synthetic TIR sequences, on which we can train end-to-end optimal features for tracking.

Image-to-Image Translation Synthetic Data Generation +2

Density Adaptive Point Set Registration

1 code implementation CVPR 2018 Felix Järemo Lawin, Martin Danelljan, Fahad Shahbaz Khan, Per-Erik Forssén, Michael Felsberg

Contrary to previous works, we model the underlying structure of the scene as a latent probability distribution, and thereby induce invariance to point set density changes.

Deep Motion Features for Visual Tracking

no code implementations20 Dec 2016 Susanna Gladh, Martin Danelljan, Fahad Shahbaz Khan, Michael Felsberg

To the best of our knowledge, we are the first to propose fusing appearance information with deep motion features for visual tracking.

Action Recognition Optical Flow Estimation +3

ECO: Efficient Convolution Operators for Tracking

5 code implementations CVPR 2017 Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

Moreover, our fast variant, using hand-crafted features, operates at 60 Hz on a single CPU, while obtaining 65. 0% AUC on OTB-2015.

Diversity Visual Object Tracking

Discriminative Scale Space Tracking

no code implementations20 Sep 2016 Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg

Compared to the standard exhaustive scale search, our approach achieves a gain of 2. 5% in average overlap precision on the OTB dataset.

Visual Object Tracking

Learning Spatially Regularized Correlation Filters for Visual Tracking

no code implementations ICCV 2015 Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg

These methods utilize a periodic assumption of the training samples to efficiently learn a classifier on all patches in the target neighborhood.

Visual Tracking

A Probabilistic Framework for Color-Based Point Set Registration

no code implementations CVPR 2016 Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan, Michael Felsberg

On the Stanford Lounge dataset, our approach achieves a relative reduction of the failure rate by 78% compared to the baseline.

Cannot find the paper you are looking for? You can Submit a new open access paper.