3 code implementations • CVPR 2022 • Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc van Gool
In this work, we propose RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks.
5 code implementations • ICCV 2021 • Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc van Gool
Inspired by the recent advance in unsupervised contrastive representation learning, we propose a pixel-wise contrastive framework for semantic segmentation in the fully supervised setting.
9 code implementations • 23 Aug 2021 • Jingyun Liang, JieZhang Cao, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte
In particular, the deep feature extraction module is composed of several residual Swin Transformer blocks (RSTB), each of which has several Swin Transformer layers together with a residual connection.
Ranked #2 on Color Image Denoising on urban100 sigma15
3 code implementations • CVPR 2022 • Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
The HSI representations are highly similar and correlated across the spectral dimension.
Ranked #2 on Spectral Reconstruction on ARAD-1K
3 code implementations • 17 Apr 2022 • Yuanhao Cai, Jing Lin, Zudi Lin, Haoqian Wang, Yulun Zhang, Hanspeter Pfister, Radu Timofte, Luc van Gool
Existing leading methods for spectral reconstruction (SR) focus on designing deeper or wider convolutional neural networks (CNNs) to learn the end-to-end mapping from the RGB image to its hyperspectral image (HSI).
Ranked #1 on Spectral Reconstruction on ARAD-1K
11 code implementations • 8 May 2017 • Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool
Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.
Ranked #5 on Video Classification on COIN
19 code implementations • 2 Aug 2016 • Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool
The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.
Ranked #3 on Multimodal Activity Recognition on EV-Action
2 code implementations • CVPR 2020 • Martin Danelljan, Luc van Gool, Radu Timofte
In this work, we therefore propose a probabilistic regression formulation and apply it to tracking.
Ranked #4 on Object Tracking on FE108
2 code implementations • ICCV 2019 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte
The current strive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking.
Ranked #5 on Object Tracking on FE108
1 code implementation • ECCV 2020 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte
Such approaches are however prone to fail in case of e. g. fast appearance changes or presence of distractor objects, where a target appearance model alone is insufficient for robust tracking.
Ranked #8 on Object Tracking on FE108
2 code implementations • ECCV 2020 • Goutam Bhat, Felix Järemo Lawin, Martin Danelljan, Andreas Robinson, Michael Felsberg, Luc van Gool, Radu Timofte
This allows us to achieve a rich internal representation of the target in the current frame, significantly increasing the segmentation accuracy of our approach.
1 code implementation • ICCV 2021 • Bin Zhao, Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte
This effectively limits the performance and generalization capabilities of existing video segmentation methods.
1 code implementation • ICCV 2021 • Christoph Mayer, Martin Danelljan, Danda Pani Paudel, Luc van Gool
To tackle the problem of lacking ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision.
Ranked #5 on Visual Object Tracking on OTB-2015
2 code implementations • 17 Dec 2021 • Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc van Gool
E. T. Track, our visual tracker that incorporates Exemplar Transformer modules, runs at 47 FPS on a CPU.
1 code implementation • CVPR 2022 • Christoph Mayer, Martin Danelljan, Goutam Bhat, Matthieu Paul, Danda Pani Paudel, Fisher Yu, Luc van Gool
Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function.
Ranked #21 on Visual Object Tracking on LaSOT (Precision metric)
2 code implementations • 21 Mar 2022 • Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc van Gool
We infer a bounding box from the segmentation mask, validate our tracker on challenging tracking datasets and achieve the new state of the art on LaSOT with a success AUC score of 69. 7%.
1 code implementation • 14 Aug 2022 • Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Akshay Dudhane, Martin Danelljan, Hisham Cholakkal, Salman Khan, Luc van Gool, Fahad Shahbaz Khan
While being greatly benefiting to the tracking research, existing benchmarks do not pose the same difficulty as before with recent trackers achieving higher performance mainly due to (i) the introduction of more sophisticated transformers-based methods and (ii) the lack of diverse scenarios with adverse visibility such as, severe weather conditions, camouflage and imaging effects.
1 code implementation • 22 Dec 2022 • Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc van Gool, Alina Kuznetsova
Our approach achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark.
22 code implementations • 15 Feb 2018 • Davy Neven, Bert de Brabandere, Stamatios Georgoulis, Marc Proesmans, Luc van Gool
By doing so, we ensure a lane fitting which is robust against road plane changes, unlike existing approaches that rely on a fixed, pre-defined transformation.
Ranked #15 on Lane Detection on TuSimple
3 code implementations • ICCV 2017 • Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc van Gool
Despite a rapid rise in the quality of built-in smartphone cameras, their physical limitations - small sensor size, compact lenses and the lack of specific hardware, - impede them to achieve the quality results of DSLR cameras.
2 code implementations • 1 Jul 2021 • Janis Postels, Mattia Segu, Tao Sun, Luca Sieber, Luc van Gool, Fisher Yu, Federico Tombari
We find that, while DUMs scale to realistic vision tasks and perform well on OOD detection, the practicality of current methods is undermined by poor calibration under distributional shifts.
Out of Distribution (OOD) Detection Semantic Segmentation +1
2 code implementations • ECCV 2020 • Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Marc Proesmans, Luc van Gool
First, a self-supervised task from representation learning is employed to obtain semantically meaningful features.
Ranked #4 on Unsupervised Image Classification on ImageNet
2 code implementations • NeurIPS 2021 • Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc van Gool
Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection.
1 code implementation • 28 Jan 2022 • Jingyun Liang, JieZhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc van Gool
Besides, parallel warping is used to further fuse information from neighboring frames by parallel feature warping.
Ranked #1 on Deblurring on BASED
3 code implementations • ICCV 2021 • Kai Zhang, Jingyun Liang, Luc van Gool, Radu Timofte
It is widely acknowledged that single image super-resolution (SISR) methods would not perform well if the assumed degradation model deviates from those in real images.
2 code implementations • CVPR 2018 • Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, Luc van Gool
This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise object segmentation for images and videos.
1 code implementation • CVPR 2020 • Kai Zhang, Luc van Gool, Radu Timofte
As a result, the proposed network inherits the flexibility of model-based methods to super-resolve blurry, noisy images for different scale factors via a single model, while maintaining the advantages of learning-based methods.
7 code implementations • ECCV 2020 • Andreas Lugmayr, Martin Danelljan, Luc van Gool, Radu Timofte
SRFlow therefore directly accounts for the ill-posed nature of the problem, and learns to predict diverse photo-realistic high-resolution images.
Ranked #6 on Image Super-Resolution on DIV2K val - 4x upscaling (using extra training data)
1 code implementation • ECCV 2020 • Simon Vandenhende, Stamatios Georgoulis, Luc van Gool
In this paper, we argue about the importance of considering task interactions at multiple scales when distilling task information in a multi-task learning setup.
Ranked #7 on Semantic Segmentation on UrbanLF
1 code implementation • 28 Apr 2020 • Simon Vandenhende, Stamatios Georgoulis, Wouter Van Gansbeke, Marc Proesmans, Dengxin Dai, Luc van Gool
In this survey, we provide a well-rounded view on state-of-the-art deep learning approaches for MTL in computer vision, explicitly emphasizing on dense prediction tasks.
1 code implementation • 1 Feb 2019 • Wouter Van Gansbeke, Bert de Brabandere, Davy Neven, Marc Proesmans, Luc van Gool
The problem with such a two-step approach is that the parameters of the network are not optimized for the true task of interest (estimating the lane curvature parameters) but for a proxy task (segmenting the lane markings), resulting in sub-optimal performance.
4 code implementations • CVPR 2021 • Prune Truong, Martin Danelljan, Luc van Gool, Radu Timofte
Establishing dense correspondences between a pair of images is an important and general problem.
1 code implementation • ICCV 2021 • Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool
From our observations and empirical results, we design a general unsupervised objective employing two of the derived constraints.
1 code implementation • 28 Sep 2021 • Prune Truong, Martin Danelljan, Radu Timofte, Luc van Gool
In order to apply dense methods to real-world applications, such as pose estimation, image manipulation, or 3D reconstruction, it is therefore crucial to estimate the confidence of the predicted matches.
1 code implementation • CVPR 2022 • Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool
We propose Probabilistic Warp Consistency, a weakly-supervised learning objective for semantic matching.
4 code implementations • 31 Aug 2020 • Kai Zhang, Yawei Li, WangMeng Zuo, Lei Zhang, Luc van Gool, Radu Timofte
Recent works on plug-and-play image restoration have shown that a denoiser can implicitly serve as the image prior for model-based methods to solve many inverse problems.
2 code implementations • 24 Mar 2022 • Kai Zhang, Yawei Li, Jingyun Liang, JieZhang Cao, Yulun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, Luc van Gool
While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved.
Ranked #1 on Image Denoising on urban100 sigma15
8 code implementations • CVPR 2017 • Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool
This paper tackles the task of semi-supervised video object segmentation, i. e., the separation of an object from the background in a video, given the mask of the first frame.
2 code implementations • CVPR 2017 • Ali Diba, Vivek Sharma, Luc van Gool
Advantages of TLEs are: (a) they encode the entire video into a compact feature representation, learning the semantics and a discriminative feature space; (b) they are applicable to all kinds of networks like 2D and 3D CNNs for video classification; and (c) they model feature interactions in a more expressive way and without loss of information.
1 code implementation • ICCV 2019 • Eirikur Agustsson, Michael Tschannen, Fabian Mentzer, Radu Timofte, Luc van Gool
We present a learned image compression system based on GANs, operating at extremely low bitrates.
2 code implementations • CVPR 2022 • Xiaowan Hu, Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
On the one hand, the proposed HR spatial-spectral attention module with its efficient feature fusion provides continuous and fine pixel-level features.
Ranked #5 on Spectral Reconstruction on Real HSI
1 code implementation • 9 Mar 2022 • Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
Many algorithms have been developed to solve the inverse problem of coded aperture snapshot spectral imaging (CASSI), i. e., recovering the 3D hyperspectral images (HSIs) from a 2D compressive measurement.
Ranked #2 on Spectral Reconstruction on Real HSI
1 code implementation • 20 May 2022 • Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool
In coded aperture snapshot spectral compressive imaging (CASSI) systems, hyperspectral image (HSI) reconstruction methods are employed to recover the spatial-spectral signal from a compressed measurement.
Ranked #1 on Spectral Reconstruction on Real HSI
1 code implementation • CVPR 2022 • Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc van Gool
Estimating 3D human poses from monocular videos is a challenging task due to depth ambiguity and self-occlusion.
Ranked #22 on 3D Human Pose Estimation on MPI-INF-3DHP
1 code implementation • 14 Feb 2019 • Wouter Van Gansbeke, Davy Neven, Bert de Brabandere, Luc van Gool
However, we additionally propose a fusion method with RGB guidance from a monocular camera in order to leverage object information and to correct mistakes in the sparse input.
Ranked #5 on Depth Completion on KITTI Depth Completion
1 code implementation • arXiv 2019 • Wouter Van Gansbeke, Davy Neven, Bert de Brabandere, Luc van Gool
For autonomous vehicles and robotics the use of LiDAR is indispensable in order to achieve precise depth predictions.
8 code implementations • 8 Aug 2017 • Bert De Brabandere, Davy Neven, Luc van Gool
In this work we propose to tackle the problem with a discriminative loss function, operating at the pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step.
Ranked #4 on Multi-Human Parsing on MHP v1.0
3 code implementations • CVPR 2022 • Lukas Hoyer, Dengxin Dai, Luc van Gool
It improves the state of the art by 10. 8 mIoU for GTA-to-Cityscapes and 5. 4 mIoU for Synthia-to-Cityscapes and enables learning even difficult classes such as train, bus, and truck well.
Ranked #5 on Semantic Segmentation on DensePASS
3 code implementations • 26 Apr 2023 • Lukas Hoyer, Dengxin Dai, Luc van Gool
As previous UDA&DG semantic segmentation methods are mostly based on outdated networks, we benchmark more recent architectures, reveal the potential of Transformers, and design the DAFormer network tailored for UDA&DG.
4 code implementations • 27 Mar 2022 • Ge-Peng Ji, Guobao Xiao, Yu-Cheng Chou, Deng-Ping Fan, Kai Zhao, Geng Chen, Luc van Gool
We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era.
Ranked #2 on Video Polyp Segmentation on SUN-SEG-Easy (Unseen)
2 code implementations • ICCV 2021 • Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc van Gool
To achieve this, we introduce a two-step framework that adopts a predetermined mid-level prior in a contrastive optimization objective to learn pixel embeddings.
Ranked #3 on Unsupervised Semantic Segmentation on ImageNet-S-50
3 code implementations • CVPR 2019 • Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool
We propose the first practical learned lossless image compression system, L3C, and show that it outperforms the popular engineered codecs, PNG, WebP and JPEG 2000.
Ranked #3 on Image Compression on ImageNet32
1 code implementation • 16 Sep 2013 • Michael Van den Bergh, Xavier Boix, Gemma Roig, Luc van Gool
We define a robust and fast to evaluate energy function, based on enforcing color similarity between the bound- aries and the superpixel color histogram.
1 code implementation • ICCV 2023 • Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Luc van Gool
Diffusion model (DM) has achieved SOTA performance by modeling the image synthesis process into a sequential application of a denoising network.
1 code implementation • 27 Mar 2024 • Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc van Gool, Fisher Yu
However, the remarkable accuracy of recent MMDE methods is confined to their training domains.
Ranked #2 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)
2 code implementations • ICCV 2023 • Erik Sandström, Yue Li, Luc van Gool, Martin R. Oswald
We propose a dense neural simultaneous localization and mapping (SLAM) approach for monocular RGBD input which anchors the features of a neural scene representation in a point cloud that is iteratively generated in an input-dependent data-driven manner.
3 code implementations • 13 Feb 2020 • Andrey Ignatov, Luc van Gool, Radu Timofte
The model is trained to convert RAW Bayer data obtained directly from mobile camera sensor into photos captured with a professional high-end DSLR camera, making the solution independent of any particular mobile ISP implementation.
1 code implementation • CVPR 2023 • Yawei Li, Yuchen Fan, Xiaoyu Xiang, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc van Gool
The aim of this paper is to propose a mechanism to efficiently and explicitly model image hierarchies in the global, regional, and local range for image restoration.
Ranked #1 on Image Defocus Deblurring on DPD (Dual-view)
1 code implementation • 30 Nov 2017 • Bernhard Kratzwald, Zhiwu Huang, Danda Pani Paudel, Acharya Dinesh, Luc van Gool
In this paper, we aim to improve the state-of-the-art video generative adversarial networks (GANs) with a view towards multi-functional applications.
8 code implementations • CVPR 2018 • Yuhua Chen, Wen Li, Christos Sakaridis, Dengxin Dai, Luc van Gool
The results demonstrate the effectiveness of our proposed approach for robust object detection in various domain shift scenarios.
3 code implementations • 5 Jun 2022 • Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, JieZhang Cao, Kai Zhang, Radu Timofte, Luc van Gool
Specifically, RVRT divides the video into multiple clips and uses the previously inferred clip feature to estimate the subsequent clip feature.
1 code implementation • CVPR 2022 • Tianfei Zhou, Wenguan Wang, Ender Konukoglu, Luc van Gool
Prevalent semantic segmentation solutions, despite their different network designs (FCN based or attention based) and mask decoding strategies (parametric softmax based or pixel-query based), can be placed in one category, by considering the softmax weights or query vectors as learnable class prototypes.
3 code implementations • CVPR 2023 • Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi Lin, Radu Timofte, Luc van Gool
We then introduce a dual-branch Transformer-CNN feature extractor with Lite Transformer (LT) blocks leveraging long-range attention to handle low-frequency global features and Invertible Neural Networks (INN) blocks focusing on extracting high-frequency local information.
3 code implementations • ICCV 2023 • Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc van Gool
To leverage strong generative priors and address challenges such as unstable training and lack of interpretability for GAN-based generative methods, we propose a novel fusion algorithm based on the denoising diffusion probabilistic model (DDPM).
3 code implementations • 19 May 2023 • Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Kai Zhang, Shuang Xu, Dongdong Chen, Radu Timofte, Luc van Gool
These components enable the net training to follow the principles of the natural sensing-imaging process while satisfying the equivariant imaging prior.
1 code implementation • CVPR 2021 • M. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc van Gool, Rainer Stiefelhagen
Action segmentation refers to inferring boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks.
Ranked #1 on Action Segmentation on MPII Cooking 2 Dataset
2 code implementations • 15 May 2023 • Yuanzhi Zhu, Kai Zhang, Jingyun Liang, JieZhang Cao, Bihan Wen, Radu Timofte, Luc van Gool
Although diffusion models have shown impressive performance for high-quality image synthesis, their potential to serve as a generative denoiser prior to the plug-and-play IR methods remains to be further explored.
3 code implementations • CVPR 2022 • Ozan Unal, Dengxin Dai, Luc van Gool
Densely annotating LiDAR point clouds remains too expensive and time-consuming to keep up with the ever growing volume of data.
Ranked #2 on 3D Semantic Segmentation on ScribbleKITTI
2 code implementations • 26 Feb 2022 • Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc van Gool
We propose a pipeline to generate Neural Radiance Fields~(NeRF) of an object or a scene of a specific class, conditioned on a single input image.
1 code implementation • CVPR 2022 • Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc van Gool
We propose a pipeline to generate Neural Radiance Fields (NeRF) of an object or a scene of a specific class, conditioned on a single input image.
1 code implementation • 2 Aug 2016 • Yuanjun Xiong, Li-Min Wang, Zhe Wang, Bo-Wen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc van Gool, Xiaoou Tang
This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016.
2 code implementations • ICCV 2021 • Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc van Gool
Our end-to-end agent achieves a 78% success rate while generalizing to a new town and new weather on the NoCrash-dense benchmark and state-of-the-art performance on the challenging public routes of the CARLA LeaderBoard.
1 code implementation • CVPR 2023 • Lukas Hoyer, Dengxin Dai, Haoran Wang, Luc van Gool
MIC significantly improves the state-of-the-art performance across the different recognition tasks for synthetic-to-real, day-to-nighttime, and clear-to-adverse-weather UDA.
1 code implementation • 12 Jun 2021 • JieZhang Cao, Yawei Li, Kai Zhang, Luc van Gool
Specifically, to tackle the first issue, we present a spatial-temporal convolutional self-attention layer with a theoretical understanding to exploit the locality information.
1 code implementation • CVPR 2021 • Lukas Hoyer, Dengxin Dai, Yuhua Chen, Adrian Köring, Suman Saha, Luc van Gool
Training deep networks for semantic segmentation requires large amounts of labeled training data, which presents a major challenge in practice, as labeling segmentation masks is a highly labor-intensive process.
Ranked #4 on Semi-Supervised Semantic Segmentation on Cityscapes 100 samples labeled (using extra training data)
1 code implementation • 28 Aug 2021 • Lukas Hoyer, Dengxin Dai, Qin Wang, Yuhua Chen, Luc van Gool
Training deep networks for semantic segmentation requires large amounts of labeled training data, which presents a major challenge in practice, as labeling segmentation masks is a highly labor-intensive process.
1 code implementation • 27 Apr 2022 • Lukas Hoyer, Dengxin Dai, Luc van Gool
Therefore, we propose HRDA, a multi-resolution training approach for UDA, that combines the strengths of small high-resolution crops to preserve fine segmentation details and large low-resolution crops to capture long-range context dependencies with a learned scale attention, while maintaining a manageable GPU memory footprint.
Ranked #3 on Semantic Segmentation on GTAV-to-Cityscapes Labels
2 code implementations • NeurIPS 2020 • Prune Truong, Martin Danelljan, Luc van Gool, Radu Timofte
We propose GOCor, a fully differentiable dense matching module, acting as a direct replacement to the feature correlation layer.
Dense Pixel Correspondence Estimation Feature Correlation +2
1 code implementation • 8 Aug 2017 • Davy Neven, Bert de Brabandere, Stamatios Georgoulis, Marc Proesmans, Luc van Gool
Most approaches for instance-aware semantic labeling traditionally focus on accuracy.
4 code implementations • CVPR 2019 • Davy Neven, Bert de Brabandere, Marc Proesmans, Luc van Gool
In this work we propose a new clustering loss function for proposal-free instance segmentation.
Ranked #1000000000 on Instance Segmentation on Cityscapes test
3 code implementations • CVPR 2020 • Ren Yang, Fabian Mentzer, Luc van Gool, Radu Timofte
In our HLVC approach, the hierarchical quality benefits the coding efficiency, since the high quality information facilitates the compression and enhancement of low quality frames at encoder and decoder sides, respectively.
2 code implementations • 24 Jun 2020 • Ren Yang, Fabian Mentzer, Luc van Gool, Radu Timofte
The experiments show that our approach achieves the state-of-the-art learned video compression performance in terms of both PSNR and MS-SSIM.
4 code implementations • 29 Jun 2020 • Ren Yang, Luc van Gool, Radu Timofte
At the time of writing this report, several learned video compression methods are superior to DVC, but currently none of them provides open source codes.
3 code implementations • 7 Sep 2021 • Ren Yang, Radu Timofte, Luc van Gool
This paper proposes a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional GAN.
2 code implementations • CVPR 2022 • Qin Wang, Olga Fink, Luc van Gool, Dengxin Dai
However, real-world machine perception systems are running in non-stationary and continually changing environments where the target domain distribution can change over time.
1 code implementation • CVPR 2018 • Limin Wang, Wei Li, Wen Li, Luc van Gool
Specifically, SMART blocks decouple the spatiotemporal learning module into an appearance branch for spatial modeling and a relation branch for temporal modeling.
Ranked #51 on Action Recognition on UCF101
2 code implementations • ICCV 2021 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
In this work, we study the problem of extracting a directed graph representing the local road network in BEV coordinates, from a single onboard camera image.
Ranked #3 on Lane Detection on nuScenes
2 code implementations • 4 Apr 2024 • Wencan Cheng, Hao Tang, Luc van Gool, Jong Hwan Ko
Extracting keypoint locations from input hand frames, known as 3D hand pose estimation, is a critical task in various human-computer interaction applications.
2 code implementations • 17 Jan 2017 • Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool
We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Convolutional Neural Networks (CNNs).
1 code implementation • 9 Aug 2016 • Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool
We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Convolutional Neural Networks (CNNs).
1 code implementation • ICCV 2021 • Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc van Gool, Radu Timofte
More specifically, HCFlow learns a bijective mapping between HR and LR image pairs by modelling the distribution of the LR image and the rest high-frequency component simultaneously.
1 code implementation • CVPR 2018 • Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool
During training, the auto-encoder makes use of the context model to estimate the entropy of its representation, and the context model is concurrently updated to learn the dependencies between the symbols in the latent representation.
1 code implementation • CVPR 2018 • Chengde Wan, Thomas Probst, Luc van Gool, Angela Yao
Specifically, we decompose the pose parameters into a set of per-pixel estimations, i. e., 2D heat maps, 3D heat maps and unit 3D directional vector fields.
Ranked #4 on Hand Pose Estimation on MSRA Hands
1 code implementation • 2 Jul 2021 • Tianfei Zhou, Fatih Porikli, David Crandall, Luc van Gool, Wenguan Wang
Video segmentation -- partitioning video frames into multiple segments or objects -- plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing.
3 code implementations • CVPR 2021 • Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte
We propose a novel architecture for the burst super-resolution task.
Ranked #6 on Burst Image Super-Resolution on SyntheticBurst
Burst Image Super-Resolution Multi-Frame Super-Resolution +1
2 code implementations • ICCV 2021 • Goutam Bhat, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte
The deep reparametrization allows us to directly model the image formation process in the latent space, and to integrate learned image priors into the prediction.
Ranked #4 on Burst Image Super-Resolution on BurstSR
2 code implementations • 30 May 2022 • Peng Zheng, Huazhu Fu, Deng-Ping Fan, Qi Fan, Jie Qin, Yu-Wing Tai, Chi-Keung Tang, Luc van Gool
In this paper, we present a novel end-to-end group collaborative learning network, termed GCoNet+, which can effectively and efficiently (250 fps) identify co-salient objects in natural scenes.
Ranked #1 on Co-Salient Object Detection on CoCA
1 code implementation • 26 Sep 2018 • Asha Anoosheh, Torsten Sattler, Radu Timofte, Marc Pollefeys, Luc van Gool
We then compare the daytime and translated night images to obtain a pose estimate for the night image using the known 6-DOF position of the closest day image.
1 code implementation • CVPR 2018 • Liqian Ma, Qianru Sun, Stamatios Georgoulis, Luc van Gool, Bernt Schiele, Mario Fritz
Generating novel, yet realistic, images of persons is a challenging task due to the complex interplay between the different image factors, such as the foreground, background and pose information.
Ranked #2 on Gesture-to-Gesture Translation on Senz3D
1 code implementation • ICCV 2021 • Jingyun Liang, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte
Extensive experiments on synthetic and real images show that the proposed MANet not only performs favorably for both spatially variant and invariant kernel estimation, but also leads to state-of-the-art blind SR performance when combined with non-blind SR methods.
1 code implementation • CVPR 2022 • Martin Hahner, Christos Sakaridis, Mario Bijelic, Felix Heide, Fisher Yu, Dengxin Dai, Luc van Gool
Due to the difficulty of collecting and annotating training data in this setting, we propose a physically based method to simulate the effect of snowfall on real clear-weather LiDAR point clouds.
Ranked #1 on 3D Object Detection on Heavy Snowfall
2 code implementations • CVPR 2017 • Limin Wang, Yuanjun Xiong, Dahua Lin, Luc van Gool
We exploit the learned models for action recognition (WSR) and detection (WSD) on the untrimmed video datasets of THUMOS14 and ActivityNet.
Ranked #3 on Action Classification on ActivityNet-1.2
Weakly Supervised Action Localization Weakly-Supervised Action Recognition
1 code implementation • 13 Dec 2019 • Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Luc van Gool
Our RLC framework further reduces the annotation cost arising from large numbers of object categories in a dataset by only using lower-count supervision for a subset of categories and class-labels for the remaining ones.
Image Classification Image-level Supervised Instance Segmentation +3
1 code implementation • ICCV 2021 • Martin Hahner, Christos Sakaridis, Dengxin Dai, Luc van Gool
2) Through extensive experiments with several state-of-the-art detection approaches, we show that our fog simulation can be leveraged to significantly improve the performance for 3D object detection in the presence of fog.
Ranked #1 on 3D Object Detection on Dense Fog
2 code implementations • ECCV 2020 • Guolei Sun, Wenguan Wang, Jifeng Dai, Luc van Gool
Moreover, our approach ranked 1st place in the Weakly-Supervised Semantic Segmentation Track of CVPR2020 Learning from Imperfect Data Challenge.
1 code implementation • 6 Jan 2022 • Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool
Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring.
Ranked #1 on Deblurring on DVD
1 code implementation • 20 May 2022 • Jing Lin, Xiaowan Hu, Yuanhao Cai, Haoqian Wang, Youliang Yan, Xueyi Zou, Yulun Zhang, Luc van Gool
On the other hand, we equip the sequence-to-sequence model with an unsupervised optical flow estimator to maximize its potential.
Ranked #2 on Video Enhancement on MFQE v2
1 code implementation • CVPR 2021 • Jingyun Liang, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte
Kernel estimation is generally one of the key problems for blind image super-resolution (SR).
1 code implementation • NeurIPS 2016 • Bert De Brabandere, Xu Jia, Tinne Tuytelaars, Luc van Gool
In a traditional convolutional layer, the learned filters stay fixed after training.
Ranked #1 on Video Prediction on KTH (Cond metric)
1 code implementation • 19 Dec 2017 • Asha Anoosheh, Eirikur Agustsson, Radu Timofte, Luc Van Gool
This year alone has seen unprecedented leaps in the area of learning-based image translation, namely CycleGAN, by Zhu et al.
Ranked #6 on Facial Expression Translation on CelebA
1 code implementation • 30 Nov 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool
It consists of a knowledge distillation based implicit degradation estimator network (KD-IDE) and an efficient SR network.
1 code implementation • NeurIPS 2023 • Evangelos Ntavelis, Aliaksandr Siarohin, Kyle Olszewski, Chaoyang Wang, Luc van Gool, Sergey Tulyakov
We present a novel approach to the generation of static and articulated 3D assets that has a 3D autodecoder at its core.
1 code implementation • 30 Nov 2021 • Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc van Gool
Traditional frame-based cameras inevitably suffer from motion blur due to long exposure times.
Ranked #3 on Deblurring on GoPro (using extra training data)
1 code implementation • 25 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc van Gool
Reference-based image super-resolution (RefSR) aims to exploit auxiliary reference (Ref) images to super-resolve low-resolution (LR) images.
1 code implementation • ECCV 2020 • Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc van Gool, Dengxin Dai
This work proposes a weakly supervised approach for 3D object detection, only requiring a small set of weakly annotated scenes, associated with a few precisely labeled object instances.
1 code implementation • CVPR 2022 • Vaishakh Patil, Christos Sakaridis, Alexander Liniger, Luc van Gool
We focus on the supervised setup, in which ground-truth depth is available only at training time.
Ranked #6 on Depth Estimation on NYU-Depth V2
2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang
The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.
2 code implementations • 2 Oct 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool
In this study, we reconsider components in binary convolution, such as residual connection, BatchNorm, activation function, and structure, for IR tasks.
2 code implementations • 13 Feb 2023 • Ce Liu, Suryansh Kumar, Shuhang Gu, Radu Timofte, Luc van Gool
While state-of-the-art deep neural network methods for SIDP learn the scene depth from images in a supervised setting, they often overlook the invaluable invariances and priors in the rigid scene space, such as the regularity of the scene.
Ranked #18 on Monocular Depth Estimation on NYU-Depth V2
2 code implementations • 12 Apr 2021 • Yawei Li, Kai Zhang, JieZhang Cao, Radu Timofte, Luc van Gool
The importance of locality mechanisms is validated in two ways: 1) A wide range of design choices (activation function, layer placement, expansion ratio) are available for incorporating locality mechanisms and all proper choices can lead to a performance gain over the baseline, and 2) The same locality mechanism is successfully applied to 4 vision transformers, which shows the generalization of the locality concept.
Ranked #623 on Image Classification on ImageNet
3 code implementations • 22 Nov 2017 • Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc van Gool
Thus, by finetuning this network, we beat the performance of generic and recent methods in 3D CNNs, which were trained on large video datasets, e. g. Sports-1M, and finetuned on the target datasets, e. g. HMDB51/UCF101.
1 code implementation • CVPR 2023 • JieZhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool
We explicitly design an implicit attention network to learn the ensemble weights for the nearby local features.
1 code implementation • CVPR 2022 • Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc van Gool, Bernt Schiele, Federico Tombari, Fisher Yu
Adapting to a continuously evolving environment is a safety-critical challenge inevitably faced by all autonomous driving systems.
2 code implementations • 29 Nov 2017 • Miriam Bellver, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Xavier Giro-i-Nieto, Jordi Torres, Luc van Gool
A fully automatic technique for segmenting the liver and localizing its unhealthy tissues is a convenient tool in order to diagnose hepatic diseases and assess the response to the according treatments.
1 code implementation • 5 Jan 2019 • Dengxin Dai, Christos Sakaridis, Simon Hecker, Luc van Gool
The method is based on the fact that the results of semantic segmentation in moderately adverse conditions (light fog) can be bootstrapped to solve the same problem in highly adverse conditions (dense fog).
Ranked #5 on Domain Adaptation on Cityscapes-to-FoggyDriving
1 code implementation • ECCV 2020 • Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc van Gool
How to make a segmentation model efficiently adapt to a specific video and to online target appearance variations are fundamentally crucial issues in the field of video object segmentation.
1 code implementation • 25 May 2022 • Ge-Peng Ji, Deng-Ping Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, Luc van Gool
This paper introduces DGNet, a novel deep framework that exploits object gradient supervision for camouflaged object detection (COD).
2 code implementations • 15 Sep 2023 • Tianfu Wang, Menelaos Kanakis, Konrad Schindler, Luc van Gool, Anton Obukhov
Diffusion-based text-to-image models ignited immense attention from the vision community, artists, and content creators.
2 code implementations • CVPR 2021 • Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc van Gool, Radu Timofte
We propose DeFlow, a method for learning stochastic image degradations from unpaired data.
1 code implementation • CVPR 2016 • Alex Locher, Michal Perdoch, Luc van Gool
This work proposes a progressive patch based multi-view stereo algorithm able to deliver a dense point cloud at any time.
1 code implementation • 13 May 2018 • Dinesh Acharya, Zhiwu Huang, Danda Paudel, Luc van Gool
In this work, we explore the benefits of using a man- ifold network structure for covariance pooling to improve facial expression recognition.
Facial Expression Recognition Facial Expression Recognition (FER)
1 code implementation • CVPR 2020 • Despoina Paschalidou, Luc van Gool, Andreas Geiger
Humans perceive the 3D world as a set of distinct objects that are characterized by various low-level (geometry, reflectance) and high-level (connectivity, adjacency, symmetry) properties.
6 code implementations • 27 Jan 2022 • Zudi Lin, Prateek Garg, Atmadeep Banerjee, Salma Abdel Magid, Deqing Sun, Yulun Zhang, Luc van Gool, Donglai Wei, Hanspeter Pfister
Image super-resolution (SR) is a fast-moving field with novel architectures attracting the spotlight.
1 code implementation • CVPR 2023 • Bin Xia, Jingwen He, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Luc van Gool
In SSL, we design pruning schemes for several key components in VSR models, including residual blocks, recurrent networks, and upsampling networks.
1 code implementation • NeurIPS 2023 • Zhejun Zhang, Alexander Liniger, Christos Sakaridis, Fisher Yu, Luc van Gool
The real-world deployment of an autonomous driving system requires its components to run on-board and in real-time, including the motion prediction module that predicts the future trajectories of surrounding traffic participants.
1 code implementation • 30 Sep 2020 • Juan-Ting Lin, Dengxin Dai, Luc van Gool
We give a comprehensive study of the fusion between RGB images and Radar measurements from different aspects and proposed a working solution based on the observations.
1 code implementation • 13 Jun 2022 • Wouter Van Gansbeke, Simon Vandenhende, Luc van Gool
This paper presents MaskDistill: a novel framework for unsupervised semantic segmentation based on three key ideas.
Ranked #4 on Unsupervised Semantic Segmentation on PASCAL VOC 2012 val (using extra training data)
3 code implementations • 6 Jun 2021 • Yun Liu, Yu-Huan Wu, Guolei Sun, Le Zhang, Ajad Chhatkuli, Luc van Gool
This paper tackles the high computational/space complexity associated with Multi-Head Self-Attention (MHSA) in vanilla vision transformers.
1 code implementation • 5 Jul 2022 • Jialun Pei, Tianyang Cheng, Deng-Ping Fan, He Tang, Chuanbo Chen, Luc van Gool
We present OSFormer, the first one-stage transformer framework for camouflaged instance segmentation (CIS).
1 code implementation • CVPR 2019 • Yawei Li, Vagia Tsiminaki, Radu Timofte, Marc Pollefeys, Luc van Gool
Experimental results demonstrate that our proposed networks successfully incorporate the 3D geometric information and super-resolve the texture maps.
2 code implementations • 31 Mar 2020 • Hao Tang, Xiaojuan Qi, Guolei Sun, Dan Xu, Nicu Sebe, Radu Timofte, Luc van Gool
We propose a novel ECGAN for the challenging semantic image synthesis task.
1 code implementation • 22 Jul 2023 • Hao Tang, Guolei Sun, Nicu Sebe, Luc van Gool
To tackle 2), we design an effective module to selectively highlight class-dependent feature maps according to the original semantic layout to preserve the semantic information.
1 code implementation • 14 Jul 2022 • David Bruggemann, Christos Sakaridis, Prune Truong, Luc van Gool
Due to the scarcity of dense pixel-level semantic annotations for images recorded in adverse visual conditions, there has been a keen interest in unsupervised domain adaptation (UDA) for the semantic segmentation of such images.
Ranked #1 on Semantic Segmentation on Dark Zurich
1 code implementation • CVPR 2021 • Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc van Gool
To address the challenging task of instance-aware human part parsing, a new bottom-up regime is proposed to learn category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner.
1 code implementation • ICCV 2017 • Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc van Gool, Konrad Schindler
Estimating a depth map from multiple views of a scene is a fundamental task in computer vision.
1 code implementation • CVPR 2018 • Despoina Paschalidou, Ali Osman Ulusoy, Carolin Schmitt, Luc van Gool, Andreas Geiger
RayNet integrates a CNN that learns view-invariant feature representations with an MRF that explicitly encodes the physics of perspective projection and occlusion.
1 code implementation • CVPR 2022 • Guolei Sun, Yun Liu, Henghui Ding, Min Wu, Luc van Gool
Specifically, we uniformly sample certain frames from the video and extract global contextual prototypes by k-means.
1 code implementation • 27 Nov 2023 • Lukas Hoyer, David Joseph Tan, Muhammad Ferjad Naeem, Luc van Gool, Federico Tombari
In SemiVL, we propose to integrate rich priors from VLM pre-training into semi-supervised semantic segmentation to learn better semantic decision boundaries.
Ranked #1 on Semi-Supervised Semantic Segmentation on PASCAL VOC 2012 732 labeled (using extra training data)
1 code implementation • 5 Dec 2020 • Yigit Baran Can, Alexander Liniger, Ozan Unal, Danda Paudel, Luc van Gool
In this work, we study scene understanding in the form of online estimation of semantic BEV maps using the video input from a single onboard camera.
1 code implementation • 29 Jun 2022 • Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc van Gool, Radu Timofte
Generative models have emerged as an essential building block for many image synthesis and editing tasks.
1 code implementation • ECCV 2020 • Ali Diba, Mohsen Fayyaz, Vivek Sharma, Manohar Paluri, Jurgen Gall, Rainer Stiefelhagen, Luc van Gool
HVU is organized hierarchically in a semantic taxonomy that focuses on multi-label and multi-task video understanding as a comprehensive problem that encompasses the recognition of multiple semantic aspects in the dynamic scene.
Ranked #11 on Action Recognition on UCF101
1 code implementation • ICCV 2021 • Qin Wang, Dengxin Dai, Lukas Hoyer, Luc van Gool, Olga Fink
However, such a supervision is not always available.
Ranked #15 on Domain Adaptation on SYNTHIA-to-Cityscapes (using extra training data)
1 code implementation • 21 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc van Gool
These issues can be alleviated by a cascade of three separate sub-tasks, including video deblurring, frame interpolation, and super-resolution, which, however, would fail to capture the spatial and temporal correlations among video sequences.
1 code implementation • 7 Mar 2022 • Menelaos Kanakis, Simon Maurer, Matteo Spallanzani, Ajad Chhatkuli, Luc van Gool
Efficient detection and description of geometric regions in images is a prerequisite in visual systems for localization and mapping.
1 code implementation • 4 Jan 2024 • Muhammad Uzair Khattak, Muhammad Ferjad Naeem, Muzammal Naseer, Luc van Gool, Federico Tombari
While effective, most of these works require labeled data which is not practical, and often struggle to generalize towards new datasets due to over-fitting on the source data.
1 code implementation • CVPR 2019 • Rui Gong, Wen Li, Yu-Hua Chen, Luc van Gool
In this work, we present a domain flow generation(DLOW) model to bridge two different domains by generating a continuous sequence of intermediate domains flowing from one domain to the other.
1 code implementation • CVPR 2022 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
We represent the road topology using a set of directed lane curves and their interactions, which are captured using their intersection points.
1 code implementation • 1 May 2017 • Arun Balajee Vasudevan, Michael Gygli, Anna Volokitin, Luc van Gool
Although the problem of automatic video summarization has recently received a lot of attention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied.
1 code implementation • 21 Apr 2023 • Deng-Ping Fan, Ge-Peng Ji, Peng Xu, Ming-Ming Cheng, Christos Sakaridis, Luc van Gool
Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive objects exhibiting camouflage.
1 code implementation • 11 Aug 2021 • Davide Menini, Suryansh Kumar, Martin R. Oswald, Erik Sandstrom, Cristian Sminchisescu, Luc van Gool
This paper presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label.
1 code implementation • 10 Dec 2022 • Bowen Yin, Xuying Zhang, Qibin Hou, Bo-Yuan Sun, Deng-Ping Fan, Luc van Gool
How to identify and segment camouflaged objects from the background is challenging.
1 code implementation • ICCV 2019 • Qin Wang, Wen Li, Luc van Gool
We reveal that an essential sampling bias exists in semi-supervised learning due to the limited number of labeled samples, which often leads to a considerable empirical distribution mismatch between labeled data and unlabeled data.
1 code implementation • 3 Jan 2017 • Naoya Takahashi, Michael Gygli, Luc van Gool
Instead, combining visual features with our AENet features, which can be computed efficiently on a GPU, leads to significant performance improvements on action recognition and video highlight detection.
3 code implementations • ICCV 2019 • Yawei Li, Shuhang Gu, Luc van Gool, Radu Timofte
Convolutional neural networks (CNNs) based solutions have achieved state-of-the-art performances for many computer vision tasks, including classification and super-resolution of images.
2 code implementations • CVPR 2020 • Yawei Li, Shuhang Gu, Christoph Mayer, Luc van Gool, Radu Timofte
In this paper, we analyze two popular network compression techniques, i. e. filter pruning and low-rank decomposition, in a unified sense.
1 code implementation • IJCNLP 2019 • Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Luc van Gool, Marie-Francine Moens
Or more specifically, we consider the problem in an autonomous driving setting, where a passenger requests an action that can be associated with an object found in a street scene.
1 code implementation • ECCV 2020 • Evangelos Ntavelis, Andrés Romero, Iason Kastanis, Luc van Gool, Radu Timofte
In contrast to previous methods that employ a discriminator that trivially concatenates semantics and image as an input, the SESAME discriminator is composed of two input streams that independently process the image and its semantics, using the latter to manipulate the results of the former.
1 code implementation • CVPR 2020 • Fabian Mentzer, Luc van Gool, Michael Tschannen
We leverage the powerful lossy image compression algorithm BPG to build a lossless image compression system.
2 code implementations • ECCV 2020 • Yawei Li, Shuhang Gu, Kai Zhang, Luc van Gool, Radu Timofte
Passing the sparsified latent vectors through the hypernetworks, the corresponding slices of the generated weight parameters can be removed, achieving the effect of network pruning.
1 code implementation • 4 Apr 2024 • Rui Li, Tobias Fischer, Mattia Segu, Marc Pollefeys, Luc van Gool, Federico Tombari
We propose KYN, a novel method for single-view scene reconstruction that reasons about semantic and spatial context to predict each point's density.
1 code implementation • 8 Jun 2017 • Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool
In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.
1 code implementation • 4 Oct 2018 • Dinesh Acharya, Zhiwu Huang, Danda Pani Paudel, Luc van Gool
Furthermore, we introduce a sliced version of Wasserstein GAN (SWGAN) loss to improve the distribution learning on the video data of high-dimension and mixed-spatiotemporal distribution.
1 code implementation • CVPR 2019 • Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool
In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.
Ranked #1 on Video Generation on TrailerFaces
1 code implementation • 30 Sep 2022 • Hui Wei, Hao Tang, Xuemei Jia, Zhixiang Wang, Hanxun Yu, Zhubo Li, Shin'ichi Satoh, Luc van Gool, Zheng Wang
Building upon this foundation, we uncover the pervasive role of artifacts carrying adversarial perturbations in the physical world.
1 code implementation • CVPR 2023 • Guolei Sun, Zhaochong An, Yun Liu, Ce Liu, Christos Sakaridis, Deng-Ping Fan, Luc van Gool
We further advance the frontier of this field by systematically studying a new challenge named indiscernible object counting (IOC), the goal of which is to count objects that are blended with respect to their surroundings.
2 code implementations • 7 Mar 2023 • Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc van Gool
We present TrafficBots, a multi-agent policy built upon motion prediction and end-to-end driving, and based on TrafficBots we obtain a world model tailored for the planning module of autonomous vehicles.
1 code implementation • 20 Nov 2023 • JieZhang Cao, Yue Shi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool
Due to the inherent property of diffusion models, most existing methods need long serial sampling chains to restore HQ images step-by-step, resulting in expensive sampling time and high computation costs.
2 code implementations • 9 Oct 2019 • Martin Hahner, Dengxin Dai, Christos Sakaridis, Jan-Nico Zaech, Luc van Gool
This work addresses the problem of semantic scene understanding under foggy road conditions.
1 code implementation • CVPR 2021 • Mohamad Shahbazi, Zhiwu Huang, Danda Pani Paudel, Ajad Chhatkuli, Luc van Gool
To address this problem, we introduce a new GAN transfer method to explicitly propagate the knowledge from the old classes to the new classes.
3 code implementations • 31 Dec 2021 • Deng-Ping Fan, Ziling Huang, Peng Zheng, Hong Liu, Xuebin Qin, Luc van Gool
Besides, we elaborate comprehensive experiments on the existing 19 cutting-edge models.
1 code implementation • ICCV 2021 • David Bruggemann, Menelaos Kanakis, Anton Obukhov, Stamatios Georgoulis, Luc van Gool
Our goal is to find the most efficient way to refine each task prediction by capturing cross-task contexts dependent on tasks' relations.
Ranked #78 on Semantic Segmentation on NYU Depth v2
1 code implementation • ICLR 2022 • Mohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc van Gool
On the contrary, we observe that class-conditioning causes mode collapse in limited data settings, where unconditional learning leads to satisfactory generative ability.
1 code implementation • CVPR 2022 • Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool
Positional encodings have enabled recent works to train a single adversarial network that can generate images of different scales.
1 code implementation • ICCV 2023 • David Bruggemann, Christos Sakaridis, Tim Brödermann, Luc van Gool
We investigate normal-to-adverse condition model adaptation for semantic segmentation, whereby image-level correspondences are available in the target domain.
Ranked #1 on Source-Free Domain Adaptation on Cityscapes to ACDC
1 code implementation • 28 Aug 2018 • Carles Ventura, Jordi Pont-Tuset, Sergi Caelles, Kevis-Kokitsi Maninis, Luc van Gool
This paper tackles the task of estimating the topology of road networks from aerial images.
2 code implementations • 28 Apr 2023 • Zhuyun Zhou, Zongwei Wu, Danda Pani Paudel, Rémi Boutteau, Fan Yang, Luc van Gool, Radu Timofte, Dominique Ginhac
Subsequently, we devise EmoFormer, a novel network able to exploit the event data.
1 code implementation • 27 Nov 2023 • Zongwei Wu, Jilai Zheng, Xiangxuan Ren, Florin-Alexandru Vasluianu, Chao Ma, Danda Pani Paudel, Luc van Gool, Radu Timofte
In practice, most existing RGB trackers learn a single set of parameters to use them across datasets and applications.
1 code implementation • 10 Dec 2018 • Andrés Romero, Pablo Arbeláez, Luc van Gool, Radu Timofte
This problem is highly challenging due to three main reasons: (i) unpaired datasets, (ii) multiple attributes, and (iii) the multimodality (e. g., style) associated with the translation.
1 code implementation • CVPR 2021 • Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc van Gool
Specifically, we show that: (1) our approach improves performance on all tasks when they are complementary and mutually dependent; (2) the CTRL helps to improve both semantic segmentation and depth estimation tasks performance in the challenging UDA setting; (3) the proposed ISL training scheme further improves the semantic segmentation performance.
1 code implementation • CVPR 2022 • Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc van Gool, Errui Ding
We propose a novel framework, i. e., Predict, Prevent, and Evaluate (PPE), for disentangled text-driven image manipulation that requires little manual annotation while being applicable to a wide variety of manipulations.
1 code implementation • 28 Nov 2023 • Qi Ma, Danda Pani Paudel, Ajad Chhatkuli, Luc van Gool
In this paper, we showcase the effectiveness of optimizing monocular camera poses as a continuous function of time.
1 code implementation • 15 Jun 2016 • Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc van Gool
In this paper, a new method for generating object and action proposals in images and videos is proposed.
1 code implementation • ICCV 2015 • Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc van Gool
We generate hypotheses in a sliding-window fashion over different activation layers and show that the final convolutional layers can find the object of interest with high recall but poor localization due to the coarseness of the feature maps.
1 code implementation • 11 Jan 2022 • Niclas Vödisch, Ozan Unal, Ke Li, Luc van Gool, Dengxin Dai
In this work, we take a new route to learn to optimize the LiDAR beam configuration for a given application.
1 code implementation • 20 Nov 2023 • Nikola Popovic, Dimitrios Christodoulou, Danda Pani Paudel, Xi Wang, Luc van Gool
In this work, we propose to predict 3D eye gaze from weak supervision of eye semantic segmentation masks and direct supervision of a few 3D gaze vectors.
1 code implementation • 5 Oct 2020 • Andrés Romero, Luc van Gool, Radu Timofte
Additionally, our method is capable of adding, removing or changing either fine-grained or coarse attributes by using an image as a reference or by exploring the style distribution space, and it can be easily extended to head-swapping and face-reenactment applications without being trained on videos.
1 code implementation • 19 Nov 2021 • Guanglei Yang, Hao Tang, Humphrey Shi, Mingli Ding, Nicu Sebe, Radu Timofte, Luc van Gool, Elisa Ricci
The global alignment network aims to transfer the input image from the source domain to the target domain.
1 code implementation • 27 Oct 2022 • Ge-Peng Ji, Mingcheng Zhuge, Dehong Gao, Deng-Ping Fan, Christos Sakaridis, Luc van Gool
We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation.
1 code implementation • ECCV 2020 • Menelaos Kanakis, David Bruggemann, Suman Saha, Stamatios Georgoulis, Anton Obukhov, Luc van Gool
First, enabling the model to be inherently incremental, continuously incorporating information from new tasks without forgetting the previously learned ones (incremental learning).
1 code implementation • 11 May 2022 • Chuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc van Gool
Within the proposed benchmark, we explore some commonly known essentials of standard continual learning.