1 code implementation • ECCV 2020 • Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, Luc van Gool
This way, in an effort to fix localization errors, our loss provides an extra supervisory signal that helps the model to better discriminate between similar classes.
no code implementations • ECCV 2020 • Frank Verbiest, Marc Proesmans, Luc van Gool
Instead of using a generalized camera approach, we propose a novel approach to jointly optimize a traditional camera model, and a mathematical representation of the windshield’s surface.
1 code implementation • 23 Mar 2023 • Guofeng Mei, Hao Tang, Xiaoshui Huang, Weijie Wang, Juan Liu, Jian Zhang, Luc van Gool, Qiang Wu
Deep point cloud registration methods face challenges to partial overlaps and rely on labeled data.
no code implementations • 22 Mar 2023 • Mohamad Shahbazi, Evangelos Ntavelis, Alessio Tonioni, Edo Collins, Danda Pani Paudel, Martin Danelljan, Luc van Gool
Pose-conditioned convolutional generative models struggle with high-quality 3D-consistent image generation from single-view datasets, due to their lack of sufficient 3D priors.
no code implementations • 21 Mar 2023 • Kamil Adamczewski, Christos Sakaridis, Vaishakh Patil, Luc van Gool
Lidar is a vital sensor for estimating the depth of a scene.
no code implementations • 16 Mar 2023 • Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, Luc van Gool
Since the iterations are few, our DiffIR can adopt a joint optimization of CPEN$_{S2}$, DIRformer, and denoising network, which can further reduce the estimation error influence.
no code implementations • 15 Mar 2023 • Zixiang Zhao, Jiangshe Zhang, Xiang Gu, Chengli Tan, Shuang Xu, Yulun Zhang, Radu Timofte, Luc van Gool
Then, the extracted features are mapped to the spherical space to complete the separation of private features and the alignment of shared features.
no code implementations • 14 Mar 2023 • Hao Tang, Zhenyu Zhang, Humphrey Shi, Bo Li, Ling Shao, Nicu Sebe, Radu Timofte, Luc van Gool
We present a novel graph Transformer generative adversarial network (GTGAN) to learn effective graph node relations in an end-to-end fashion for the challenging graph-constrained house generation task.
no code implementations • 13 Mar 2023 • Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc van Gool
To leverage strong generative priors and address challenges such as unstable training and lack of interpretability for GAN-based generative methods, we propose a novel fusion algorithm based on the denoising diffusion probabilistic model (DDPM).
no code implementations • 9 Mar 2023 • David Bruggemann, Christos Sakaridis, Tim Brödermann, Luc van Gool
We investigate normal-to-adverse condition model adaptation for semantic segmentation, whereby image-level correspondences are available in the target domain.
1 code implementation • 7 Mar 2023 • Nick Bührer, Zhejun Zhang, Alexander Liniger, Fisher Yu, Luc van Gool
To this end, we propose a safe model-free RL algorithm with a novel multiplicative value function consisting of a safety critic and a reward critic.
1 code implementation • 7 Mar 2023 • Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc van Gool
We present TrafficBots, a multi-agent policy built upon motion prediction and end-to-end driving, and based on TrafficBots we obtain a world model tailored for the planning module of autonomous vehicles.
1 code implementation • 1 Mar 2023 • Yawei Li, Yuchen Fan, Xiaoyu Xiang, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc van Gool
The aim of this paper is to propose a mechanism to efficiently and explicitly model image hierarchies in the global, regional, and local range for image restoration.
1 code implementation • 13 Feb 2023 • Ce Liu, Suryansh Kumar, Shuhang Gu, Radu Timofte, Luc van Gool
While state-of-the-art deep neural network methods for SIDP learn the scene depth from images in a supervised setting, they often overlook the invaluable invariances and priors in the rigid scene space, such as the regularity of the scene.
no code implementations • 22 Jan 2023 • Razvan-George Pasca, Alexey Gavryushin, Yen-Ling Kuo, Luc van Gool, Otmar Hilliges, Xi Wang
This action context together with the next video frame is processed by the multimodal fusion module to forecast the next object interaction.
no code implementations • 12 Jan 2023 • Lei Sun, Christos Sakaridis, Jingyun Liang, Peng Sun, JieZhang Cao, Kai Zhang, Qi Jiang, Kaiwei Wang, Luc van Gool
The performance of video frame interpolation is inherently correlated with the ability to handle motion in the input scene.
no code implementations • 22 Dec 2022 • Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc van Gool, Alina Kuznetsova
TaMOs achieves a 4x faster run-time in case of 10 concurrent objects compared to tracking each object independently and outperforms existing single object trackers on our new benchmark.
no code implementations • 14 Dec 2022 • Rui Gong, Qin Wang, Dengxin Dai, Luc van Gool
Thus, we aim to relieve this need on a large number of real data, and explore the one-shot unsupervised sim-to-real domain adaptation (OSUDA) and generalization (OSDG) problem, where only one real-world data sample is available.
1 code implementation • 10 Dec 2022 • Bowen Yin, Xuying Zhang, Qibin Hou, Bo-Yuan Sun, Deng-Ping Fan, Luc van Gool
How to identify and segment camouflaged objects from the background is challenging.
no code implementations • 10 Dec 2022 • Zongwei Wu, Danda Pani Paudel, Deng-Ping Fan, Jingjing Wang, Shuo Wang, Cédric Demonceaux, Radu Timofte, Luc van Gool
In this work, we adapt such depth inference models for object segmentation using the objects' ``pop-out'' prior in 3D.
no code implementations • 8 Dec 2022 • JieZhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool
The effectiveness of the method is also demonstrated on the real-world SR setting.
no code implementations • 5 Dec 2022 • Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, Muhammad Zeshan Afzal, Didier Stricker, Luc van Gool, Federico Tombari
Our proposed model, I2MVFormer, learns multi-view semantic embeddings for zero-shot image classification with these class views.
1 code implementation • 2 Dec 2022 • Lukas Hoyer, Dengxin Dai, Haoran Wang, Luc van Gool
MIC significantly improves the state-of-the-art performance across the different recognition tasks for synthetic-to-real, day-to-nighttime, and clear-to-adverse-weather UDA.
no code implementations • 2 Dec 2022 • Nikola Popovic, Danda Pani Paudel, Luc van Gool
Such representations are known to benefit from additional geometric and semantic supervision.
1 code implementation • 30 Nov 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool
It consists of a knowledge distillation based implicit degradation estimator network (KD-IDE) and an efficient SR network.
no code implementations • 26 Nov 2022 • Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi Lin, Radu Timofte, Luc van Gool
In the second stage, the LT-based global fusion and INN-based local fusion layers output the fused image.
no code implementations • 22 Nov 2022 • Shengqu Cai, Eric Ryan Chan, Songyou Peng, Mohamad Shahbazi, Anton Obukhov, Luc van Gool, Gordon Wetzstein
Scene extrapolation -- the idea of generating novel views by flying into a given image -- is a promising, yet challenging task.
Ranked #1 on
Perpetual View Generation
on LHQ
no code implementations • 14 Nov 2022 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
On the one hand, the proposed method learns to segment these planar hulls from the labeled data.
1 code implementation • 13 Nov 2022 • Ren Yang, Radu Timofte, Luc van Gool
In this paper, we propose an Advanced Learned Video Compression (ALVC) approach with the in-loop frame prediction module, which is able to effectively predict the target frame from the previously compressed frames, without consuming any bit-rate.
no code implementations • 8 Nov 2022 • Andrey Ignatov, Anastasia Sycheva, Radu Timofte, Yu Tseng, Yu-Syuan Xu, Po-Hsiang Yu, Cheng-Ming Chiang, Hsien-Kai Kuo, Min-Hung Chen, Chia-Ming Cheng, Luc van Gool
While neural networks-based photo processing solutions can provide a better image quality compared to the traditional ISP systems, their application to mobile devices is still very limited due to their very high computational complexity.
1 code implementation • 8 Nov 2022 • Andrey Ignatov, Grigory Malivenko, Radu Timofte, Yu Tseng, Yu-Syuan Xu, Po-Hsiang Yu, Cheng-Ming Chiang, Hsien-Kai Kuo, Min-Hung Chen, Chia-Ming Cheng, Luc van Gool
The increased importance of mobile photography created a need for fast and performant RAW image processing pipelines capable of producing good visual results in spite of the mobile camera sensor limitations.
1 code implementation • 30 Oct 2022 • Hanqing Wang, Wei Liang, Luc van Gool, Wenguan Wang
With the emergence of varied visual navigation tasks (e. g, image-/object-/audio-goal and vision-language navigation) that specify the target in different ways, the community has made appealing advances in training specialized agents capable of handling individual navigation tasks well.
no code implementations • 28 Oct 2022 • Nicola Marinello, Marc Proesmans, Luc van Gool
We start from an off-the-shelf 3D object detector, and apply a tracking mechanism where objects are matched by an affinity score computed on local object feature embeddings and motion descriptors.
1 code implementation • 27 Oct 2022 • Ge-Peng Ji, Mingcheng Zhuge, Dehong Gao, Deng-Ping Fan, Christos Sakaridis, Luc van Gool
We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation.
no code implementations • 20 Oct 2022 • Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal
CAPE learns to identify this structure and propagates knowledge between them to learn class embedding for all seen and unseen compositions.
no code implementations • 14 Oct 2022 • Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc van Gool
The proposed approach in this paper exploits the benefit of uncertainty modeling in a deep neural network for a reliable fusion of photometric stereo (PS) and multi-view stereo (MVS) network predictions.
no code implementations • 13 Oct 2022 • Menelaos Kanakis, Thomas E. Huang, David Bruggemann, Fisher Yu, Luc van Gool
In this paper, we find that jointly training a dense prediction (target) task with a self-supervised (auxiliary) task can consistently improve the performance of the target task, while eliminating the need for labeling auxiliary tasks.
Ranked #64 on
Semantic Segmentation
on NYU Depth v2
1 code implementation • 10 Oct 2022 • Yitong Xia, Hao Tang, Radu Timofte, Luc van Gool
NeRFmm is the Neural Radiance Fields (NeRF) that deal with Joint Optimization tasks, i. e., reconstructing real-world scenes and registering camera parameters simultaneously.
no code implementations • 9 Oct 2022 • Nishant Jain, Suryansh Kumar, Luc van Gool
Although recently proposed Mip-NeRF could handle multi-scale imaging problems with NeRF, it cannot handle camera pose estimation error.
1 code implementation • 2 Oct 2022 • Bin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc van Gool
In this study, we reconsider components in binary convolution, such as residual connection, BatchNorm, activation function, and structure, for IR tasks.
1 code implementation • 30 Sep 2022 • Anton Obukhov, Mikhail Usvyatsov, Christos Sakaridis, Konrad Schindler, Luc van Gool
Learning neural fields has been an active topic in deep learning research, focusing, among other issues, on finding more compact and easy-to-fit representations.
1 code implementation • 28 Sep 2022 • Yifan Lu, Gurkirt Singh, Suman Saha, Luc van Gool
We propose a novel domain adaptive action detection approach and a new adaptation protocol that leverages the recent advancements in image-level unsupervised domain adaptation (UDA) techniques and handle vagaries of instance-level video data.
no code implementations • 21 Sep 2022 • Muhammad Ferjad Naeem, Yongqin Xian, Luc van Gool, Federico Tombari
In order to distill discriminative visual words from noisy documents, we introduce a new cross-modal attention module that learns fine-grained interactions between image patches and document words.
1 code implementation • 6 Sep 2022 • Gurkirt Singh, Vasileios Choutas, Suman Saha, Fisher Yu, Luc van Gool
Current methods for spatiotemporal action tube detection often extend a bounding box proposal at a given keyframe into a 3D temporal cuboid and pool features from nearby frames.
no code implementations • 25 Aug 2022 • JieZhang Cao, Qin Wang, Jingyun Liang, Yulun Zhang, Kai Zhang, Radu Timofte, Luc van Gool
To this end, we propose a new multi-scale refined optical flow-guided video denoising method, which is more robust to different noise levels.
Ranked #1 on
Video Denoising
on VideoLQ
no code implementations • 18 Aug 2022 • Janis Postels, Martin Danelljan, Luc van Gool, Federico Tombari
In contrast to prior work, we approach this problem by generating samples from the original data distribution given full knowledge about the perturbed distribution and the noise model.
1 code implementation • 14 Aug 2022 • Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Akshay Dudhane, Martin Danelljan, Hisham Cholakkal, Salman Khan, Luc van Gool, Fahad Shahbaz Khan
While being greatly benefiting to the tracking research, existing benchmarks do not pose the same difficulty as before with recent trackers achieving higher performance mainly due to (i) the introduction of more sophisticated transformers-based methods and (ii) the lack of diverse scenarios with adverse visibility such as, severe weather conditions, camouflage and imaging effects.
1 code implementation • 25 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc van Gool
Reference-based image super-resolution (RefSR) aims to exploit auxiliary reference (Ref) images to super-resolve low-resolution (LR) images.
1 code implementation • 21 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc van Gool
These issues can be alleviated by a cascade of three separate sub-tasks, including video deblurring, frame interpolation, and super-resolution, which, however, would fail to capture the spatial and temporal correlations among video sequences.
1 code implementation • 21 Jul 2022 • Guolei Sun, Yun Liu, Hao Tang, Ajad Chhatkuli, Le Zhang, Luc van Gool
The essence of video semantic segmentation (VSS) is how to leverage temporal information for prediction.
1 code implementation • 14 Jul 2022 • David Bruggemann, Christos Sakaridis, Prune Truong, Luc van Gool
Due to the scarcity of dense pixel-level semantic annotations for images recorded in adverse visual conditions, there has been a keen interest in unsupervised domain adaptation (UDA) for the semantic segmentation of such images.
Ranked #1 on
Semantic Segmentation
on Nighttime Driving
no code implementations • 13 Jul 2022 • Suryansh Kumar, Luc van Gool
Besides that, the paper provides insights into the NRSfM factorization -- both in terms of shape and motion -- and is the first approach to show the benefit of single rotation averaging for NRSfM.
1 code implementation • 5 Jul 2022 • Jialun Pei, Tianyang Cheng, Deng-Ping Fan, He Tang, Chuanbo Chen, Luc van Gool
We present OSFormer, the first one-stage transformer framework for camouflaged instance segmentation (CIS).
1 code implementation • 3 Jul 2022 • Kevin Ta, David Bruggemann, Tim Brödermann, Christos Sakaridis, Luc van Gool
As neuromorphic technology is maturing, its application to robotics and autonomous vehicle systems has become an area of active research.
1 code implementation • 30 Jun 2022 • Tim Broedermann, Christos Sakaridis, Dengxin Dai, Luc van Gool
Besides standard cameras, autonomous vehicles typically include multiple additional sensors, such as lidars and radars, which help acquire richer information for perceiving the content of the driving scene.
Ranked #1 on
2D object detection
on Clear Weather
1 code implementation • 29 Jun 2022 • Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc van Gool, Radu Timofte
Generative models have emerged as an essential building block for many image synthesis and editing tasks.
no code implementations • CVPR 2022 • Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc van Gool, Bernt Schiele, Federico Tombari, Fisher Yu
Adapting to a continuously evolving environment is a safety-critical challenge inevitably faced by all autonomous driving systems.
no code implementations • 15 Jun 2022 • Bin Xia, Jingwen He, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Luc van Gool
In SSL, we design pruning schemes for several key components in VSR models, including residual blocks, recurrent networks, and upsampling networks.
no code implementations • 13 Jun 2022 • Wouter Van Gansbeke, Simon Vandenhende, Luc van Gool
This paper presents MaskDistill: a novel framework for unsupervised semantic segmentation based on three key ideas.
Ranked #2 on
Unsupervised Semantic Segmentation
on PASCAL VOC 2012 val
(using extra training data)
1 code implementation • 5 Jun 2022 • Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, JieZhang Cao, Kai Zhang, Radu Timofte, Luc van Gool
Specifically, RVRT divides the video into multiple clips and uses the previously inferred clip feature to estimate the subsequent clip feature.
Ranked #1 on
Deblurring
on DVD
no code implementations • 3 Jun 2022 • Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc van Gool
It has since become a trend to use these five characteristics as a sufficient test, to determine whether or not gradient obfuscation is the main source of robustness.
1 code implementation • 30 May 2022 • Peng Zheng, Huazhu Fu, Deng-Ping Fan, Qi Fan, Jie Qin, Yu-Wing Tai, Chi-Keung Tang, Luc van Gool
In this paper, we present a novel end-to-end group collaborative learning network, termed GCoNet+, which can effectively and efficiently (250 fps) identify co-salient objects in natural scenes.
Ranked #1 on
Co-Salient Object Detection
on CoSal2015
1 code implementation • 25 May 2022 • Ge-Peng Ji, Deng-Ping Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, Luc van Gool
This paper introduces DGNet, a novel deep framework that exploits object gradient supervision for camouflaged object detection (COD).
1 code implementation • 20 May 2022 • Jing Lin, Xiaowan Hu, Yuanhao Cai, Haoqian Wang, Youliang Yan, Xueyi Zou, Yulun Zhang, Luc van Gool
On the other hand, we equip the sequence-to-sequence model with an unsupervised optical flow estimator to maximize its potential.
Ranked #2 on
Video Enhancement
on MFQE v2
1 code implementation • 20 May 2022 • Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool
In coded aperture snapshot spectral compressive imaging (CASSI) systems, hyperspectral image (HSI) reconstruction methods are employed to recover the spatial-spectral signal from a compressed measurement.
1 code implementation • 11 May 2022 • Chuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc van Gool
Within the proposed benchmark, we explore some commonly known essentials of standard continual learning.
1 code implementation • CVPR 2022 • Yawei Li, Kamil Adamczewski, Wen Li, Shuhang Gu, Radu Timofte, Luc van Gool
The proposed approach provides a new way to compare different methods, namely how well they behave compared with random pruning.
2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang
The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.
1 code implementation • 27 Apr 2022 • Lukas Hoyer, Dengxin Dai, Luc van Gool
Therefore, we propose HRDA, a multi-resolution training approach for UDA, that combines the strengths of small high-resolution crops to preserve fine segmentation details and large low-resolution crops to capture long-range context dependencies with a learned scale attention, while maintaining a manageable GPU memory footprint.
1 code implementation • 17 Apr 2022 • Yuanhao Cai, Jing Lin, Zudi Lin, Haoqian Wang, Yulun Zhang, Hanspeter Pfister, Radu Timofte, Luc van Gool
Existing leading methods for spectral reconstruction (SR) focus on designing deeper or wider convolutional neural networks (CNNs) to learn the end-to-end mapping from the RGB image to its hyperspectral image (HSI).
1 code implementation • 13 Apr 2022 • Edoardo Mello Rella, Ajad Chhatkuli, Ender Konukoglu, Luc van Gool
Implicit fields have been very effective to represent and learn 3D shapes accurately.
1 code implementation • CVPR 2022 • Guolei Sun, Yun Liu, Henghui Ding, Thomas Probst, Luc van Gool
To address this problem, we propose a Coarse-to-Fine Feature Mining (CFFM) technique to learn a unified presentation of static contexts and motional contexts.
1 code implementation • 7 Apr 2022 • Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc van Gool
Multi-sensor depth fusion is able to substantially improve the robustness and accuracy of 3D reconstruction methods, but existing techniques are not robust enough to handle sensors which operate with diverse value ranges as well as noise and outlier statistics.
no code implementations • 5 Apr 2022 • Jose L. Vazquez, Alexander Liniger, Wilko Schwarting, Daniela Rus, Luc van Gool
Fundamental to the success of our method is the design of a novel multi-agent policy network that can steer a vehicle given the state of the surrounding agents and the map information.
1 code implementation • CVPR 2022 • Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Radu Timofte, Martin Danelljan, Luc van Gool
Positional encodings have enabled recent works to train a single adversarial network that can generate images of different scales.
1 code implementation • CVPR 2022 • Vaishakh Patil, Christos Sakaridis, Alexander Liniger, Luc van Gool
We focus on the supervised setup, in which ground-truth depth is available only at training time.
Ranked #4 on
Depth Estimation
on NYU-Depth V2
no code implementations • 4 Apr 2022 • Liqian Ma, Stamatios Georgoulis, Xu Jia, Luc van Gool
The ability to make educated predictions about their surroundings, and associate them with certain confidence, is important for intelligent systems, like autonomous vehicles and robots.
no code implementations • 4 Apr 2022 • Liqian Ma, Lingjie Liu, Christian Theobalt, Luc van Gool
In addition, DDP is computationally more efficient than previous dense pose estimation methods, and it reduces jitters when applied to a video sequence, which is a problem plaguing the previous methods.
1 code implementation • CVPR 2022 • Hanqing Wang, Wei Liang, Jianbing Shen, Luc van Gool, Wenguan Wang
Since the rise of vision-language navigation (VLN), great progress has been made in instruction following -- building a follower to navigate environments under the guidance of instructions.
1 code implementation • CVPR 2022 • Martin Hahner, Christos Sakaridis, Mario Bijelic, Felix Heide, Fisher Yu, Dengxin Dai, Luc van Gool
Due to the difficulty of collecting and annotating training data in this setting, we propose a physically based method to simulate the effect of snowfall on real clear-weather LiDAR point clouds.
Ranked #1 on
3D Object Detection
on Heavy Snowfall
1 code implementation • CVPR 2022 • Tianfei Zhou, Wenguan Wang, Ender Konukoglu, Luc van Gool
Prevalent semantic segmentation solutions, despite their different network designs (FCN based or attention based) and mask decoding strategies (parametric softmax based or pixel-query based), can be placed in one category, by considering the softmax weights or query vectors as learnable class prototypes.
4 code implementations • 27 Mar 2022 • Ge-Peng Ji, Guobao Xiao, Yu-Cheng Chou, Deng-Ping Fan, Kai Zhao, Geng Chen, Luc van Gool
We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era.
Ranked #1 on
Video Polyp Segmentation
on SUN-SEG-Easy (Unseen)
1 code implementation • CVPR 2022 • Qin Wang, Olga Fink, Luc van Gool, Dengxin Dai
However, real-world machine perception systems are running in non-stationary and continually changing environments where the target domain distribution can change over time.
no code implementations • 25 Mar 2022 • Ritika Chakraborty, Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc van Gool
However, multi-conditional image generation is a very challenging problem due to the heterogeneity and the sparsity of the (in practice) available conditioning labels.
1 code implementation • 24 Mar 2022 • Kai Zhang, Yawei Li, Jingyun Liang, JieZhang Cao, Yulun Zhang, Hao Tang, Radu Timofte, Luc van Gool
While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved.
1 code implementation • 21 Mar 2022 • Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc van Gool
We infer a bounding box from the segmentation mask, validate our tracker on challenging tracking datasets and achieve the new state of the art on LaSOT with a success AUC score of 69. 7%.
1 code implementation • CVPR 2022 • Christoph Mayer, Martin Danelljan, Goutam Bhat, Matthieu Paul, Danda Pani Paudel, Fisher Yu, Luc van Gool
Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function.
Ranked #1 on
Visual Object Tracking
on LaSOT
(IS metric)
no code implementations • 20 Mar 2022 • Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc van Gool
We propose a trainable Image Signal Processing (ISP) framework that produces DSLR quality images given RAW images captured by a smartphone.
1 code implementation • ICLR 2022 • Edoardo Mello Rella, Ajad Chhatkuli, Yun Liu, Ender Konukoglu, Luc van Gool
One of the key problems in boundary detection is the label representation, which typically leads to class imbalance and, as a consequence, to thick boundaries that require non-differential post-processing steps to be thinned.
1 code implementation • CVPR 2022 • Ozan Unal, Dengxin Dai, Luc van Gool
Densely annotating LiDAR point clouds remains too expensive and time-consuming to keep up with the ever growing volume of data.
Ranked #1 on
3D Semantic Segmentation
on ScribbleKITTI
no code implementations • 13 Mar 2022 • Feiyu Wang, Qin Wang, Wen Li, Dong Xu, Luc van Gool
Benefited from this new perspective, we first propose a new deep semi-supervised learning framework called Semi-supervised Learning by Empirical Distribution Alignment (SLEDA), in which existing technologies from the domain adaptation community can be readily used to address the semi-supervised learning problem through reducing the empirical distribution distance between labeled and unlabeled data.
1 code implementation • 9 Mar 2022 • Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
Many algorithms have been developed to solve the inverse problem of coded aperture snapshot spectral imaging (CASSI), i. e., recovering the 3D hyperspectral images (HSIs) from a 2D compressive measurement.
1 code implementation • CVPR 2022 • Prune Truong, Martin Danelljan, Fisher Yu, Luc van Gool
We propose Probabilistic Warp Consistency, a weakly-supervised learning objective for semantic matching.
no code implementations • 7 Mar 2022 • Menelaos Kanakis, Simon Maurer, Matteo Spallanzani, Ajad Chhatkuli, Luc van Gool
Efficient detection and description of geometric regions in images is a prerequisite in visual systems for localization and mapping.
no code implementations • 7 Mar 2022 • Abhishek Jha, Badri N. Patro, Luc van Gool, Tinne Tuytelaars
In this paper, we propose a novel regularization for VQA models, Constrained Optimization using Barlow's theory (COB), that improves the information content of the joint space by minimizing the redundancy.
2 code implementations • CVPR 2022 • Xiaowan Hu, Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
On the one hand, the proposed HR spatial-spectral attention module with its efficient feature fusion provides continuous and fine pixel-level features.
2 code implementations • 26 Feb 2022 • Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc van Gool
We propose a pipeline to generate Neural Radiance Fields~(NeRF) of an object or a scene of a specific class, conditioned on a single input image.
no code implementations • CVPR 2022 • Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc van Gool
At each pixel, our approach either selects or discards deep-PS and deep-MVS network prediction depending on the prediction uncertainty measure.
no code implementations • CVPR 2022 • Jan-Nico Zaech, Alexander Liniger, Martin Danelljan, Dengxin Dai, Luc van Gool
Multi-Object Tracking (MOT) is most often approached in the tracking-by-detection paradigm, where object detections are associated through time.
no code implementations • 3 Feb 2022 • Dario Fuoli, Martin Danelljan, Radu Timofte, Luc van Gool
Our DAP aligns and integrates information from the recurrent state into the current frame prediction.
1 code implementation • 28 Jan 2022 • Jingyun Liang, JieZhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc van Gool
Besides, parallel warping is used to further fuse information from neighboring frames by parallel feature warping.
Ranked #1 on
Deblurring
on BASED
2 code implementations • 27 Jan 2022 • Zudi Lin, Prateek Garg, Atmadeep Banerjee, Salma Abdel Magid, Deqing Sun, Yulun Zhang, Luc van Gool, Donglai Wei, Hanspeter Pfister
Image super-resolution (SR) is a fast-moving field with novel architectures attracting the spotlight.
2 code implementations • CVPR 2022 • Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc van Gool
In this work, we propose RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks.
1 code implementation • ICLR 2022 • Mohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc van Gool
On the contrary, we observe that class-conditioning causes mode collapse in limited data settings, where unconditional learning leads to satisfactory generative ability.
1 code implementation • 11 Jan 2022 • Niclas Vödisch, Ozan Unal, Ke Li, Luc van Gool, Dengxin Dai
In this work, we take a new route to learn to optimize the LiDAR beam configuration for a given application.
1 code implementation • 6 Jan 2022 • Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool
Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring.
Ranked #1 on
Deblurring
on DVD
no code implementations • CVPR 2022 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
Specifically, for this study, we investigate binaural sounds and image data in isolation.
1 code implementation • CVPR 2022 • Shengqu Cai, Anton Obukhov, Dengxin Dai, Luc van Gool
We propose a pipeline to generate Neural Radiance Fields (NeRF) of an object or a scene of a specific class, conditioned on a single input image.
3 code implementations • 31 Dec 2021 • Deng-Ping Fan, Ziling Huang, Peng Zheng, Hong Liu, Xuebin Qin, Luc van Gool
Besides, we elaborate comprehensive experiments on the existing 19 cutting-edge models.
no code implementations • 30 Dec 2021 • Nikola Popovic, Danda Pani Paudel, Thomas Probst, Luc van Gool
We use linear layers with token-consistent stochastic parameters inside the multilayer perceptron blocks, without altering the architecture of the transformer.
no code implementations • 19 Dec 2021 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
We use a Transformer-based architecture to detect the keypoints, as well as to summarize the visual context of the image.
1 code implementation • CVPR 2022 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
We represent the road topology using a set of directed lane curves and their interactions, which are captured using their intersection points.
2 code implementations • 17 Dec 2021 • Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc van Gool
E. T. Track, our visual tracker that incorporates Exemplar Transformer modules, runs at 47 FPS on a CPU.
no code implementations • 8 Dec 2021 • Yannick Strümpler, Janis Postels, Ren Yang, Luc van Gool, Federico Tombari
Recently Implicit Neural Representations (INRs) gained attention as a novel and effective representation for various data types.
1 code implementation • 30 Nov 2021 • Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc van Gool
Traditional frame-based cameras inevitably suffer from motion blur due to long exposure times.
Ranked #1 on
Deblurring
on GoPro
(using extra training data)
no code implementations • 29 Nov 2021 • Muhammad Ferjad Naeem, Evin Pınar Örnek, Yongqin Xian, Luc van Gool, Federico Tombari
Parts represent a basic unit of geometric and semantic similarity across different objects.
3 code implementations • CVPR 2022 • Lukas Hoyer, Dengxin Dai, Luc van Gool
It improves the state of the art by 10. 8 mIoU for GTA-to-Cityscapes and 5. 4 mIoU for Synthia-to-Cityscapes and enables learning even difficult classes such as train, bus, and truck well.
Ranked #5 on
Domain Adaptation
on Cityscapes to ACDC
1 code implementation • CVPR 2022 • Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc van Gool, Errui Ding
We propose a novel framework, i. e., Predict, Prevent, and Evaluate (PPE), for disentangled text-driven image manipulation that requires little manual annotation while being applicable to a wide variety of manipulations.
1 code implementation • CVPR 2022 • Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc van Gool
Estimating 3D human poses from monocular videos is a challenging task due to depth ambiguity and self-occlusion.
Ranked #8 on
3D Human Pose Estimation
on MPI-INF-3DHP
1 code implementation • 19 Nov 2021 • Guanglei Yang, Hao Tang, Humphrey Shi, Mingli Ding, Nicu Sebe, Radu Timofte, Luc van Gool, Elisa Ricci
The global alignment network aims to transfer the input image from the source domain to the target domain.
2 code implementations • CVPR 2022 • Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc van Gool
The HSI representations are highly similar and correlated across the spectral dimension.
no code implementations • 5 Nov 2021 • Andreas Lugmayr, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte
Super-resolution is an ill-posed problem, where a ground-truth high-resolution image represents only one possibility in the space of plausible solutions.
no code implementations • 11 Oct 2021 • Berk Kaya, Suryansh Kumar, Francesco Sarno, Vittorio Ferrari, Luc van Gool
Our method performs neural rendering of multi-view images while utilizing surface normals estimated by a deep photometric stereo network.
no code implementations • 11 Oct 2021 • Francesco Sarno, Suryansh Kumar, Berk Kaya, Zhiwu Huang, Vittorio Ferrari, Luc van Gool
We then perform a continuous relaxation of this search space and present a gradient-based optimization strategy to find an efficient light calibration and normal estimation network.
1 code implementation • ICCV 2021 • Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc van Gool
In this work, we study the problem of extracting a directed graph representing the local road network in BEV coordinates, from a single onboard camera image.
no code implementations • 1 Oct 2021 • Jonas Heylen, Mark De Wolf, Bruno Dawagne, Marc Proesmans, Luc van Gool, Wim Abbeloos, Hazem Abdelkawy, Daniel Olmeda Reino
We surpass camera independent methods on the challenging KITTI3D benchmark and show the key benefits compared to camera dependent methods.
1 code implementation • 28 Sep 2021 • Prune Truong, Martin Danelljan, Radu Timofte, Luc van Gool
In order to apply dense methods to real-world applications, such as pose estimation, image manipulation, or 3D reconstruction, it is therefore crucial to estimate the confidence of the predicted matches.
no code implementations • 16 Sep 2021 • Yu-Hui Huang, Marc Proesmans, Luc van Gool
Zero padding is widely used in convolutional neural networks to prevent the size of feature maps diminishing too fast.
1 code implementation • 10 Sep 2021 • Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc van Gool
In many real-world settings, the target domain task requires a different taxonomy than the one imposed by the source domain.
3 code implementations • 7 Sep 2021 • Ren Yang, Radu Timofte, Luc van Gool
This paper proposes a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional GAN.
no code implementations • 6 Sep 2021 • Dengxin Dai, Arun Balajee Vasudevan, Jiri Matas, Luc van Gool
Humans can robustly recognize and localize objects by using visual and/or auditory cues.
1 code implementation • 28 Aug 2021 • Lukas Hoyer, Dengxin Dai, Qin Wang, Yuhua Chen, Luc van Gool
Training deep networks for semantic segmentation requires large amounts of labeled training data, which presents a major challenge in practice, as labeling segmentation masks is a highly labor-intensive process.
1 code implementation • 25 Aug 2021 • Angela Castillo, María Escobar, Juan C. Pérez, Andrés Romero, Radu Timofte, Luc van Gool, Pablo Arbeláez
Instead of learning a dataset-specific degradation, we employ adversarial attacks to create difficult examples that target the model's weaknesses.
9 code implementations • 23 Aug 2021 • Jingyun Liang, JieZhang Cao, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte
In particular, the deep feature extraction module is composed of several residual Swin Transformer blocks (RSTB), each of which has several Swin Transformer layers together with a residual connection.
Ranked #2 on
Color Image Denoising
on urban100 sigma15
2 code implementations • ICCV 2021 • Goutam Bhat, Martin Danelljan, Fisher Yu, Luc van Gool, Radu Timofte
The deep reparametrization allows us to directly model the image formation process in the latent space, and to integrate learned image priors into the prediction.
Ranked #4 on
Burst Image Super-Resolution
on BurstSR
2 code implementations • ICCV 2021 • Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc van Gool
Our end-to-end agent achieves a 78% success rate while generalizing to a new town and new weather on the NoCrash-dense benchmark and state-of-the-art performance on the challenging public routes of the CARLA LeaderBoard.
no code implementations • 12 Aug 2021 • Edoardo Mello Rella, Jan-Nico Zaech, Alexander Liniger, Luc van Gool
Forecasting the future behavior of all traffic agents in the vicinity is a key task to achieve safe and reliable autonomous driving systems.
1 code implementation • ICCV 2021 • Martin Hahner, Christos Sakaridis, Dengxin Dai, Luc van Gool
2) Through extensive experiments with several state-of-the-art detection approaches, we show that our fog simulation can be leveraged to significantly improve the performance for 3D object detection in the presence of fog.
Ranked #1 on
3D Object Detection
on Dense Fog
1 code implementation • ICCV 2021 • Jingyun Liang, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte
Extensive experiments on synthetic and real images show that the proposed MANet not only performs favorably for both spatially variant and invariant kernel estimation, but also leads to state-of-the-art blind SR performance when combined with non-blind SR methods.
1 code implementation • 11 Aug 2021 • Davide Menini, Suryansh Kumar, Martin R. Oswald, Erik Sandstrom, Cristian Sminchisescu, Luc van Gool
This paper presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label.
1 code implementation • ICCV 2021 • Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc van Gool, Radu Timofte
More specifically, HCFlow learns a bijective mapping between HR and LR image pairs by modelling the distribution of the LR image and the rest high-frequency component simultaneously.
no code implementations • 4 Aug 2021 • Guolei Sun, Yun Liu, Jingyun Liang, Luc van Gool
Due to the fact that fully supervised semantic segmentation methods require sufficient fully-labeled data to work well and can not generalize to unseen classes, few-shot segmentation has attracted lots of research attention.
1 code implementation • 2 Jul 2021 • Tianfei Zhou, Fatih Porikli, David Crandall, Luc van Gool, Wenguan Wang
Video segmentation -- partitioning video frames into multiple segments or objects -- plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing.
1 code implementation • 1 Jul 2021 • Janis Postels, Mattia Segu, Tao Sun, Luca Sieber, Luc van Gool, Fisher Yu, Federico Tombari
We find that, while DUMs scale to realistic vision tasks and perform well on OOD detection, the practicality of current methods is undermined by poor calibration under distributional shifts.
no code implementations • CVPR 2021 • Stefano d'Apolito, Danda Pani Paudel, Zhiwu Huang, Andres Romero, Luc van Gool
On the other hand, learning from inexpensive and intuitive basic categorical emotion labels leads to limited emotion variability.
1 code implementation • 12 Jun 2021 • JieZhang Cao, Yawei Li, Kai Zhang, Jingyun Liang, Luc van Gool
Specifically, to tackle the first issue, we present a spatial-temporal convolutional self-attention layer with a theoretical understanding to exploit the locality information.
2 code implementations • NeurIPS 2021 • Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc van Gool
Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection.
no code implementations • CVPR 2022 • Rhea Sanjay Sukthanker, Zhiwu Huang, Suryansh Kumar, Radu Timofte, Luc van Gool
The key idea is to exploit a masked scheme of these two attentions to learn long-range data dependencies in the context of generative flows.
no code implementations • 6 Jun 2021 • Janis Postels, Mengya Liu, Riccardo Spezialetti, Luc van Gool, Federico Tombari
Recently normalizing flows (NFs) have demonstrated state-of-the-art performance on modeling 3D point clouds while allowing sampling with arbitrary resolution at inference time.
2 code implementations • 6 Jun 2021 • Yun Liu, Yu-Huan Wu, Guolei Sun, Le Zhang, Ajad Chhatkuli, Luc van Gool
This paper tackles the low-efficiency flaw of the vision transformer caused by the high computational/space complexity in Multi-Head Self-Attention (MHSA).
no code implementations • ICCV 2021 • Dario Fuoli, Luc van Gool, Radu Timofte
As large models are often not practical in real-world applications, we investigate and propose novel loss functions, to enable SR with high perceptual quality from much more efficient models.
no code implementations • 23 May 2021 • Guolei Sun, Yun Liu, Thomas Probst, Danda Pani Paudel, Nikola Popovic, Luc van Gool
This indicates that global scene context is essential, despite the seemingly bottom-up nature of the problem.
no code implementations • 18 May 2021 • Ankush Panwar, Pratyush Singh, Suman Saha, Danda Pani Paudel, Luc van Gool
The proposed method successfully adapts to the compound target domain consisting multiple new spoof types.
1 code implementation • CVPR 2021 • Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc van Gool
Specifically, we show that: (1) our approach improves performance on all tasks when they are complementary and mutually dependent; (2) the CTRL helps to improve both semantic segmentation and depth estimation tasks performance in the challenging UDA setting; (3) the proposed ISL training scheme further improves the semantic segmentation performance.