Search Results for author: Luc van Gool

Found 540 papers, 271 papers with code

Seeking the Strongest Rigid Detector

no code implementations CVPR 2013 Rodrigo Benenson, Markus Mathias, Tinne Tuytelaars, Luc van Gool

The current state of the art solutions for object detection describe each class by a set of models trained on discovered sub-classes (so called "components"), with each model itself composed of collections of interrelated parts (deformable models).

feature selection object-detection +1

Bayesian Grammar Learning for Inverse Procedural Modeling

no code implementations CVPR 2013 Andelo Martinovic, Luc van Gool

Given a set of labeled positive examples, we induce a grammar which can be sampled to create novel instances of the same building style.

Fast Energy Minimization Using Learned State Filters

no code implementations CVPR 2013 Matthieu Guillaumin, Luc van Gool, Vittorio Ferrari

However, when the graph is fully connected and the pairwise potentials are arbitrary, the complexity of even approximate minimization algorithms such as TRW-S grows quadratically both in the number of nodes and in the number of states a node can take.

Human Pose Estimation Using Body Parts Dependent Joint Regressors

no code implementations CVPR 2013 Matthias Dantone, Juergen Gall, Christian Leistner, Luc van Gool

The second layer takes the estimated class distributions of the first one into account and is thereby able to predict joint locations by modeling the interdependence and co-occurrence of the parts.

Pose Estimation

Query Adaptive Similarity for Large Scale Object Retrieval

no code implementations CVPR 2013 Danfeng Qin, Christian Wengert, Luc van Gool

Furthermore, we propose a function to score the individual contributions into an image to image similarity within the probabilistic framework.

Object Retrieval

Random Binary Mappings for Kernel Learning and Efficient SVM

no code implementations19 Jul 2013 Gemma Roig, Xavier Boix, Luc van Gool

SVMs suffer from various drawbacks in terms of selecting the right kernel, which depends on the image descriptors, as well as computational and memory efficiency.

Attribute Quantization

SEEDS: Superpixels Extracted via Energy-Driven Sampling

1 code implementation16 Sep 2013 Michael Van den Bergh, Xavier Boix, Gemma Roig, Luc van Gool

We define a robust and fast to evaluate energy function, based on enforcing color similarity between the bound- aries and the superpixel color histogram.

Superpixels

The Synthesizability of Texture Examples

no code implementations CVPR 2014 Dengxin Dai, Hayko Riemenschneider, Luc van Gool

This work is the first attempt to quantify this image property, and we find that texture synthesizability can be learned and predicted.

Texture Synthesis

Latent Dictionary Learning for Sparse Representation based Classification

no code implementations CVPR 2014 Meng Yang, Dengxin Dai, Lilin Shen, Luc van Gool

Each dictionary atom is jointly learned with a latent vector, which associates this atom to the representation of different classes.

Classification Dictionary Learning +4

Object Classification with Adaptable Regions

no code implementations CVPR 2014 Hakan Bilen, Marco Pedersoli, Vinay P. Namboodiri, Tinne Tuytelaars, Luc van Gool

In classification of objects substantial work has gone into improving the low level representation of an image by considering various aspects such as different features, a number of feature pooling and coding techniques and considering different kernels.

Classification General Classification +1

Fast, Approximate Piecewise-Planar Modeling Based on Sparse Structure-from-Motion and Superpixels

no code implementations CVPR 2014 Andras Bodis-Szomoru, Hayko Riemenschneider, Luc van Gool

State-of-the-art Multi-View Stereo (MVS) algorithms deliver dense depth maps or complex meshes with very high detail, and redundancy over regular surfaces.

Superpixels

Ground Plane Estimation using a Hidden Markov Model

no code implementations CVPR 2014 Ralf Dragon, Luc van Gool

We focus on the problem of estimating the ground plane orientation and location in monocular video sequences from a moving observer.

Using a Deformation Field Model for Localizing Faces and Facial Points under Weak Supervision

no code implementations CVPR 2014 Marco Pedersoli, Tinne Tuytelaars, Luc van Gool

Additionally, without any facial point annotation at the level of individual training images, our method can localize facial points with an accuracy similar to fully supervised approaches.

Face Detection

Comment on "Ensemble Projection for Semi-supervised Image Classification"

no code implementations29 Aug 2014 Xavier Boix, Gemma Roig, Luc van Gool

In a series of papers by Dai and colleagues [1, 2], a feature map (or kernel) was introduced for semi- and unsupervised learning.

Classification General Classification +1

From Categories to Subcategories: Large-Scale Image Classification With Partial Class Label Refinement

no code implementations CVPR 2015 Marko Ristin, Juergen Gall, Matthieu Guillaumin, Luc van Gool

Compared to approaches that disregard the extra coarse labeled data, we achieve a relative improvement in subcategory classification accuracy of up to 22% in our large-scale image classification experiments.

Classification General Classification +1

Joint Vanishing Point Extraction and Tracking

no code implementations CVPR 2015 Till Kroeger, Dengxin Dai, Luc van Gool

Although the method is designed for unknown camera poses, it is also helpful in scenarios with known poses, since a multi-frame approach in VP detection helps to regularize in frames with weak VP line support.

Metric Imitation by Manifold Transfer for Efficient Vision Applications

no code implementations CVPR 2015 Dengxin Dai, Till Kroeger, Radu Timofte, Luc van Gool

In particular, MI consists of: 1) quantifying the properties of source metrics as manifold geometry, 2) transferring the manifold from source domain to target domain, and 3) learning a mapping of TFs so that the manifold is approximated as well as possible in the mapped feature domain.

Clustering Image Clustering +5

Oracle MCG: A first peek into COCO Detection Challenges

no code implementations14 Aug 2015 Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool

The recently presented COCO detection challenge will most probably be the reference benchmark in object detection in the next years.

Object object-detection +1

Is Image Super-resolution Helpful for Other Vision Tasks?

no code implementations23 Sep 2015 Dengxin Dai, Yujian Wang, Yuhua Chen, Luc van Gool

In this paper, we present the first comprehensive study and analysis of the usefulness of ISR for other vision applications.

Edge Detection Image Segmentation +4

DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers

1 code implementation ICCV 2015 Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc van Gool

We generate hypotheses in a sliding-window fashion over different activation layers and show that the final convolutional layers can find the object of interest with high recall but poor localization due to the coarseness of the feature maps.

Object

Some like it hot - visual guidance for preference prediction

no code implementations CVPR 2016 Rasmus Rothe, Radu Timofte, Luc van Gool

Our computational pipeline comprises a face detector, convolutional neural networks for the extraction of deep features, standard support vector regression for gender, age and facial beauty, and - as the main novelties - visual regularized collaborative filtering to infer inter-person preferences as well as a novel regression technique for handling visual queries without rating history.

Collaborative Filtering regression

Seven ways to improve example-based single image super resolution

no code implementations CVPR 2016 Radu Timofte, Rasmus Rothe, Luc van Gool

In this paper we present seven techniques that everybody should know to improve example-based single image super resolution (SR): 1) augmentation of data, 2) use of large dictionaries with efficient search structures, 3) cascading, 4) image self-similarities, 5) back projection refinement, 6) enhanced prediction by consistency check, and 7) context reasoning.

Image Super-Resolution

Boosting Object Proposals: From Pascal to COCO

no code implementations ICCV 2015 Jordi Pont-Tuset, Luc van Gool

Computer vision in general, and object proposals in particular, are nowadays strongly influenced by the databases on which researchers evaluate the performance of their algorithms.

Object

A Gaussian Process Latent Variable Model for BRDF Inference

no code implementations ICCV 2015 Stamatios Georgoulis, Vincent Vanweddingen, Marc Proesmans, Luc van Gool

Although inferring higher dimensional BRDFs from such modest training is not a trivial problem, our method performs better than state-of-the-art parametric, semi-parametric and non-parametric approaches.

Unsupervised High-level Feature Learning by Ensemble Projection for Semi-supervised Image Classification and Image Clustering

no code implementations2 Feb 2016 Dengxin Dai, Luc van Gool

Hence, in the spirit of ensemble learning we create a set of such training sets which are all diverse, leading to diverse classifiers.

Classification Clustering +6

Fast Optical Flow using Dense Inverse Search

no code implementations11 Mar 2016 Till Kroeger, Radu Timofte, Dengxin Dai, Luc van Gool

Most recent works in optical flow extraction focus on the accuracy and neglect the time complexity.

Action Detection Activity Detection +1

Energy-Efficient ConvNets Through Approximate Computing

no code implementations22 Mar 2016 Bert Moons, Bert de Brabandere, Luc van Gool, Marian Verhelst

Recently ConvNets or convolutional neural networks (CNN) have come up as state-of-the-art classification and detection algorithms, achieving near-human performance in visual detection.

Classification General Classification

DeLight-Net: Decomposing Reflectance Maps into Specular Materials and Natural Illumination

no code implementations27 Mar 2016 Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Luc van Gool, Tinne Tuytelaars

In this paper we are extracting surface reflectance and natural environmental illumination from a reflectance map, i. e. from a single 2D image of a sphere of one material under one illumination.

Direction matters: hand pose estimation from local surface normals

no code implementations10 Apr 2016 Chengde Wan, Angela Yao, Luc van Gool

We present a hierarchical regression framework for estimating hand joint positions from single depth images based on local surface normals.

Hand Pose Estimation regression

Actionness Estimation Using Hybrid Fully Convolutional Networks

no code implementations CVPR 2016 Limin Wang, Yu Qiao, Xiaoou Tang, Luc van Gool

Actionness was introduced to quantify the likelihood of containing a generic action instance at a specific location.

Action Detection Action Recognition +1

Low-Cost Scene Modeling using a Density Function Improves Segmentation Performance

no code implementations26 May 2016 Vivek Sharma, Sule Yildirim-Yayilgan, Luc van Gool

We propose a low cost and effective way to combine a free simulation software and free CAD models for modeling human-object interaction in order to improve human & object segmentation.

Human-Object Interaction Detection Object +2

k2-means for fast and accurate large scale clustering

no code implementations30 May 2016 Eirikur Agustsson, Radu Timofte, Luc van Gool

k^2-means builds upon the standard k-means (Lloyd's algorithm) and combines a new strategy to accelerate the convergence with a new low time complexity divisive initialization.

Clustering

Dynamic Filter Networks

1 code implementation NeurIPS 2016 Bert De Brabandere, Xu Jia, Tinne Tuytelaars, Luc van Gool

In a traditional convolutional layer, the learned filters stay fixed after training.

 Ranked #1 on Video Prediction on KTH (Cond metric)

Depth Estimation Optical Flow Estimation +1

A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation

1 code implementation CVPR 2016 Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc van Gool, Markus Gross, Alexander Sorkine-Hornung

The dataset, named DAVIS (Densely Annotated VIdeo Segmentation), consists of fifty high quality, Full HD video sequences, spanning multiple occurrences of common video object segmentation challenges such as occlusions, motion-blur and appearance changes.

Segmentation Semantic Segmentation +3

Progressive Prioritized Multi-View Stereo

1 code implementation CVPR 2016 Alex Locher, Michal Perdoch, Luc van Gool

This work proposes a progressive patch based multi-view stereo algorithm able to deliver a dense point cloud at any time.

Scale-Aware Alignment of Hierarchical Image Segmentation

1 code implementation CVPR 2016 Yuhua Chen, Dengxin Dai, Jordi Pont-Tuset, Luc van Gool

To demonstrate the power of our method, we perform comprehensive experiments, which show that our method, as a post-processing step, can significantly improve the quality of the hierarchical segmentation representations, and ease the usage of hierarchical image segmentation to high-level vision tasks such as object segmentation.

Image Segmentation Segmentation +1

Fast Algorithms for Linear and Kernel SVM+

no code implementations CVPR 2016 Wen Li, Dengxin Dai, Mingkui Tan, Dong Xu, Luc van Gool

The SVM+ approach has shown excellent performance in visual recognition tasks for exploiting privileged information in the training data.

DeepProposals: Hunting Objects and Actions by Cascading Deep Convolutional Layers

1 code implementation15 Jun 2016 Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc van Gool

In this paper, a new method for generating object and action proposals in images and videos is proposed.

Object

Generic 3D Convolutional Fusion for image restoration

no code implementations26 Jul 2016 Jiqing Wu, Radu Timofte, Luc van Gool

Unlike other methods adapted to different tasks, our method uses the exact same convolutional network architecture to address both image denois- ing and single image super-resolution.

Image Denoising Image Restoration +1

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016

1 code implementation2 Aug 2016 Yuanjun Xiong, Li-Min Wang, Zhe Wang, Bo-Wen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc van Gool, Xiaoou Tang

This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016.

General Classification Video Classification

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

19 code implementations2 Aug 2016 Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool

The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.

Action Classification Action Recognition In Videos +2

Convolutional Oriented Boundaries

1 code implementation9 Aug 2016 Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool

We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Convolutional Neural Networks (CNNs).

Contour Detection General Classification +2

Deep Expectation of Real and Apparent Age from a Single Image Without Facial Landmarks

no code implementations IJCV 2016 Rasmus Rothe, Radu Timofte, Luc van Gool

In this paper we propose a deep learning solution to age estimation from a single face image without the use of facial landmarks and introduce the IMDB-WIKI dataset, the largest public dataset of face images with age and gender labels.

Age Estimation Face Alignment +2

DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns

no code implementations CVPR 2016 Ali Diba, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc van Gool

On the other hand, we let an iteration of feature learning and patch clustering purify the set of dedicated patches that we use.

Attribute Clustering

A Riemannian Network for SPD Matrix Learning

no code implementations15 Aug 2016 Zhiwu Huang, Luc van Gool

Symmetric Positive Definite (SPD) matrix learning methods have become popular in many image and video processing tasks, thanks to their ability to learn appropriate statistical representations while respecting Riemannian geometry of underlying SPD manifolds.

Cross Euclidean-to-Riemannian Metric Learning with Application to Face Recognition from Video

no code implementations15 Aug 2016 Zhiwu Huang, Ruiping Wang, Shiguang Shan, Luc van Gool, Xilin Chen

With this mapping, the problem of learning a cross-view metric between the two source heterogeneous spaces can be expressed as learning a single-view Euclidean distance metric in the target common Euclidean space.

Face Recognition Metric Learning

Geometry-aware Similarity Learning on SPD Manifolds for Visual Recognition

no code implementations17 Aug 2016 Zhiwu Huang, Ruiping Wang, Xianqiu Li, Wenxian Liu, Shiguang Shan, Luc van Gool, Xilin Chen

Specifically, by exploiting the Riemannian geometry of the manifold of fixed-rank Positive Semidefinite (PSD) matrices, we present a new solution to reduce optimizing over the space of column full-rank transformation matrices to optimizing on the PSD manifold which has a well-established Riemannian structure.

Does V-NIR based Image Enhancement Come with Better Features?

no code implementations23 Aug 2016 Vivek Sharma, Luc van Gool

Image enhancement using the visible (V) and near-infrared (NIR) usually enhances useful image details.

Image Enhancement

Failure Detection for Facial Landmark Detectors

no code implementations23 Aug 2016 Andreas Steger, Radu Timofte, Luc van Gool

Most face applications depend heavily on the accuracy of the face and facial landmarks detectors employed.

Facial Landmark Detection

Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification

no code implementations31 Aug 2016 Ali Diba, Ali Mohammad Pazandeh, Luc van Gool

The video and action classification have extremely evolved by deep neural networks specially with two stream CNN using RGB and optical flow as inputs and they present outstanding performance in terms of video analysis.

3D Architecture Action Classification +4

Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images

no code implementations1 Sep 2016 Limin Wang, Zhe Wang, Yu Qiao, Luc van Gool

These newly designed transferring techniques exploit multi-task learning frameworks to incorporate extra knowledge from other networks and additional datasets into the training procedure of event CNNs.

Multi-Task Learning

Deep Retinal Image Understanding

1 code implementation5 Sep 2016 Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool

This paper presents Deep Retinal Image Understanding (DRIU), a unified framework of retinal image analysis that provides both retinal vessel and optic disc segmentation.

General Classification Image Classification +4

Efficient Volumetric Fusion of Airborne and Street-Side Data for Urban Reconstruction

no code implementations5 Sep 2016 András Bódis-Szomorú, Hayko Riemenschneider, Luc van Gool

Airborne acquisition and on-road mobile mapping provide complementary 3D information of an urban landscape: the former acquires roof structures, ground, and vegetation at a large scale, but lacks the facade and street-side details, while the latter is incomplete for higher floors and often totally misses out on pedestrian-only areas or undriven districts.

Dilemma First Search for Effortless Optimization of NP-Hard Problems

no code implementations12 Sep 2016 Julien Weissenberg, Hayko Riemenschneider, Ralf Dragon, Luc van Gool

We evaluate DFS on two problems: First, the Knapsack problem, for which efficient algorithms exist, serves as a toy example.

One-Shot Video Object Segmentation

8 code implementations CVPR 2017 Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

This paper tackles the task of semi-supervised video object segmentation, i. e., the separation of an object from the background in a video, given the mask of the first frame.

Foreground Segmentation Object +4

Building Deep Networks on Grassmann Manifolds

no code implementations17 Nov 2016 Zhiwu Huang, Jiqing Wu, Luc van Gool

Learning representations on Grassmann manifolds is popular in quite a few visual recognition tasks.

Deep Temporal Linear Encoding Networks

2 code implementations CVPR 2017 Ali Diba, Vivek Sharma, Luc van Gool

Advantages of TLEs are: (a) they encode the entire video into a compact feature representation, learning the semantics and a discriminative feature space; (b) they are applicable to all kinds of networks like 2D and 3D CNNs for video classification; and (c) they model feature interactions in a more expressive way and without loss of information.

Representation Learning Video Classification

Weakly Supervised Cascaded Convolutional Networks

no code implementations CVPR 2017 Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, Luc van Gool

The final stage of both architectures is a part of a convolutional neural network that performs multiple instance learning on proposals extracted in the previous stage(s).

Multiple Instance Learning Object +3

A Dataset for Multimodal Question Answering in the Cultural Heritage Domain

no code implementations WS 2016 Shurong Sheng, Luc van Gool, Marie-Francine Moens

In this paper, we introduce the construction of a golden standard dataset that will aid research of multimodal question answering in the cultural heritage domain.

Question Answering Speech Recognition +1

DARN: a Deep Adversial Residual Network for Intrinsic Image Decomposition

no code implementations23 Dec 2016 Louis Lettry, Kenneth Vanhoey, Luc van Gool

We present a new deep supervised learning method for intrinsic decomposition of a single image into its albedo and shading components.

Intrinsic Image Decomposition valid

AENet: Learning Deep Audio Features for Video Analysis

1 code implementation3 Jan 2017 Naoya Takahashi, Michael Gygli, Luc van Gool

Instead, combining visual features with our AENet features, which can be computed efficiently on a GPU, leads to significant performance improvements on action recognition and video highlight detection.

Action Recognition Data Augmentation +4

Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks

2 code implementations17 Jan 2017 Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Pablo Arbeláez, Luc van Gool

We present Convolutional Oriented Boundaries (COB), which produces multiscale oriented contours and region hierarchies starting from generic image classification Convolutional Neural Networks (CNNs).

Boundary Detection Contour Detection +7

Crossing Nets: Combining GANs and VAEs with a Shared Latent Space for Hand Pose Estimation

no code implementations CVPR 2017 Chengde Wan, Thomas Probst, Luc van Gool, Angela Yao

Regressing the hand pose can then be done by learning a discriminator to estimate the posterior of the latent pose given some depth maps.

3D Hand Pose Estimation

PathTrack: Fast Trajectory Annotation with Path Supervision

no code implementations ICCV 2017 Santiago Manen, Michael Gygli, Dengxin Dai, Luc van Gool

We further validate our approach by crowdsourcing the PathTrack dataset, with more than 15, 000 person trajectories in 720 sequences.

Multiple Object Tracking Object +1

Learned Multi-Patch Similarity

1 code implementation ICCV 2017 Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc van Gool, Konrad Schindler

Estimating a depth map from multiple views of a scene is a fundamental task in computer vision.

The 2017 DAVIS Challenge on Video Object Segmentation

no code implementations3 Apr 2017 Jordi Pont-Tuset, Federico Perazzi, Sergi Caelles, Pablo Arbeláez, Alex Sorkine-Hornung, Luc van Gool

The DAVIS Challenge follows up on the recent publication of DAVIS (Densely-Annotated VIdeo Segmentation), which has fostered the development of several novel state-of-the-art video object segmentation techniques.

Object Scene Classification +5

On the Relation between Color Image Denoising and Classification

no code implementations5 Apr 2017 Jiqing Wu, Radu Timofte, Zhiwu Huang, Luc van Gool

Inspired by classification models, we propose a novel deep learning architecture for color (multichannel) image denoising and report on thousands of images from ImageNet dataset as well as commonly used imagery.

Classification Color Image Denoising +3

Semantically-Guided Video Object Segmentation

no code implementations6 Apr 2017 Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Luc van Gool

This paper tackles the problem of semi-supervised video object segmentation, that is, segmenting an object in a sequence given its mask in the first frame.

Object Segmentation +3

DSLR-Quality Photos on Mobile Devices with Deep Convolutional Networks

3 code implementations ICCV 2017 Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc van Gool

Despite a rapid rise in the quality of built-in smartphone cameras, their physical limitations - small sensor size, compact lenses and the lack of specific hardware, - impede them to achieve the quality results of DSLR cameras.

Translation

Query-adaptive Video Summarization via Quality-aware Relevance Estimation

1 code implementation1 May 2017 Arun Balajee Vasudevan, Michael Gygli, Anna Volokitin, Luc van Gool

Although the problem of automatic video summarization has recently received a lot of attention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied.

Video Summarization

Speech-Based Visual Question Answering

1 code implementation1 May 2017 Ted Zhang, Dengxin Dai, Tinne Tuytelaars, Marie-Francine Moens, Luc van Gool

This paper introduces speech-based visual question answering (VQA), the task of generating an answer given an image and a spoken question.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Temporal Segment Networks for Action Recognition in Videos

11 code implementations8 May 2017 Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool

Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.

Action Classification Action Recognition In Videos +3

WebVision Challenge: Visual Learning and Understanding With Web Data

no code implementations16 May 2017 Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Jesse Berent, Abhinav Gupta, Rahul Sukthankar, Luc van Gool

The 2017 WebVision challenge consists of two tracks, the image classification task on WebVision test set, and the transfer learning task on PASCAL VOC 2012 dataset.

Benchmarking Image Classification +1

Pose Guided Person Image Generation

2 code implementations NeurIPS 2017 Liqian Ma, Xu Jia, Qianru Sun, Bernt Schiele, Tinne Tuytelaars, Luc van Gool

This paper proposes the novel Pose Guided Person Generation Network (PG$^2$) that allows to synthesize person images in arbitrary poses, based on an image of that person and a novel pose.

Gesture-to-Gesture Translation Pose Transfer

Sliced Wasserstein Generative Models

1 code implementation8 Jun 2017 Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool

In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.

Image Generation Video Generation

Consensus Maximization With Linear Matrix Inequality Constraints

no code implementations CVPR 2017 Pablo Speciale, Danda Pani Paudel, Martin R. Oswald, Till Kroeger, Luc van Gool, Marc Pollefeys

While randomized methods like RANSAC are fast, they do not guarantee global optimality and fail to manage large amounts of outliers.

An Analysis of Human-centered Geolocation

2 code implementations10 Jul 2017 Kaili Wang, Yu-Hui Huang, Jose Oramas, Luc van Gool, Tinne Tuytelaars

Experiments on the Fashion 144k and a Pinterest-based dataset show that the automatic methods succeed at this task to a reasonable extent.

Deep Domain Adaptation by Geodesic Distance Minimization

no code implementations13 Jul 2017 Yifei Wang, Wen Li, Dengxin Dai, Luc van Gool

Our work builds on the recently proposed Deep CORAL method, which proposed to train a convolutional neural network and simultaneously minimize the Euclidean distance of convariance matrices between the source and target domains.

Domain Adaptation

Semantic Instance Segmentation with a Discriminative Loss Function

8 code implementations8 Aug 2017 Bert De Brabandere, Davy Neven, Luc van Gool

In this work we propose to tackle the problem with a discriminative loss function, operating at the pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step.

Instance Segmentation Lane Detection +4

WebVision Database: Visual Learning and Understanding from Web Data

no code implementations9 Aug 2017 Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Luc van Gool

Our new WebVision database and relevant studies in this work would benefit the advance of learning state-of-the-art visual models with minimum supervision based on web data.

Domain Adaptation

Semantic Foggy Scene Understanding with Synthetic Data

no code implementations25 Aug 2017 Christos Sakaridis, Dengxin Dai, Luc van Gool

Due to the difficulty of collecting and annotating foggy images, we choose to generate synthetic fog on real images that depict clear-weather outdoor scenes, and then leverage these partially synthetic data for SFSU by employing state-of-the-art convolutional neural networks (CNN).

Image Dehazing object-detection +3

WESPE: Weakly Supervised Photo Enhancer for Digital Cameras

3 code implementations4 Sep 2017 Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc van Gool

Low-end and compact mobile cameras demonstrate limited photo quality mainly due to space, hardware and budget constraints.

Generative Adversarial Network

Video Object Segmentation Without Temporal Information

no code implementations18 Sep 2017 Kevis-Kokitsi Maninis, Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames.

Foreground Segmentation Object +5

Anchored Regression Networks Applied to Age Estimation and Super Resolution

no code implementations ICCV 2017 Eirikur Agustsson, Radu Timofte, Luc van Gool

We propose the Anchored Regression Network (ARN), a nonlinear regression network which can be seamlessly integrated into various networks or can be used stand-alone when the features have already been fixed.

Age Estimation Image Super-Resolution +1

Optimal Transformation Estimation With Semantic Cues

no code implementations ICCV 2017 Danda Pani Paudel, Adlane Habed, Luc van Gool

This paper addresses the problem of estimating the geometric transformation relating two distinct visual modalities (e. g. an image and a map, or a projective structure and a Euclidean 3D model) while relying only on semantic cues, such as semantically segmented regions or object bounding boxes.

Classification Driven Dynamic Image Enhancement

no code implementations20 Oct 2017 Vivek Sharma, Ali Diba, Davy Neven, Michael S. Brown, Luc van Gool, Rainer Stiefelhagen

In this paper, we are interested in learning CNNs that can emulate image enhancement and restoration, but with the overall goal to improve image classification and not necessarily human perception.

Classification General Classification +3

Optimal transport maps for distribution preserving operations on latent spaces of Generative Models

no code implementations ICLR 2018 Eirikur Agustsson, Alexander Sage, Radu Timofte, Luc van Gool

Generative models such as Variational Auto Encoders (VAEs) and Generative Adversarial Networks (GANs) are typically trained for a fixed prior distribution in the latent space, such as uniform or Gaussian.

Object Referring in Visual Scene with Spoken Language

no code implementations10 Nov 2017 Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool

This paper investigates Object Referring with Spoken Language (ORSpoken) by presenting two datasets and one novel approach.

Object

Acquiring Common Sense Spatial Knowledge through Implicit Spatial Templates

1 code implementation18 Nov 2017 Guillem Collell, Luc van Gool, Marie-Francine Moens

In contrast with prior work that restricts spatial templates to explicit spatial prepositions (e. g., "glass on table"), here we extend this concept to implicit spatial language, i. e., those relationships (generally actions) for which the spatial arrangement of the objects is only implicitly implied (e. g., "man riding horse").

Common Sense Reasoning Question Answering +1

Weakly Supervised Object Discovery by Generative Adversarial & Ranking Networks

no code implementations22 Nov 2017 Ali Diba, Vivek Sharma, Rainer Stiefelhagen, Luc van Gool

We approach GANs with a novel training method and learning objective, to discover multiple object instances for three cases: 1) synthesizing a picture of a specific object within a cluttered scene; 2) localizing different categories in images for weakly supervised object detection; and 3) improving object discov- ery in object detection pipelines.

Object object-detection +2

Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification

3 code implementations22 Nov 2017 Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc van Gool

Thus, by finetuning this network, we beat the performance of generic and recent methods in 3D CNNs, which were trained on large video datasets, e. g. Sports-1M, and finetuned on the target datasets, e. g. HMDB51/UCF101.

Action Recognition General Classification +3

Dense 3D Regression for Hand Pose Estimation

1 code implementation CVPR 2018 Chengde Wan, Thomas Probst, Luc van Gool, Angela Yao

Specifically, we decompose the pose parameters into a set of per-pixel estimations, i. e., 2D heat maps, 3D heat maps and unit 3D directional vector fields.

3D Hand Pose Estimation regression

Deep Extreme Cut: From Extreme Points to Object Segmentation

2 code implementations CVPR 2018 Kevis-Kokitsi Maninis, Sergi Caelles, Jordi Pont-Tuset, Luc van Gool

This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise object segmentation for images and videos.

Instance Segmentation Interactive Segmentation +4

Appearance-and-Relation Networks for Video Classification

1 code implementation CVPR 2018 Limin Wang, Wei Li, Wen Li, Luc van Gool

Specifically, SMART blocks decouple the spatiotemporal learning module into an appearance branch for spatial modeling and a relation branch for temporal modeling.

Action Classification Action Recognition +6

Natural and Effective Obfuscation by Head Inpainting

no code implementations CVPR 2018 Qianru Sun, Liqian Ma, Seong Joon Oh, Luc van Gool, Bernt Schiele, Mario Fritz

As more and more personal photos are shared online, being able to obfuscate identities in such photos is becoming a necessity for privacy protection.

Detection-aided liver lesion segmentation using deep learning

2 code implementations29 Nov 2017 Miriam Bellver, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Xavier Giro-i-Nieto, Jordi Torres, Luc van Gool

A fully automatic technique for segmenting the liver and localizing its unhealthy tissues is a convenient tool in order to diagnose hepatic diseases and assess the response to the according treatments.

Computed Tomography (CT) Lesion Segmentation +1

Improving Video Generation for Multi-functional Applications

1 code implementation30 Nov 2017 Bernhard Kratzwald, Zhiwu Huang, Danda Pani Paudel, Acharya Dinesh, Luc van Gool

In this paper, we aim to improve the state-of-the-art video generative adversarial networks (GANs) with a view towards multi-functional applications.

Colorization Future prediction +2

Wasserstein Divergence for GANs

1 code implementation ECCV 2018 Jiqing Wu, Zhiwu Huang, Janine Thoma, Dinesh Acharya, Luc van Gool

In many domains of computer vision, generative adversarial networks (GANs) have achieved great success, among which the family of Wasserstein GANs (WGANs) is considered to be state-of-the-art due to the theoretical contributions and competitive qualitative performance.

Image Generation

Face Translation between Images and Videos using Identity-aware CycleGAN

no code implementations4 Dec 2017 Zhiwu Huang, Bernhard Kratzwald, Danda Pani Paudel, Jiqing Wu, Luc van Gool

This paper presents a new problem of unpaired face translation between images and videos, which can be applied to facial video prediction and enhancement.

Image-to-Image Translation Translation +1

Iterative Deep Learning for Network Topology Extraction

no code implementations4 Dec 2017 Carles Ventura, Jordi Pont-Tuset, Sergi Caelles, Kevis-Kokitsi Maninis, Luc van Gool

This paper tackles the task of estimating the topology of filamentary networks such as retinal vessels and road networks.

Manifold-valued Image Generation with Wasserstein Generative Adversarial Nets

no code implementations5 Dec 2017 Zhiwu Huang, Jiqing Wu, Luc van Gool

In addition, we recommend three benchmark datasets that are CIFAR-10 HSV/CB color images, ImageNet HSV/CB color images, UCL DT image datasets.

Image Generation

Disentangled Person Image Generation

1 code implementation CVPR 2018 Liqian Ma, Qianru Sun, Stamatios Georgoulis, Luc van Gool, Bernt Schiele, Mario Fritz

Generating novel, yet realistic, images of persons is a challenging task due to the complex interplay between the different image factors, such as the foreground, background and pose information.

Gesture-to-Gesture Translation Person Re-Identification +1

ComboGAN: Unrestrained Scalability for Image Domain Translation

1 code implementation19 Dec 2017 Asha Anoosheh, Eirikur Agustsson, Radu Timofte, Luc Van Gool

This year alone has seen unprecedented leaps in the area of learning-based image translation, namely CycleGAN, by Zhu et al.

Image-to-Image Translation Translation

Object Referring in Videos with Language and Human Gaze

no code implementations CVPR 2018 Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool

To that end, we present a new video dataset for OR, with 30, 000 objects over 5, 000 stereo video sequences annotated for their descriptions and gaze.

Object Referring Expression

Conditional Probability Models for Deep Image Compression

1 code implementation CVPR 2018 Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

During training, the auto-encoder makes use of the context model to estimate the entropy of its representation, and the context model is concurrently updated to learn the dependencies between the symbols in the latent representation.

Image Compression MS-SSIM +3

Towards End-to-End Lane Detection: an Instance Segmentation Approach

22 code implementations15 Feb 2018 Davy Neven, Bert de Brabandere, Stamatios Georgoulis, Marc Proesmans, Luc van Gool

By doing so, we ensure a lane fitting which is robust against road plane changes, unlike existing approaches that rely on a fixed, pre-defined transformation.

Instance Segmentation Lane Detection +1

The 2018 DAVIS Challenge on Video Object Segmentation

no code implementations1 Mar 2018 Sergi Caelles, Alberto Montes, Kevis-Kokitsi Maninis, Yu-Hua Chen, Luc van Gool, Federico Perazzi, Jordi Pont-Tuset

Motivated by the analysis of the results of the 2017 edition, the main track of the competition will be the same than in the previous edition (segmentation given the full mask of the objects in the first frame -- semi-supervised scenario).

Interactive Segmentation Object +4

Unsupervised Deep Single-Image Intrinsic Decomposition using Illumination-Varying Image Sequences

no code implementations2 Mar 2018 Louis Lettry, Kenneth Vanhoey, Luc van Gool

Machine learning based Single Image Intrinsic Decomposition (SIID) methods decompose a captured scene into its albedo and shading images by using the knowledge of a large set of known and realistic ground truth decompositions.

Towards Image Understanding from Deep Compression without Decoding

1 code implementation ICLR 2018 Robert Torfason, Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

Motivated by recent work on deep neural network (DNN)-based image compression methods showing potential improvements in image quality, savings in storage, and bandwidth reduction, we propose to perform image understanding tasks such as classification and segmentation directly on the compressed representations produced by these compression methods.

Classification General Classification +2

Progressive Structure from Motion

no code implementations ECCV 2018 Alex Locher, Michal Havlena, Luc van Gool

Structure from Motion or the sparse 3D reconstruction out of individual photos is a long studied topic in computer vision.

3D Reconstruction

End-to-End Learning of Driving Models with Surround-View Cameras and Route Planners

no code implementations ECCV 2018 Simon Hecker, Dengxin Dai, Luc van Gool

In particular, we develop a sensor setup that provides data for a 360-degree view of the area surrounding the vehicle, the driving route to the destination, and low-level driving maneuvers (e. g. steering angle and speed) by human drivers.

Adversarial Binary Coding for Efficient Person Re-identification

no code implementations29 Mar 2018 Zheng Liu, Jie Qin, Annan Li, Yunhong Wang, Luc van Gool

Specifically, instead of learning explicit projections or adding fully-connected mapping layers, the proposed Adversarial Binary Coding (ABC) framework guides the extraction of binary codes implicitly and effectively.

Person Re-Identification

Viewpoint-aware Video Summarization

no code implementations CVPR 2018 Atsushi Kanehira, Luc van Gool, Yoshitaka Ushiku, Tatsuya Harada

To satisfy these requirements (A)-(C) simultaneously, we proposed a novel video summarization method from multiple groups of videos.

Semantic Similarity Semantic Textual Similarity +1

Failure Prediction for Autonomous Driving

no code implementations4 May 2018 Simon Hecker, Dengxin Dai, Luc van Gool

This work presents a method to learn to predict the occurrence of these failures, i. e. to assess how difficult a scene is to a given driving model and to possibly give the human driver an early headsup.

Autonomous Driving

Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency

no code implementations ICLR 2019 Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc van Gool

Experimental results on various datasets show that EGSC-IT does not only translate the source image to diverse instances in the target domain, but also preserves the semantic consistency during the process.

Translation Unsupervised Image-To-Image Translation

Classification-Driven Dynamic Image Enhancement

no code implementations CVPR 2018 Vivek Sharma, Ali Diba, Davy Neven, Michael S. Brown, Luc van Gool, Rainer Stiefelhagen

In this paper, we are interested in learning CNNs that can emulate image enhancement and restoration, but with the overall goal to improve image classification and not necessarily human perception.

Classification General Classification +3

Spatio-Temporal Channel Correlation Networks for Action Classification

no code implementations ECCV 2018 Ali Diba, Mohsen Fayyaz, Vivek Sharma, M. Mahdi Arzani, Rahman Yousefzadeh, Juergen Gall, Luc van Gool

Our experiments show that adding STC blocks to current state-of-the-art architectures outperforms the state-of-the-art methods on the HMDB51, UCF101 and Kinetics datasets.

Action Classification Classification +1

Model-free Consensus Maximization for Non-Rigid Shapes

no code implementations ECCV 2018 Thomas Probst, Ajad Chhatkuli, Danda Pani Paudel, Luc van Gool

In this paper, we formulate the model-free consensus maximization as an Integer Program in a graph using `rules' on measurements.

Multi-bin Trainable Linear Unit for Fast Image Restoration Networks

no code implementations30 Jul 2018 Shuhang Gu, Radu Timofte, Luc van Gool

Tremendous advances in image restoration tasks such as denoising and super-resolution have been achieved using neural networks.

Image Denoising Image Restoration +1

Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding

no code implementations ECCV 2018 Christos Sakaridis, Dengxin Dai, Simon Hecker, Luc van Gool

In addition, we present three other main stand-alone contributions: 1) a novel method to add synthetic fog to real, clear-weather scenes using semantic input; 2) a new fog density estimator; 3) the Foggy Zurich dataset comprising $3808$ real foggy images, with pixel-level semantic annotations for $16$ images with dense fog.

Scene Understanding Semantic Segmentation

Incremental Non-Rigid Structure-from-Motion with Unknown Focal Length

no code implementations ECCV 2018 Thomas Probst, Danda Pani Paudel, Ajad Chhatkuli, Luc van Gool

In this paper we present a method for incremental Non-Rigid Structure-from-Motion (NRSfM) with the perspective camera model and the isometric surface prior with unknown focal length.

Sampling Algebraic Varieties for Robust Camera Autocalibration

no code implementations ECCV 2018 Danda Pani Paudel, Luc van Gool

This paper addresses the problem of robustly autocalibrating a moving camera with constant intrinsics.

Generative Domain-Migration Hashing for Sketch-to-Image Retrieval

1 code implementation ECCV 2018 Jingyi Zhang, Fumin Shen, Li Liu, Fan Zhu, Mengyang Yu, Ling Shao, Heng Tao Shen, Luc van Gool

The generative model learns a mapping that the distributions of sketches can be indistinguishable from the distribution of natural images using an adversarial loss, and simultaneously learns an inverse mapping based on the cycle consistency loss in order to enhance the indistinguishability.

Multi-Task Learning Retrieval +1

stagNet: An Attentive Semantic RNN for Group Activity Recognition

no code implementations ECCV 2018 Mengshi Qi, Jie Qin, Annan Li, Yunhong Wang, Jiebo Luo, Luc van Gool

Group activity recognition plays a fundamental role in a variety of applications, e. g. sports video analysis and intelligent surveillance.

Group Activity Recognition

Night-to-Day Image Translation for Retrieval-based Localization

1 code implementation26 Sep 2018 Asha Anoosheh, Torsten Sattler, Radu Timofte, Marc Pollefeys, Luc van Gool

We then compare the daytime and translated night images to obtain a pose estimate for the night image using the known 6-DOF position of the closest day image.

Image Retrieval Position +4

AI Benchmark: Running Deep Neural Networks on Android Smartphones

1 code implementation2 Oct 2018 Andrey Ignatov, Radu Timofte, William Chou, Ke Wang, Max Wu, Tim Hartley, Luc van Gool

Over the last years, the computational power of mobile devices such as smartphones and tablets has grown dramatically, reaching the level of desktop computers available not long ago.

Towards High Resolution Video Generation with Progressive Growing of Sliced Wasserstein GANs

1 code implementation4 Oct 2018 Dinesh Acharya, Zhiwu Huang, Danda Pani Paudel, Luc van Gool

Furthermore, we introduce a sliced version of Wasserstein GAN (SWGAN) loss to improve the distribution learning on the video data of high-dimension and mixed-spatiotemporal distribution.

Action Recognition Image Generation +2

Practical Full Resolution Learned Lossless Image Compression

3 code implementations CVPR 2019 Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

We propose the first practical learned lossless image compression system, L3C, and show that it outperforms the popular engineered codecs, PNG, WebP and JPEG 2000.

Image Compression

SMIT: Stochastic Multi-Label Image-to-Image Translation

1 code implementation10 Dec 2018 Andrés Romero, Pablo Arbeláez, Luc van Gool, Radu Timofte

This problem is highly challenging due to three main reasons: (i) unpaired datasets, (ii) multiple attributes, and (iii) the multimodality (e. g., style) associated with the translation.

Image-to-Image Translation Translation

Learning Semantic Segmentation from Synthetic Data: A Geometrically Guided Input-Output Adaptation Approach

no code implementations CVPR 2019 Yuhua Chen, Wen Li, Xiaoran Chen, Luc van Gool

In this work, we take the advantage of additional geometric information from synthetic data, a powerful yet largely neglected cue, to bridge the domain gap.

Depth Estimation Segmentation +1

DLOW: Domain Flow for Adaptation and Generalization

1 code implementation CVPR 2019 Rui Gong, Wen Li, Yu-Hua Chen, Luc van Gool

In this work, we present a domain flow generation(DLOW) model to bridge two different domains by generating a continuous sequence of intermediate domains flowing from one domain to the other.

Domain Adaptation Semantic Segmentation +1

Fast Perceptual Image Enhancement

1 code implementation31 Dec 2018 Etienne de Stoutz, Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Luc van Gool

We extend upon the results of Ignatov et al., where they are able to translate images from compact mobile cameras into images with comparable quality to high-resolution photos taken by DSLR cameras.

Image Enhancement

Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding

1 code implementation5 Jan 2019 Dengxin Dai, Christos Sakaridis, Simon Hecker, Luc van Gool

The method is based on the fact that the results of semantic segmentation in moderately adverse conditions (light fog) can be bootstrapped to solve the same problem in highly adverse conditions (dense fog).

Domain Adaptation Scene Understanding +2

RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

1 code implementation CVPR 2018 Despoina Paschalidou, Ali Osman Ulusoy, Carolin Schmitt, Luc van Gool, Andreas Geiger

RayNet integrates a CNN that learns view-invariant feature representations with an MRF that explicitly encodes the physics of perspective projection and occlusion.

3D Reconstruction

Guided Curriculum Model Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation

1 code implementation ICCV 2019 Christos Sakaridis, Dengxin Dai, Luc van Gool

Our central contributions are: 1) a curriculum framework to gradually adapt semantic segmentation models from day to night via labeled synthetic images and unlabeled real images, both for progressively darker times of day, which exploits cross-time-of-day correspondences for the real images to guide the inference of their labels; 2) a novel uncertainty-aware annotation and evaluation framework and metric for semantic segmentation, designed for adverse conditions and including image regions beyond human recognition capability in the evaluation in a principled fashion; 3) the Dark Zurich dataset, which comprises 2416 unlabeled nighttime and 2920 unlabeled twilight images with correspondences to their daytime counterparts plus a set of 151 nighttime images with fine pixel-level annotations created with our protocol, which serves as a first benchmark to perform our novel evaluation.

Image Segmentation Segmentation +2

End-to-end Lane Detection through Differentiable Least-Squares Fitting

1 code implementation1 Feb 2019 Wouter Van Gansbeke, Bert de Brabandere, Davy Neven, Marc Proesmans, Luc van Gool

The problem with such a two-step approach is that the parameters of the network are not optimized for the true task of interest (estimating the lane curvature parameters) but for a proxy task (segmenting the lane markings), resulting in sub-optimal performance.

Lane Detection

Real-time 3D Traffic Cone Detection for Autonomous Driving

2 code implementations6 Feb 2019 Ankit Dhall, Dengxin Dai, Luc van Gool

In this work, we leverage the unique structure of traffic cones and propose a pipelined approach to the problem.

3D Object Detection Autonomous Driving +3

Sparse and noisy LiDAR completion with RGB guidance and uncertainty

1 code implementation14 Feb 2019 Wouter Van Gansbeke, Davy Neven, Bert de Brabandere, Luc van Gool

However, we additionally propose a fusion method with RGB guidance from a monocular camera in order to leverage object information and to correct mistakes in the sparse input.

Autonomous Vehicles Depth Completion +2

Learning Accurate, Comfortable and Human-like Driving

no code implementations26 Mar 2019 Simon Hecker, Dengxin Dai, Luc van Gool

Our model is trained and evaluated on the Drive360 dataset, which features 60 hours and 3000 km of real-world driving data.

Autonomous Vehicles Navigate

Fast video object segmentation with Spatio-Temporal GANs

no code implementations28 Mar 2019 Sergi Caelles, Albert Pumarola, Francesc Moreno-Noguer, Alberto Sanfeliu, Luc van Gool

To achieve this, we concentrate all the heavy computational load to the training phase with two critics that enforce spatial and temporal mask consistency over the last K frames.

Descriptive Object +4

Branched Multi-Task Networks: Deciding What Layers To Share

no code implementations ICLR 2020 Simon Vandenhende, Stamatios Georgoulis, Bert de Brabandere, Luc van Gool

In the context of multi-task learning, neural networks with branched architectures have often been employed to jointly tackle the tasks at hand.

Multi-Task Learning Neural Architecture Search

Sliced Wasserstein Generative Models

1 code implementation CVPR 2019 Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool

In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.

Image Generation Video Generation

Learning Discriminative Model Prediction for Tracking

2 code implementations ICCV 2019 Goutam Bhat, Martin Danelljan, Luc van Gool, Radu Timofte

The current strive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking.

Visual Object Tracking Visual Tracking

A Novel BiLevel Paradigm for Image-to-Image Translation

no code implementations18 Apr 2019 Liqian Ma, Qianru Sun, Bernt Schiele, Luc van Gool

Image-to-image (I2I) translation is a pixel-level mapping that requires a large number of paired training data and often suffers from the problems of high diversity and strong category bias in image scenes.

Image-to-Image Translation Translation

DynamoNet: Dynamic Action and Motion Network

no code implementations ICCV 2019 Ali Diba, Vivek Sharma, Luc van Gool, Rainer Stiefelhagen

With these overall objectives, to this end, we introduce a novel unified spatio-temporal 3D-CNN architecture (DynamoNet) that jointly optimizes the video classification and learning motion representation by predicting future frames as a multi-task learning problem.

Action Recognition Classification +5

Large Scale Holistic Video Understanding

1 code implementation ECCV 2020 Ali Diba, Mohsen Fayyaz, Vivek Sharma, Manohar Paluri, Jurgen Gall, Rainer Stiefelhagen, Luc van Gool

HVU is organized hierarchically in a semantic taxonomy that focuses on multi-label and multi-task video understanding as a comprehensive problem that encompasses the recognition of multiple semantic aspects in the dynamic scene.

Action Classification Action Recognition +7

The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation

no code implementations2 May 2019 Sergi Caelles, Jordi Pont-Tuset, Federico Perazzi, Alberto Montes, Kevis-Kokitsi Maninis, Luc van Gool

We present the 2019 DAVIS Challenge on Video Object Segmentation, the third edition of the DAVIS Challenge series, a public competition designed for the task of Video Object Segmentation (VOS).

Object Segmentation +3

Semi-Supervised Learning by Augmented Distribution Alignment

1 code implementation ICCV 2019 Qin Wang, Wen Li, Luc van Gool

We reveal that an essential sampling bias exists in semi-supervised learning due to the limited number of labeled samples, which often leads to a considerable empirical distribution mismatch between labeled data and unlabeled data.

Domain Adaptation Semi-Supervised Image Classification

3D Appearance Super-Resolution with Deep Learning

1 code implementation CVPR 2019 Yawei Li, Vagia Tsiminaki, Radu Timofte, Marc Pollefeys, Luc van Gool

Experimental results demonstrate that our proposed networks successfully incorporate the 3D geometric information and super-resolve the texture maps.

Super-Resolution

Gated CRF Loss for Weakly Supervised Semantic Image Segmentation

no code implementations11 Jun 2019 Anton Obukhov, Stamatios Georgoulis, Dengxin Dai, Luc van Gool

State-of-the-art approaches for semantic segmentation rely on deep convolutional neural networks trained on fully annotated datasets, that have been shown to be notoriously expensive to collect, both in terms of time and money.

Image Segmentation Weakly supervised Semantic Segmentation +1

Learning a Curve Guardian for Motorcycles

no code implementations12 Jul 2019 Simon Hecker, Alexander Liniger, Henrik Maurenbrecher, Dengxin Dai, Luc van Gool

Our contributes are fourfold: 1) we predict the motorcycle's intra-lane position using a convolutional neural network (CNN), 2) we predict the motorcycle roll angle using a CNN, 3) we use an upgraded controller model that incorporates road incline for a more realistic model and prediction, 4) we design a scale-able system by utilizing HERE Technologies map database to obtain the accurate road geometry of the future path.

Position

Dual Grid Net: hand mesh vertex regression from single depth maps

no code implementations ECCV 2020 Chengde Wan, Thomas Probst, Luc van Gool, Angela Yao

In the first stage, the network estimates a dense correspondence field for every pixel on the depth map or image grid to the mesh grid.

regression

Learning Filter Basis for Convolutional Neural Network Compression

3 code implementations ICCV 2019 Yawei Li, Shuhang Gu, Luc van Gool, Radu Timofte

Convolutional neural networks (CNNs) based solutions have achieved state-of-the-art performances for many computer vision tasks, including classification and super-resolution of images.

General Classification Image Classification +2

Texture Underfitting for Domain Adaptation

no code implementations29 Aug 2019 Jan-Nico Zaech, Dengxin Dai, Martin Hahner, Luc van Gool

Comprehensive semantic segmentation is one of the key components for robust scene understanding and a requirement to enable autonomous driving.

Autonomous Driving Domain Adaptation +4

Extremely Weak Supervised Image-to-Image Translation for Semantic Segmentation

1 code implementation18 Sep 2019 Samarth Shukla, Luc van Gool, Radu Timofte

Recent advances in generative models and adversarial training have led to a flourishing image-to-image (I2I) translation literature.

Image Segmentation Image-to-Image Translation +2

Talk2Car: Taking Control of Your Self-Driving Car

1 code implementation IJCNLP 2019 Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Luc van Gool, Marie-Francine Moens

Or more specifically, we consider the problem in an autonomous driving setting, where a passenger requests an action that can be associated with an object found in a street scene.

Autonomous Driving Object +2

Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision

no code implementations ICCV 2019 Thomas Probst, Danda Pani Paudel, Ajad Chhatkuli, Luc van Gool

Notably, we further exploit the POP formulation of non-minimal solver also for the generic consensus maximization problems in 3D vision.

AI Benchmark: All About Deep Learning on Smartphones in 2019

no code implementations15 Oct 2019 Andrey Ignatov, Radu Timofte, Andrei Kulik, Seungsoo Yang, Ke Wang, Felix Baum, Max Wu, Lirong Xu, Luc van Gool

The performance of mobile AI accelerators has been evolving rapidly in the past two years, nearly doubling with each new generation of SoCs.

Self-supervised Object Motion and Depth Estimation from Video

no code implementations9 Dec 2019 Qi Dai, Vaishakh Patil, Simon Hecker, Dengxin Dai, Luc van Gool, Konrad Schindler

We present a self-supervised learning framework to estimate the individual object motion and monocular depth from video.

Depth Estimation Instance Segmentation +5

Cannot find the paper you are looking for? You can Submit a new open access paper.