Search Results for author: Jakob Verbeek

Found 57 papers, 24 papers with code

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs

no code implementations • 20 Mar 2024 • Théophane Vallaeys, Mustafa Shukor, Matthieu Cord, Jakob Verbeek

The abilities of large language models (LLMs) have recently progressed to unprecedented levels, paving the way to novel applications in a wide variety of areas.

Audio captioning Image Captioning +2

Paper
Add Code

Better (pseudo-)labels for semi-supervised instance segmentation

no code implementations • 18 Mar 2024 • François Porcher, Camille Couprie, Marc Szafraniec, Jakob Verbeek

Despite the availability of large datasets for tasks like image classification and image-text alignment, labeled data for more complex recognition tasks, such as detection and segmentation, is less abundant.

Few-Shot Learning Image Classification +3

Paper
Add Code

Residual Quantization with Implicit Neural Codebooks

1 code implementation • 26 Jan 2024 • Iris Huijben, Matthijs Douze, Matthew Muckley, Ruud Van Sloun, Jakob Verbeek

In this paper, we propose QINCo, a neural RQ variant which predicts specialized codebooks per vector using a neural network that is conditioned on the approximation of the vector from previous steps.

Data Compression Quantization

Paper
Code

Unlocking Pre-trained Image Backbones for Semantic Image Synthesis

no code implementations • 20 Dec 2023 • Tariq Berrada, Jakob Verbeek, Camille Couprie, Karteek Alahari

Semantic image synthesis, i. e., generating images from user-provided semantic label maps, is an important conditional image generation task as it allows to control both the content as well as the spatial layout of generated images.

Ranked #1 on Image-to-Image Translation on ADE20K Labels-to-Photos

Conditional Image Generation Image Classification +1

Paper
Add Code

Towards image compression with perfect realism at ultra-low bitrates

no code implementations • 16 Oct 2023 • Marlène Careil, Matthew J. Muckley, Jakob Verbeek, Stéphane Lathuilière

We find that our model leads to reconstructions with state-of-the-art visual quality as measured by FID and KID.

Image Compression

Paper
Add Code

Guided Distillation for Semi-Supervised Instance Segmentation

1 code implementation • 3 Aug 2023 • Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek

Although instance segmentation methods have improved considerably, the dominant paradigm is to rely on fully-annotated training images, which are tedious to obtain.

Ranked #1 on Semi-Supervised Instance Segmentation on COCO 10% labeled data

Instance Segmentation Semantic Segmentation +1

Paper
Code

Multi-Domain Learning with Modulation Adapters

no code implementations • 17 Jul 2023 • Ekaterina Iakovleva, Karteek Alahari, Jakob Verbeek

Deep convolutional networks are ubiquitous in computer vision, due to their excellent performance across different tasks for various domains.

Image Classification

Paper
Add Code

Zero-shot spatial layout conditioning for text-to-image diffusion models

no code implementations • ICCV 2023 • Guillaume Couairon, Marlène Careil, Matthieu Cord, Stéphane Lathuilière, Jakob Verbeek

Large-scale text-to-image diffusion models have significantly improved the state of the art in generative image modelling and allow for an intuitive and powerful user interface to drive the image generation process.

Image Generation Segmentation +1

Paper
Add Code

Improved baselines for vision-language pre-training

1 code implementation • 15 May 2023 • Enrico Fini, Pietro Astolfi, Adriana Romero-Soriano, Jakob Verbeek, Michal Drozdzal

Indeed, we find that a simple CLIP baseline can also be improved substantially, up to a 25% relative improvement on downstream zero-shot tasks, by using well-known training techniques that are popular in other subfields.

Contrastive Learning Data Augmentation +1

Paper
Code

Controllable Image Generation via Collage Representations

no code implementations • 26 Apr 2023 • Arantxa Casanova, Marlène Careil, Adriana Romero-Soriano, Christopher J. Pal, Jakob Verbeek, Michal Drozdzal

Our experiments on the OI dataset show that M&Ms outperforms baselines in terms of fine-grained scene controllability while being very competitive in terms of image quality and sample diversity.

Attribute Image Generation

Paper
Add Code

Are Visual Recognition Models Robust to Image Compression?

no code implementations • 10 Apr 2023 • João Maria Janeiro, Stanislav Frolov, Alaaeldin El-Nouby, Jakob Verbeek

For example, for segmentation mIoU is reduced from 44. 5 to 30. 5 mIoU when compressing to 0. 1 bpp using the best compression model we evaluated.

Image Classification Image Compression +4

Paper
Add Code

Few-shot Semantic Image Synthesis with Class Affinity Transfer

no code implementations • CVPR 2023 • Marlène Careil, Jakob Verbeek, Stéphane Lathuilière

The class affinity matrix is introduced as a first layer to the source model to make it compatible with the target label maps, and the source model is then further finetuned for the target domain.

Image Generation Semantic Segmentation

Paper
Add Code

Instance-Conditioned GAN Data Augmentation for Representation Learning

no code implementations • 16 Mar 2023 • Pietro Astolfi, Arantxa Casanova, Jakob Verbeek, Pascal Vincent, Adriana Romero-Soriano, Michal Drozdzal

We showcase the benefits of DA_IC-GAN by plugging it out-of-the-box into the supervised training of ResNets and DeiT models on the ImageNet dataset, and achieving accuracy boosts up to between 1%p and 2%p with the highest capacity models.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models

no code implementations • 26 Jan 2023 • Matthew J. Muckley, Alaaeldin El-Nouby, Karen Ullrich, Hervé Jégou, Jakob Verbeek

Lossy image compression aims to represent images in as few bits as possible while maintaining fidelity to the original.

Image Compression MS-SSIM +1

Paper
Add Code

Co-Training 2L Submodels for Visual Recognition

1 code implementation • CVPR 2023 • Hugo Touvron, Matthieu Cord, Maxime Oquab, Piotr Bojanowski, Jakob Verbeek, Hervé Jégou

Given a neural network to be trained, for each sample we implicitly instantiate two altered networks, "submodels", with stochastic depth: i. e. activating only a subset of the layers and skipping others.

Image Classification Semantic Segmentation

3,869

Paper
Code

Image Compression with Product Quantized Masked Image Modeling

no code implementations • 14 Dec 2022 • Alaaeldin El-Nouby, Matthew J. Muckley, Karen Ullrich, Ivan Laptev, Jakob Verbeek, Hervé Jégou

In this work, we attempt to bring these lines of research closer by revisiting vector quantization for image compression.

Image Compression Image Generation +3

Paper
Add Code

Co-training $2^L$ Submodels for Visual Recognition

1 code implementation • 9 Dec 2022 • Hugo Touvron, Matthieu Cord, Maxime Oquab, Piotr Bojanowski, Jakob Verbeek, Hervé Jégou

We introduce submodel co-training, a regularization method related to co-training, self-distillation and stochastic depth.

Ranked #69 on Image Classification on ImageNet

Image Classification Semantic Segmentation

3,869

Paper
Code

Unifying conditional and unconditional semantic image synthesis with OCO-GAN

no code implementations • 25 Nov 2022 • Marlène Careil, Stéphane Lathuilière, Camille Couprie, Jakob Verbeek

To allow for more control, image synthesis can be conditioned on semantic segmentation maps that instruct the generator the position of objects in the image.

Image Generation Semantic Segmentation

Paper
Add Code

DiffEdit: Diffusion-based semantic image editing with mask guidance

4 code implementations • 20 Oct 2022 • Guillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord

Semantic image editing is an extension of image generation, with the additional constraint that the generated image should be as similar as possible to a given input image.

Image Generation

Paper
Code

Three things everyone should know about Vision Transformers

6 code implementations • 18 Mar 2022 • Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Jakob Verbeek, Hervé Jégou

(2) Fine-tuning the weights of the attention layers is sufficient to adapt vision transformers to a higher resolution and to other classification tasks.

Ranked #8 on Image Classification on CIFAR-10 (using extra training data)

Fine-Grained Image Classification

29,758

Paper
Code

FlexIT: Towards Flexible Semantic Image Translation

1 code implementation • CVPR 2022 • Guillaume Couairon, Asya Grechka, Jakob Verbeek, Holger Schwenk, Matthieu Cord

Via the latent space of an auto-encoder, we iteratively transform the input image toward the target point, ensuring coherence and quality with a variety of novel regularization terms.

Image Generation Translation

Paper
Code

Instance-Conditioned GAN

1 code implementation • NeurIPS 2021 • Arantxa Casanova, Marlène Careil, Jakob Verbeek, Michal Drozdzal, Adriana Romero-Soriano

Generative Adversarial Networks (GANs) can generate near photo realistic images in narrow domains such as human faces.

Ranked #1 on Conditional Image Generation on ImageNet 64x64

Conditional Image Generation Unconditional Image Generation

530

Paper
Code

XCiT: Cross-Covariance Image Transformers

11 code implementations • NeurIPS 2021 • Alaaeldin El-Nouby, Hugo Touvron, Mathilde Caron, Piotr Bojanowski, Matthijs Douze, Armand Joulin, Ivan Laptev, Natalia Neverova, Gabriel Synnaeve, Jakob Verbeek, Hervé Jegou

We propose a "transposed" version of self-attention that operates across feature channels rather than tokens, where the interactions are based on the cross-covariance matrix between keys and queries.

Ranked #55 on Instance Segmentation on COCO minival

Instance Segmentation object-detection +3

29,758

Paper
Code

ResMLP: Feedforward networks for image classification with data-efficient training

15 code implementations • NeurIPS 2021 • Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Hervé Jégou

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification.

Ranked #1 on Image Classification on Certificate Verification

Data Augmentation Fine-Grained Image Classification +4

29,758

Paper
Code

Dual Mesh Convolutional Networks for Human Shape Correspondence

no code implementations • 23 Mar 2021 • Nitika Verma, Adnane Boukhayma, Jakob Verbeek, Edmond Boyer

Convolutional networks have been extremely successful for regular data structures such as 2D images and 3D voxel grids.

3D Shape Representation

Paper
Add Code

Meta-Learning with Shared Amortized Variational Inference

1 code implementation • ICML 2020 • Ekaterina Iakovleva, Jakob Verbeek, Karteek Alahari

We propose a novel amortized variational inference scheme for an empirical Bayes meta-learning model, where model parameters are treated as latent variables.

Meta-Learning Variational Inference

Paper
Code

Discrete Point Flow Networks for Efficient Point Cloud Generation

1 code implementation • ECCV 2020 • Roman Klokov, Edmond Boyer, Jakob Verbeek

Generative models have proven effective at modeling 3D shapes and their statistical variations.

3D Shape Representation Point Cloud Generation

Paper
Code

Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English

1 code implementation • COLING 2020 • Maha Elbayad, Michael Ustaszewski, Emmanuelle Esperança-Rodier, Francis Brunet Manquat, Jakob Verbeek, Laurent Besacier

We conduct in this work an evaluation study comparing offline and online neural machine translation architectures.

Machine Translation NMT +1

Paper
Code

Efficient Wait-k Models for Simultaneous Machine Translation

1 code implementation • 18 May 2020 • Maha Elbayad, Laurent Besacier, Jakob Verbeek

We also show that the 2D-convolution architecture is competitive with Transformers for simultaneous translation of spoken language.

Machine Translation Translation

497

Paper
Code

Anytime Inference with Distilled Hierarchical Neural Ensembles

1 code implementation • 3 Mar 2020 • Adria Ruiz, Jakob Verbeek

We propose Hierarchical Neural Ensembles (HNE), a novel framework to embed an ensemble of multiple networks in a hierarchical tree structure, sharing intermediate layers.

Image Classification

Paper
Code

Improved Training Techniques for Online Neural Machine Translation

no code implementations • 25 Sep 2019 • Maha Elbayad, Laurent Besacier, Jakob Verbeek

We investigate the sensitivity of such models to the value of k that is used during training and when deploying the model, and the effect of updating the hidden states in transformer models as new source tokens are read.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

no code implementations • CVPR 2020 • Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

Classification Data Augmentation +4

Paper
Add Code

Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image

1 code implementation • 20 Aug 2019 • Roman Klokov, Jakob Verbeek, Edmond Boyer

We study end-to-end learning strategies for 3D shape inference from images, in particular from a single image.

3D Reconstruction

Paper
Code

Adaptative Inference Cost With Convolutional Neural Mixture Models

no code implementations • ICCV 2019 • Adria Ruiz, Jakob Verbeek

Despite the outstanding performance of convolutional neural networks (CNNs) for many vision tasks, the required computational cost during inference is problematic when resources are limited.

Image Classification Semantic Segmentation

Paper
Add Code

Reference-based Variational Autoencoders

no code implementations • ICLR Workshop LLD 2019 • Adrià Ruiz, Oriol Martinez, Xavier Binefa, Jakob Verbeek

Given a pool of unlabelled images, the goal is to learn a representation where a set of target factors are disentangled from others.

Attribute Conditional Image Generation

Paper
Add Code

Learning Disentangled Representations with Reference-Based Variational Autoencoders

no code implementations • 24 Jan 2019 • Adria Ruiz, Oriol Martinez, Xavier Binefa, Jakob Verbeek

Given a pool of unlabeled images, the goal is to learn a representation where a set of target factors are disentangled from others.

Attribute Conditional Image Generation

Paper
Add Code

Adaptive Density Estimation for Generative Models

no code implementations • NeurIPS 2019 • Thomas Lucas, Konstantin Shmelkov, Karteek Alahari, Cordelia Schmid, Jakob Verbeek

We show that our model significantly improves over existing hybrid models: offering GAN-like samples, IS and FID scores that are competitive with fully adversarial models, and improved likelihood scores.

Density Estimation

Paper
Add Code

Understanding Priors in Bayesian Neural Networks at the Unit Level

no code implementations • 11 Oct 2018 • Mariia Vladimirova, Jakob Verbeek, Pablo Mesejo, Julyan Arbel

We investigate deep Bayesian neural networks with Gaussian weight priors and a class of ReLU-like nonlinearities.

Paper
Add Code

Coverage and Quality Driven Training of Generative Image Models

no code implementations • 27 Sep 2018 • Thomas Lucas, Konstantin Shmelkov, Karteek Alahari, Cordelia Schmid, Jakob Verbeek

First, we propose a model that extends variational autoencoders by using deterministic invertible transformation layers to map samples from the decoder to the image space.

Paper
Add Code

Scene Coordinate Regression with Angle-Based Reprojection Loss for Camera Relocalization

no code implementations • 15 Aug 2018 • Xiaotian Li, Juha Ylioinas, Jakob Verbeek, Juho Kannala

Image-based camera relocalization is an important problem in computer vision and robotics.

Camera Relocalization regression

Paper
Add Code

Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

3 code implementations • CONLL 2018 • Maha Elbayad, Laurent Besacier, Jakob Verbeek

Current state-of-the-art machine translation systems are based on encoder-decoder architectures, that first encode the input sequence, and then generate an output sequence based on the input encoding.

Ranked #2 on Machine Translation on IWSLT2015 German-English

Machine Translation Translation

496

Paper
Code

Mixed batches and symmetric discriminators for GAN training

no code implementations • ICML 2018 • Thomas Lucas, Corentin Tallec, Jakob Verbeek, Yann Ollivier

We propose to feed the discriminator with mixed batches of true and fake samples, and train it to predict the ratio of true samples in the batch.

Paper
Add Code

Token-level and sequence-level loss smoothing for RNN language models

1 code implementation • ACL 2018 • Maha Elbayad, Laurent Besacier, Jakob Verbeek

We extend this approach to token-level loss smoothing, and propose improvements to the sequence-level smoothing approach.

Image Captioning Machine Translation +1

Paper
Code

Predicting Future Instance Segmentation by Forecasting Convolutional Features

1 code implementation • ECCV 2018 • Pauline Luc, Camille Couprie, Yann Lecun, Jakob Verbeek

We apply the "detection head'" of Mask R-CNN on the predicted features to produce the instance segmentation of future frames.

Instance Segmentation Optical Flow Estimation +3

Paper
Code

Auxiliary Guided Autoregressive Variational Autoencoders

no code implementations • ICLR 2018 • Thomas Lucas, Jakob Verbeek

Our contribution is a training procedure relying on an auxiliary loss function that controls which information is captured by the latent variables and what is left to the autoregressive decoder.

Paper
Add Code

FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis

1 code implementation • CVPR 2018 • Nitika Verma, Edmond Boyer, Jakob Verbeek

Convolutional neural networks (CNNs) have massively impacted visual recognition in 2D images, and are now ubiquitous in state-of-the-art approaches.

Paper
Code

Predicting Deeper into the Future of Semantic Segmentation

2 code implementations • ICCV 2017 • Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann Lecun

The ability to predict and therefore to anticipate the future is an important attribute of intelligence.

Attribute Autonomous Driving +5

Paper
Code

Areas of Attention for Image Captioning

no code implementations • ICCV 2017 • Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek

We propose "Areas of Attention", a novel attention-based model for automatic image captioning.

Image Captioning Language Modelling

Paper
Add Code

Semantic Segmentation using Adversarial Networks

1 code implementation • 25 Nov 2016 • Pauline Luc, Camille Couprie, Soumith Chintala, Jakob Verbeek

Adversarial training has been shown to produce state of the art results for generative image modeling.

Segmentation Semantic Segmentation

Paper
Code

Convolutional Neural Fabrics

no code implementations • NeurIPS 2016 • Shreyas Saxena, Jakob Verbeek

Despite the success of CNNs, selecting the optimal architecture for a given task remains an open problem.

Image Classification Semantic Segmentation

Paper
Add Code

Frankenstein: Learning Deep Face Representations using Small Data

no code implementations • 21 Mar 2016 • Guosheng Hu, Xiaojiang Peng, Yongxin Yang, Timothy Hospedales, Jakob Verbeek

To train such networks, very large training sets are needed with millions of labeled images.

Face Recognition Heterogeneous Face Recognition +1

Paper
Add Code

Approximate Fisher Kernels of non-iid Image Models for Image Categorization

no code implementations • 3 Oct 2015 • Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

It has been experimentally observed that the performance of BoW and FV representations can be improved by employing discounting transformations such as power normalization.

Image Categorization

Paper
Add Code

Circulant temporal encoding for video retrieval and temporal alignment

1 code implementation • 8 Jun 2015 • Matthijs Douze, Jérôme Revaud, Jakob Verbeek, Hervé Jégou, Cordelia Schmid

We address the problem of specific video event retrieval.

Retrieval Video Retrieval

132

Paper
Code

A robust and efficient video representation for action recognition

no code implementations • 21 Apr 2015 • Heng Wang, Dan Oneata, Jakob Verbeek, Cordelia Schmid

We also use the homography to cancel out camera motion from the optical flow.

Action Recognition Homography Estimation +4

Paper
Add Code

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

no code implementations • 3 Mar 2015 • Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

In this case, the supervised information is restricted to binary labels that indicate the absence/presence of object instances in the image, without their locations.

Multiple Instance Learning Object +2

Paper
Add Code

Multi-fold MIL Training for Weakly Supervised Object Localization

no code implementations • CVPR 2014 • Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

In this case, the supervised information is restricted to binary labels that indicate the absence/presence of object instances in the image, without their locations.

Multiple Instance Learning Object +2

Paper
Add Code

Efficient Action Localization with Approximately Normalized Fisher Vectors

no code implementations • CVPR 2014 • Dan Oneata, Jakob Verbeek, Cordelia Schmid

Transformation of the FV by power and L2 normalizations has shown to significantly improve its performance, and led to state-of-the-art results for a range of image and video classification and retrieval tasks.

Action Recognition General Classification +4

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.