Search Results for author: Jakob Verbeek

Found 57 papers, 24 papers with code

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs

no code implementations20 Mar 2024 Théophane Vallaeys, Mustafa Shukor, Matthieu Cord, Jakob Verbeek

The abilities of large language models (LLMs) have recently progressed to unprecedented levels, paving the way to novel applications in a wide variety of areas.

Audio captioning Image Captioning +2

Better (pseudo-)labels for semi-supervised instance segmentation

no code implementations18 Mar 2024 François Porcher, Camille Couprie, Marc Szafraniec, Jakob Verbeek

Despite the availability of large datasets for tasks like image classification and image-text alignment, labeled data for more complex recognition tasks, such as detection and segmentation, is less abundant.

Few-Shot Learning Image Classification +3

Residual Quantization with Implicit Neural Codebooks

1 code implementation26 Jan 2024 Iris Huijben, Matthijs Douze, Matthew Muckley, Ruud Van Sloun, Jakob Verbeek

In this paper, we propose QINCo, a neural RQ variant which predicts specialized codebooks per vector using a neural network that is conditioned on the approximation of the vector from previous steps.

Data Compression Quantization

Unlocking Pre-trained Image Backbones for Semantic Image Synthesis

no code implementations20 Dec 2023 Tariq Berrada, Jakob Verbeek, Camille Couprie, Karteek Alahari

Semantic image synthesis, i. e., generating images from user-provided semantic label maps, is an important conditional image generation task as it allows to control both the content as well as the spatial layout of generated images.

Conditional Image Generation Image Classification +1

Towards image compression with perfect realism at ultra-low bitrates

no code implementations16 Oct 2023 Marlène Careil, Matthew J. Muckley, Jakob Verbeek, Stéphane Lathuilière

We find that our model leads to reconstructions with state-of-the-art visual quality as measured by FID and KID.

Image Compression

Guided Distillation for Semi-Supervised Instance Segmentation

1 code implementation3 Aug 2023 Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek

Although instance segmentation methods have improved considerably, the dominant paradigm is to rely on fully-annotated training images, which are tedious to obtain.

Instance Segmentation Semantic Segmentation +1

Multi-Domain Learning with Modulation Adapters

no code implementations17 Jul 2023 Ekaterina Iakovleva, Karteek Alahari, Jakob Verbeek

Deep convolutional networks are ubiquitous in computer vision, due to their excellent performance across different tasks for various domains.

Image Classification

Zero-shot spatial layout conditioning for text-to-image diffusion models

no code implementations ICCV 2023 Guillaume Couairon, Marlène Careil, Matthieu Cord, Stéphane Lathuilière, Jakob Verbeek

Large-scale text-to-image diffusion models have significantly improved the state of the art in generative image modelling and allow for an intuitive and powerful user interface to drive the image generation process.

Image Generation Segmentation +1

Improved baselines for vision-language pre-training

1 code implementation15 May 2023 Enrico Fini, Pietro Astolfi, Adriana Romero-Soriano, Jakob Verbeek, Michal Drozdzal

Indeed, we find that a simple CLIP baseline can also be improved substantially, up to a 25% relative improvement on downstream zero-shot tasks, by using well-known training techniques that are popular in other subfields.

Contrastive Learning Data Augmentation +1

Controllable Image Generation via Collage Representations

no code implementations26 Apr 2023 Arantxa Casanova, Marlène Careil, Adriana Romero-Soriano, Christopher J. Pal, Jakob Verbeek, Michal Drozdzal

Our experiments on the OI dataset show that M&Ms outperforms baselines in terms of fine-grained scene controllability while being very competitive in terms of image quality and sample diversity.

Attribute Image Generation

Are Visual Recognition Models Robust to Image Compression?

no code implementations10 Apr 2023 João Maria Janeiro, Stanislav Frolov, Alaaeldin El-Nouby, Jakob Verbeek

For example, for segmentation mIoU is reduced from 44. 5 to 30. 5 mIoU when compressing to 0. 1 bpp using the best compression model we evaluated.

Image Classification Image Compression +4

Few-shot Semantic Image Synthesis with Class Affinity Transfer

no code implementations CVPR 2023 Marlène Careil, Jakob Verbeek, Stéphane Lathuilière

The class affinity matrix is introduced as a first layer to the source model to make it compatible with the target label maps, and the source model is then further finetuned for the target domain.

Image Generation Semantic Segmentation

Instance-Conditioned GAN Data Augmentation for Representation Learning

no code implementations16 Mar 2023 Pietro Astolfi, Arantxa Casanova, Jakob Verbeek, Pascal Vincent, Adriana Romero-Soriano, Michal Drozdzal

We showcase the benefits of DA_IC-GAN by plugging it out-of-the-box into the supervised training of ResNets and DeiT models on the ImageNet dataset, and achieving accuracy boosts up to between 1%p and 2%p with the highest capacity models.

Data Augmentation Few-Shot Learning +1

Co-Training 2L Submodels for Visual Recognition

1 code implementation CVPR 2023 Hugo Touvron, Matthieu Cord, Maxime Oquab, Piotr Bojanowski, Jakob Verbeek, Hervé Jégou

Given a neural network to be trained, for each sample we implicitly instantiate two altered networks, "submodels", with stochastic depth: i. e. activating only a subset of the layers and skipping others.

Image Classification Semantic Segmentation

Co-training $2^L$ Submodels for Visual Recognition

1 code implementation9 Dec 2022 Hugo Touvron, Matthieu Cord, Maxime Oquab, Piotr Bojanowski, Jakob Verbeek, Hervé Jégou

We introduce submodel co-training, a regularization method related to co-training, self-distillation and stochastic depth.

Image Classification Semantic Segmentation

Unifying conditional and unconditional semantic image synthesis with OCO-GAN

no code implementations25 Nov 2022 Marlène Careil, Stéphane Lathuilière, Camille Couprie, Jakob Verbeek

To allow for more control, image synthesis can be conditioned on semantic segmentation maps that instruct the generator the position of objects in the image.

Image Generation Semantic Segmentation

DiffEdit: Diffusion-based semantic image editing with mask guidance

4 code implementations20 Oct 2022 Guillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord

Semantic image editing is an extension of image generation, with the additional constraint that the generated image should be as similar as possible to a given input image.

Image Generation

Three things everyone should know about Vision Transformers

6 code implementations18 Mar 2022 Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Jakob Verbeek, Hervé Jégou

(2) Fine-tuning the weights of the attention layers is sufficient to adapt vision transformers to a higher resolution and to other classification tasks.

Ranked #8 on Image Classification on CIFAR-10 (using extra training data)

Fine-Grained Image Classification

FlexIT: Towards Flexible Semantic Image Translation

1 code implementation CVPR 2022 Guillaume Couairon, Asya Grechka, Jakob Verbeek, Holger Schwenk, Matthieu Cord

Via the latent space of an auto-encoder, we iteratively transform the input image toward the target point, ensuring coherence and quality with a variety of novel regularization terms.

Image Generation Translation

XCiT: Cross-Covariance Image Transformers

11 code implementations NeurIPS 2021 Alaaeldin El-Nouby, Hugo Touvron, Mathilde Caron, Piotr Bojanowski, Matthijs Douze, Armand Joulin, Ivan Laptev, Natalia Neverova, Gabriel Synnaeve, Jakob Verbeek, Hervé Jegou

We propose a "transposed" version of self-attention that operates across feature channels rather than tokens, where the interactions are based on the cross-covariance matrix between keys and queries.

Instance Segmentation object-detection +3

Dual Mesh Convolutional Networks for Human Shape Correspondence

no code implementations23 Mar 2021 Nitika Verma, Adnane Boukhayma, Jakob Verbeek, Edmond Boyer

Convolutional networks have been extremely successful for regular data structures such as 2D images and 3D voxel grids.

3D Shape Representation

Meta-Learning with Shared Amortized Variational Inference

1 code implementation ICML 2020 Ekaterina Iakovleva, Jakob Verbeek, Karteek Alahari

We propose a novel amortized variational inference scheme for an empirical Bayes meta-learning model, where model parameters are treated as latent variables.

Meta-Learning Variational Inference

Efficient Wait-k Models for Simultaneous Machine Translation

1 code implementation18 May 2020 Maha Elbayad, Laurent Besacier, Jakob Verbeek

We also show that the 2D-convolution architecture is competitive with Transformers for simultaneous translation of spoken language.

Machine Translation Translation

Anytime Inference with Distilled Hierarchical Neural Ensembles

1 code implementation3 Mar 2020 Adria Ruiz, Jakob Verbeek

We propose Hierarchical Neural Ensembles (HNE), a novel framework to embed an ensemble of multiple networks in a hierarchical tree structure, sharing intermediate layers.

Image Classification

Improved Training Techniques for Online Neural Machine Translation

no code implementations25 Sep 2019 Maha Elbayad, Laurent Besacier, Jakob Verbeek

We investigate the sensitivity of such models to the value of k that is used during training and when deploying the model, and the effect of updating the hidden states in transformer models as new source tokens are read.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

no code implementations CVPR 2020 Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

Classification Data Augmentation +4

Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image

1 code implementation20 Aug 2019 Roman Klokov, Jakob Verbeek, Edmond Boyer

We study end-to-end learning strategies for 3D shape inference from images, in particular from a single image.

3D Reconstruction

Adaptative Inference Cost With Convolutional Neural Mixture Models

no code implementations ICCV 2019 Adria Ruiz, Jakob Verbeek

Despite the outstanding performance of convolutional neural networks (CNNs) for many vision tasks, the required computational cost during inference is problematic when resources are limited.

Image Classification Semantic Segmentation

Reference-based Variational Autoencoders

no code implementations ICLR Workshop LLD 2019 Adrià Ruiz, Oriol Martinez, Xavier Binefa, Jakob Verbeek

Given a pool of unlabelled images, the goal is to learn a representation where a set of target factors are disentangled from others.

Attribute Conditional Image Generation

Learning Disentangled Representations with Reference-Based Variational Autoencoders

no code implementations24 Jan 2019 Adria Ruiz, Oriol Martinez, Xavier Binefa, Jakob Verbeek

Given a pool of unlabeled images, the goal is to learn a representation where a set of target factors are disentangled from others.

Attribute Conditional Image Generation

Adaptive Density Estimation for Generative Models

no code implementations NeurIPS 2019 Thomas Lucas, Konstantin Shmelkov, Karteek Alahari, Cordelia Schmid, Jakob Verbeek

We show that our model significantly improves over existing hybrid models: offering GAN-like samples, IS and FID scores that are competitive with fully adversarial models, and improved likelihood scores.

Density Estimation

Understanding Priors in Bayesian Neural Networks at the Unit Level

no code implementations11 Oct 2018 Mariia Vladimirova, Jakob Verbeek, Pablo Mesejo, Julyan Arbel

We investigate deep Bayesian neural networks with Gaussian weight priors and a class of ReLU-like nonlinearities.

Coverage and Quality Driven Training of Generative Image Models

no code implementations27 Sep 2018 Thomas Lucas, Konstantin Shmelkov, Karteek Alahari, Cordelia Schmid, Jakob Verbeek

First, we propose a model that extends variational autoencoders by using deterministic invertible transformation layers to map samples from the decoder to the image space.

Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

3 code implementations CONLL 2018 Maha Elbayad, Laurent Besacier, Jakob Verbeek

Current state-of-the-art machine translation systems are based on encoder-decoder architectures, that first encode the input sequence, and then generate an output sequence based on the input encoding.

Machine Translation Translation

Mixed batches and symmetric discriminators for GAN training

no code implementations ICML 2018 Thomas Lucas, Corentin Tallec, Jakob Verbeek, Yann Ollivier

We propose to feed the discriminator with mixed batches of true and fake samples, and train it to predict the ratio of true samples in the batch.

Token-level and sequence-level loss smoothing for RNN language models

1 code implementation ACL 2018 Maha Elbayad, Laurent Besacier, Jakob Verbeek

We extend this approach to token-level loss smoothing, and propose improvements to the sequence-level smoothing approach.

Image Captioning Machine Translation +1

Auxiliary Guided Autoregressive Variational Autoencoders

no code implementations ICLR 2018 Thomas Lucas, Jakob Verbeek

Our contribution is a training procedure relying on an auxiliary loss function that controls which information is captured by the latent variables and what is left to the autoregressive decoder.

FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis

1 code implementation CVPR 2018 Nitika Verma, Edmond Boyer, Jakob Verbeek

Convolutional neural networks (CNNs) have massively impacted visual recognition in 2D images, and are now ubiquitous in state-of-the-art approaches.

Semantic Segmentation using Adversarial Networks

1 code implementation25 Nov 2016 Pauline Luc, Camille Couprie, Soumith Chintala, Jakob Verbeek

Adversarial training has been shown to produce state of the art results for generative image modeling.

Segmentation Semantic Segmentation

Convolutional Neural Fabrics

no code implementations NeurIPS 2016 Shreyas Saxena, Jakob Verbeek

Despite the success of CNNs, selecting the optimal architecture for a given task remains an open problem.

Image Classification Semantic Segmentation

Approximate Fisher Kernels of non-iid Image Models for Image Categorization

no code implementations3 Oct 2015 Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

It has been experimentally observed that the performance of BoW and FV representations can be improved by employing discounting transformations such as power normalization.

Image Categorization

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

no code implementations3 Mar 2015 Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

In this case, the supervised information is restricted to binary labels that indicate the absence/presence of object instances in the image, without their locations.

Multiple Instance Learning Object +2

Multi-fold MIL Training for Weakly Supervised Object Localization

no code implementations CVPR 2014 Ramazan Gokberk Cinbis, Jakob Verbeek, Cordelia Schmid

In this case, the supervised information is restricted to binary labels that indicate the absence/presence of object instances in the image, without their locations.

Multiple Instance Learning Object +2

Efficient Action Localization with Approximately Normalized Fisher Vectors

no code implementations CVPR 2014 Dan Oneata, Jakob Verbeek, Cordelia Schmid

Transformation of the FV by power and L2 normalizations has shown to significantly improve its performance, and led to state-of-the-art results for a range of image and video classification and retrieval tasks.

Action Recognition General Classification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.