2 code implementations • 29 May 2023 • Xi Chen, Josip Djolonga, Piotr Padlewski, Basil Mustafa, Soravit Changpinyo, Jialin Wu, Carlos Riquelme Ruiz, Sebastian Goodman, Xiao Wang, Yi Tay, Siamak Shakeri, Mostafa Dehghani, Daniel Salz, Mario Lucic, Michael Tschannen, Arsha Nagrani, Hexiang Hu, Mandar Joshi, Bo Pang, Ceslee Montgomery, Paulina Pietrzyk, Marvin Ritter, AJ Piergiovanni, Matthias Minderer, Filip Pavetic, Austin Waters, Gang Li, Ibrahim Alabdulmohsin, Lucas Beyer, Julien Amelot, Kenton Lee, Andreas Peter Steiner, Yang Li, Daniel Keysers, Anurag Arnab, Yuanzhong Xu, Keran Rong, Alexander Kolesnikov, Mojtaba Seyedhosseini, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut
We present the training recipe and results of scaling up PaLI-X, a multilingual vision and language model, both in terms of the size of its components and the breadth of its training task mixture.
Ranked #1 on Visual Question Answering (VQA) on OK-VQA
1 code implementation • ICCV 2023 • Mariana-Iuliana Georgescu, Eduardo Fonseca, Radu Tudor Ionescu, Mario Lucic, Cordelia Schmid, Anurag Arnab
Can we leverage the audiovisual information already present in video to improve self-supervised representation learning?
Ranked #1 on Audio Classification on EPIC-KITCHENS-100 (using extra training data)
no code implementations • CVPR 2023 • Mehdi S. M. Sajjadi, Aravindh Mahendran, Thomas Kipf, Etienne Pot, Daniel Duckworth, Mario Lucic, Klaus Greff
Our main insight is that one can train a Pose Encoder that peeks at the target image and learns a latent pose embedding which is used by the decoder for view synthesis.
1 code implementation • 15 Jun 2022 • Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur Agustsson
The resulting video compression transformer outperforms previous methods on standard video compression data sets.
no code implementations • 25 Nov 2021 • Valerii Likhosherstov, Anurag Arnab, Krzysztof Choromanski, Mario Lucic, Yi Tay, Adrian Weller, Mostafa Dehghani
Can we train a single transformer model capable of processing multiple modalities and datasets, whilst sharing almost all of its learnable parameters?
1 code implementation • CVPR 2022 • Mehdi S. M. Sajjadi, Henning Meyer, Etienne Pot, Urs Bergmann, Klaus Greff, Noha Radwan, Suhani Vora, Mario Lucic, Daniel Duckworth, Alexey Dosovitskiy, Jakob Uszkoreit, Thomas Funkhouser, Andrea Tagliasacchi
In this work, we propose the Scene Representation Transformer (SRT), a method which processes posed or unposed RGB images of a new area, infers a "set-latent scene representation", and synthesises novel views, all in a single feed-forward pass.
1 code implementation • NeurIPS 2021 • Matthias Minderer, Josip Djolonga, Rob Romijnders, Frances Hubis, Xiaohua Zhai, Neil Houlsby, Dustin Tran, Mario Lucic
Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks.
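For readers who want the metric made concrete, below is a minimal numpy sketch of the expected calibration error (ECE), the standard binned calibration measure; the bin count and synthetic predictions are illustrative assumptions, not details from the paper.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=15):
    """Binned ECE: per-bin |accuracy - mean confidence|, weighted by bin mass."""
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            ece += mask.mean() * abs(correct[mask].mean() - confidences[mask].mean())
    return ece

# Illustrative usage with synthetic predictions (not real model outputs).
rng = np.random.default_rng(0)
conf = rng.uniform(0.5, 1.0, size=10_000)
correct = rng.uniform(size=10_000) < conf  # perfectly calibrated by construction
print(f"ECE ~ {expected_calibration_error(conf, correct):.4f}")  # close to 0
```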
1 code implementation • NeurIPS 2021 • Ibrahim Alabdulmohsin, Mario Lucic
We present a scalable post-processing algorithm for debiasing trained models, including deep neural networks (DNNs), which we prove to be near-optimal by bounding its excess Bayes risk.
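The paper's own algorithm is not reproduced here; as a hedged illustration of the general post-processing idea, the sketch below adjusts a trained model's decision thresholds per group to equalize positive rates (a crude demographic-parity repair). The scores, groups, and threshold search are all synthetic assumptions.

```python
import numpy as np

def group_thresholds(scores, groups, target_rate):
    """For each group, pick the score threshold whose positive rate is closest
    to a common target rate (a simple demographic-parity post-processing)."""
    thresholds = {}
    for g in np.unique(groups):
        s = scores[groups == g]
        # Candidate thresholds: the observed scores themselves.
        rates = np.array([(s >= t).mean() for t in s])
        thresholds[g] = s[np.argmin(np.abs(rates - target_rate))]
    return thresholds

rng = np.random.default_rng(0)
groups = rng.integers(0, 2, size=5_000)
scores = rng.beta(2 + groups, 2, size=5_000)   # group 1 scores skew higher
thr = group_thresholds(scores, groups, target_rate=0.3)
for g, t in thr.items():
    rate = (scores[groups == g] >= t).mean()
    print(f"group {g}: threshold {t:.3f}, positive rate {rate:.3f}")
```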
46 code implementations • NeurIPS 2021 • Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, Alexey Dosovitskiy
Convolutional Neural Networks (CNNs) are the go-to model for computer vision.
Ranked #18 on Image Classification on OmniBenchmark
1 code implementation • 9 Apr 2021 • Jessica Yung, Rob Romijnders, Alexander Kolesnikov, Lucas Beyer, Josip Djolonga, Neil Houlsby, Sylvain Gelly, Mario Lucic, Xiaohua Zhai
Before deploying machine learning models, it is critical to assess their robustness.
no code implementations • 1 Jan 2021 • Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic
Coupling the high-fidelity generation capabilities of label-conditional image synthesis methods with the flexibility of unconditional generative models, we propose a semantic bottleneck GAN model for unconditional synthesis of complex scenes.
no code implementations • 1 Jan 2021 • Ibrahim Alabdulmohsin, Mario Lucic
We present an efficient and scalable algorithm for debiasing trained models, including deep neural networks (DNNs), which we prove to be near-optimal by bounding its excess Bayes risk.
no code implementations • 6 Nov 2020 • Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, Rajiv Raman, Kim Ramasamy, Rory Sayres, Jessica Schrouff, Martin Seneviratne, Shannon Sequeira, Harini Suresh, Victor Veitch, Max Vladymyrov, Xuezhi Wang, Kellie Webster, Steve Yadlowsky, Taedong Yun, Xiaohua Zhai, D. Sculley
Predictors returned by underspecified pipelines are often treated as equivalent based on their training domain performance, but we show here that such predictors can behave very differently in deployment domains.
no code implementations • 27 Oct 2020 • Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem
The idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms.
no code implementations • CVPR 2022 • Cedric Renggli, André Susano Pinto, Luka Rimanic, Joan Puigcerver, Carlos Riquelme, Ce Zhang, Mario Lucic
Transfer learning has recently been popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks, where it provides a remarkably solid baseline.
no code implementations • 6 Oct 2020 • Rob Romijnders, Aravindh Mahendran, Michael Tschannen, Josip Djolonga, Marvin Ritter, Neil Houlsby, Mario Lucic
We propose a method to learn image representations from uncurated videos.
no code implementations • 28 Jul 2020 • Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem
The goal of the unsupervised learning of disentangled representations is to separate the independent explanatory factors of variation in the data without access to supervision.
1 code implementation • CVPR 2021 • Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic
Modern deep convolutional networks (CNNs) are often criticized for not generalizing under distributional shifts.
no code implementations • CVPR 2020 • Michael Tschannen, Josip Djolonga, Marvin Ritter, Aravindh Mahendran, Xiaohua Zhai, Neil Houlsby, Sylvain Gelly, Mario Lucic
We propose a general framework for self-supervised learning of transferable visual representations based on Video-Induced Visual Invariances (VIVI).
Ranked #15 on Image Classification on VTAB-1k (using extra training data)
2 code implementations • 26 Nov 2019 • Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic
For the former, we use an unconditional progressive segmentation generation network that captures the distribution of realistic semantic scene layouts.
Ranked #1 on Image Generation on Cityscapes-5K 256x512
2 code implementations • arXiv 2020 • Xiaohua Zhai, Joan Puigcerver, Alexander Kolesnikov, Pierre Ruyssen, Carlos Riquelme, Mario Lucic, Josip Djolonga, Andre Susano Pinto, Maxim Neumann, Alexey Dosovitskiy, Lucas Beyer, Olivier Bachem, Michael Tschannen, Marcin Michalski, Olivier Bousquet, Sylvain Gelly, Neil Houlsby
How close are we to general visual representations?
Ranked #10 on Image Classification on VTAB-1k (using extra training data)
no code implementations • 25 Sep 2019 • Xiaohua Zhai, Joan Puigcerver, Alexander Kolesnikov, Pierre Ruyssen, Carlos Riquelme, Mario Lucic, Josip Djolonga, Andre Susano Pinto, Maxim Neumann, Alexey Dosovitskiy, Lucas Beyer, Olivier Bachem, Michael Tschannen, Marcin Michalski, Olivier Bousquet, Sylvain Gelly, Neil Houlsby
Representation learning promises to unlock deep learning for the long tail of vision tasks without expensive labelled datasets.
2 code implementations • ICLR 2020 • Michael Tschannen, Josip Djolonga, Paul K. Rubenstein, Sylvain Gelly, Mario Lucic
Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data.
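One widely used instance of such estimators is the InfoNCE bound; below is a minimal numpy sketch, with synthetic embeddings and an assumed temperature.

```python
import numpy as np

def info_nce(z1, z2, temperature=0.1):
    """InfoNCE: each view-1 embedding must identify its paired view-2 embedding
    within the batch; the negative loss lower-bounds MI up to log(batch_size)."""
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / temperature              # (n, n) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))           # positive pairs on the diagonal

rng = np.random.default_rng(0)
x = rng.normal(size=(256, 64))
loss_paired = info_nce(x, x + 0.1 * rng.normal(size=x.shape))  # correlated views
loss_random = info_nce(x, rng.normal(size=x.shape))            # independent views
print(loss_paired < loss_random)  # True: correlated views give a lower loss
```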
no code implementations • 26 May 2019 • Josip Djolonga, Mario Lucic, Marco Cuturi, Olivier Bachem, Olivier Bousquet, Sylvain Gelly
Despite the tremendous progress in the estimation of generative models, the development of tools for diagnosing their failures and assessing their performance has advanced at a much slower pace.
1 code implementation • 6 Mar 2019 • Mario Lucic, Michael Tschannen, Marvin Ritter, Xiaohua Zhai, Olivier Bachem, Sylvain Gelly
Deep generative models are becoming a cornerstone of modern machine learning.
Ranked #10 on Conditional Image Generation on ImageNet 128x128
no code implementations • 12 Dec 2018 • Michael Tschannen, Olivier Bachem, Mario Lucic
We provide an analysis of autoencoder-based representation learning through the lens of rate-distortion theory and identify a clear tradeoff between the amount of prior knowledge available about the downstream tasks and how useful the representation is for those tasks.
7 code implementations • ICML 2019 • Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem
The key idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms.
4 code implementations • CVPR 2019 • Ting Chen, Xiaohua Zhai, Marvin Ritter, Mario Lucic, Neil Houlsby
In this work we exploit two popular unsupervised learning techniques, adversarial training and self-supervision, and take a step towards bridging the gap between conditional and unconditional GANs.
Ranked #6 on Image Generation on CelebA-HQ 128x128
2 code implementations • ICLR 2019 • Ting Chen, Mario Lucic, Neil Houlsby, Sylvain Gelly
Training Generative Adversarial Networks (GANs) is notoriously challenging.
5 code implementations • ICLR 2019 • Karol Kurach, Mario Lucic, Xiaohua Zhai, Marcin Michalski, Sylvain Gelly
Generative adversarial networks (GANs) are a class of deep generative models which aim to learn a target distribution in an unsupervised fashion.
4 code implementations • NeurIPS 2018 • Mehdi S. M. Sajjadi, Olivier Bachem, Mario Lucic, Olivier Bousquet, Sylvain Gelly
Recent advances in generative modeling have led to an increased interest in the study of statistical divergences as means of model comparison.
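A compact sketch of the precision-recall construction for distributions proposed in this work, assuming the two distributions are summarized as histograms over a shared binning; the toy Gaussian data and the bin count are illustrative assumptions.

```python
import numpy as np

def prd_curve(p, q, num_angles=201):
    """Precision-recall pairs (alpha, beta) for histograms p (reference) and
    q (model): alpha(l) = sum min(l * p, q) and beta(l) = alpha(l) / l,
    with l swept over (0, inf) via angles in (0, pi/2)."""
    lambdas = np.tan(np.linspace(1e-6, np.pi / 2 - 1e-6, num_angles))
    alphas = np.array([np.minimum(l * p, q).sum() for l in lambdas])
    return alphas, alphas / lambdas

rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=10_000)
fake = rng.normal(0.5, 1.0, size=10_000)   # shifted model distribution
bins = np.histogram_bin_edges(np.concatenate([real, fake]), bins=50)
p, _ = np.histogram(real, bins=bins)
q, _ = np.histogram(fake, bins=bins)
p, q = p / p.sum(), q / q.sum()
alphas, betas = prd_curve(p, q)
print(f"max precision {alphas.max():.3f}, max recall {betas.max():.3f}")
```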
1 code implementation • NeurIPS 2018 • Michael Tschannen, Eirikur Agustsson, Mario Lucic
We propose and study the problem of distribution-preserving lossy compression.
9 code implementations • NeurIPS 2018 • Mario Lucic, Karol Kurach, Marcin Michalski, Sylvain Gelly, Olivier Bousquet
Generative adversarial networks (GANs) are a powerful subclass of generative models.
no code implementations • 27 Nov 2017 • Olivier Bachem, Mario Lucic, Silvio Lattanzi
Scaling clustering algorithms to massive data sets is a challenging task.
no code implementations • NeurIPS 2017 • Mohammad Reza Karimi, Mario Lucic, Hamed Hassani, Andreas Krause
By exploiting that common extensions act linearly on the class of submodular functions, we employ projected stochastic gradient ascent and its variants in the continuous domain, and perform rounding to obtain discrete solutions.
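A heavily simplified numpy sketch of this recipe for a monotone set-coverage objective under a cardinality constraint: estimate the gradient of the multilinear extension by sampling, take projected ascent steps, then round. The random instance, sample counts, step size, and the crude top-k rounding are illustrative assumptions (the paper relies on principled rounding schemes rather than top-k).

```python
import numpy as np

def coverage(mask, cover):
    """f(S) = number of universe elements covered by the selected sets."""
    return cover[mask].any(axis=0).sum()

def grad_estimate(x, cover, rng, n_samples=4):
    """Sampled gradient of the multilinear extension F(x) = E[f(R)], R ~ x:
    dF/dx_i = E[f(R + i) - f(R - i)]."""
    g = np.zeros(len(x))
    for _ in range(n_samples):
        r = rng.random(len(x)) < x
        for i in range(len(x)):
            orig = r[i]
            r[i] = True
            f_with = coverage(r, cover)
            r[i] = False
            g[i] += f_with - coverage(r, cover)
            r[i] = orig
    return g / n_samples

def project(x, k):
    """Euclidean projection onto {x in [0,1]^n : sum(x) <= k} by bisection."""
    x = np.clip(x, 0.0, 1.0)
    if x.sum() <= k:
        return x
    lo, hi = 0.0, x.max()
    for _ in range(50):
        mid = (lo + hi) / 2.0
        if np.clip(x - mid, 0.0, 1.0).sum() > k:
            lo = mid
        else:
            hi = mid
    return np.clip(x - hi, 0.0, 1.0)

rng = np.random.default_rng(0)
n_sets, k = 30, 5
cover = rng.random((n_sets, 200)) < 0.05       # random set-coverage instance

x = np.full(n_sets, k / n_sets)                # feasible fractional start
for _ in range(50):
    x = project(x + 0.05 * grad_estimate(x, cover, rng), k)

chosen = np.isin(np.arange(n_sets), np.argsort(x)[-k:])
print("elements covered by rounded solution:", coverage(chosen, cover))
```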
no code implementations • ICML 2017 • Olivier Bachem, Mario Lucic, Andreas Krause
The k-Means++ algorithm is the state-of-the-art algorithm for solving k-Means clustering problems, as the computed clusterings are O(log k)-competitive in expectation.
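For context, here is a compact numpy implementation of the k-means++ (D^2) seeding rule referenced above; the toy data are an assumption.

```python
import numpy as np

def kmeans_pp_seeding(X, k, rng):
    """k-means++: pick the first center uniformly at random, then sample each
    subsequent center with probability proportional to its squared distance
    to the nearest center chosen so far (D^2 sampling)."""
    centers = [X[rng.integers(len(X))]]
    for _ in range(k - 1):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[rng.choice(len(X), p=d2 / d2.sum())])
    return np.stack(centers)

rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(m, 0.3, size=(500, 2)) for m in (-3, 0, 3)])
print(kmeans_pp_seeding(X, k=3, rng=rng))  # one seed lands near each cluster
```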
no code implementations • ICML 2017 • Olivier Bachem, Mario Lucic, S. Hamed Hassani, Andreas Krause
In this paper, we provide a novel framework to obtain uniform deviation bounds for loss functions which are unbounded.
no code implementations • 23 Mar 2017 • Mario Lucic, Matthew Faulkner, Andreas Krause, Dan Feldman
In this work we show how to construct coresets for mixtures of Gaussians.
2 code implementations • 19 Mar 2017 • Olivier Bachem, Mario Lucic, Andreas Krause
We investigate coresets - succinct, small summaries of large data sets - so that solutions found on the summary are provably competitive with solutions found on the full data set.
1 code implementation • 27 Feb 2017 • Olivier Bachem, Mario Lucic, Andreas Krause
As such, coresets have been successfully used to scale up clustering models to massive data sets.
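One representative construction from this line of work is the lightweight coreset for k-means, which splits the sampling mass between uniform sampling and sampling proportional to squared distance from the data mean; the sketch below follows that recipe on synthetic data (sizes and data are assumptions).

```python
import numpy as np

def lightweight_coreset(X, m, rng):
    """Sample m weighted points: half the sampling mass is uniform, half is
    proportional to squared distance from the data mean; weights 1/(m*q)
    keep the weighted cost an unbiased estimate of the full k-means cost."""
    d2 = ((X - X.mean(axis=0)) ** 2).sum(axis=1)
    q = 0.5 / len(X) + 0.5 * d2 / d2.sum()
    idx = rng.choice(len(X), size=m, p=q)
    return X[idx], 1.0 / (m * q[idx])

rng = np.random.default_rng(0)
X = rng.normal(size=(100_000, 16))
C, w = lightweight_coreset(X, m=1_000, rng=rng)
# The total coreset weight approximates the full data set size in expectation.
print(len(C), f"{w.sum():.0f} ~ {len(X)}")
```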
1 code implementation • NeurIPS 2016 • Olivier Bachem, Mario Lucic, Hamed Hassani, Andreas Krause
Seeding - the task of finding initial cluster centers - is critical in obtaining high-quality clusterings for k-Means.
no code implementations • 31 May 2016 • Mario Lucic, Olivier Bachem, Morteza Zadimoghaddam, Andreas Krause
A variety of large-scale machine learning problems can be cast as instances of constrained submodular maximization.
no code implementations • 2 May 2016 • Mario Lucic, Mesrob I. Ohannessian, Amin Karbasi, Andreas Krause
Using k-means clustering as a prototypical unsupervised learning problem, we show how we can strategically summarize the data (control space) in order to trade off risk and time when data is generated by a probabilistic model.
no code implementations • 2 May 2016 • Mario Lucic, Olivier Bachem, Andreas Krause
Outliers are ubiquitous in modern data sets.
no code implementations • 21 Aug 2015 • Mario Lucic, Olivier Bachem, Andreas Krause
We propose a single, practical algorithm to construct strong coresets for a large class of hard and soft clustering problems based on Bregman divergences.
no code implementations • NeurIPS 2014 • Brian McWilliams, Gabriel Krummenacher, Mario Lucic, Joachim M. Buhmann
Subsampling methods have recently been proposed to speed up least squares estimation in large-scale settings.
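As a minimal, hedged illustration of the subsampling idea (not the specific estimator studied in this paper), the sketch below solves least squares on a uniform row subsample and compares it against the full solution; the data dimensions and subsample size are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, m = 200_000, 20, 5_000            # full size, features, subsample size
X = rng.normal(size=(n, d))
beta_true = rng.normal(size=d)
y = X @ beta_true + 0.1 * rng.normal(size=n)

# Full OLS vs. OLS on a uniform row subsample.
beta_full, *_ = np.linalg.lstsq(X, y, rcond=None)
rows = rng.choice(n, size=m, replace=False)
beta_sub, *_ = np.linalg.lstsq(X[rows], y[rows], rcond=None)

print(f"full error {np.linalg.norm(beta_full - beta_true):.4f}, "
      f"subsample error {np.linalg.norm(beta_sub - beta_true):.4f}")
```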