Search Results for author: Michael Tschannen

Found 46 papers, 23 papers with code

Subspace clustering of dimensionality-reduced data

no code implementations27 Apr 2014 Reinhard Heckel, Michael Tschannen, Helmut Bölcskei

Subspace clustering refers to the problem of clustering unlabeled high-dimensional data points into a union of low-dimensional linear subspaces, assumed unknown.

Clustering Dimensionality Reduction

Nonparametric Nearest Neighbor Random Process Clustering

no code implementations20 Apr 2015 Michael Tschannen, Helmut Bölcskei

We consider the problem of clustering noisy finite-length observations of stationary ergodic random processes according to their nonparametric generative models without prior knowledge of the model statistics and the number of generative models.

Clustering
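
The nearest-neighbor approach above clusters processes by comparing spectral statistics of the observations. A minimal sketch of that general idea, assuming periodogram-based PSD estimates, an L1 distance, and scikit-learn spectral clustering; these are illustrative choices, not the paper's exact algorithm:

```python
# Illustrative sketch: cluster time series by distances between estimated
# power spectral densities (PSDs), then group them via spectral clustering
# on a nearest-neighbor affinity graph (distance and clustering choices are
# assumptions, not the paper's exact algorithm).
import numpy as np
from scipy.signal import periodogram
from sklearn.cluster import SpectralClustering

def cluster_processes(observations, n_clusters, n_neighbors=5):
    # observations: array of shape (n_series, series_length)
    psds = np.array([periodogram(x)[1] for x in observations])
    psds /= psds.sum(axis=1, keepdims=True)                # normalize each PSD estimate
    dists = np.abs(psds[:, None, :] - psds[None, :, :]).sum(axis=-1)   # pairwise L1
    affinity = np.zeros_like(dists)
    for i, row in enumerate(dists):
        nearest = np.argsort(row)[1:n_neighbors + 1]       # skip the point itself
        affinity[i, nearest] = 1.0
    affinity = np.maximum(affinity, affinity.T)            # symmetrize the graph
    return SpectralClustering(n_clusters=n_clusters,
                              affinity="precomputed").fit_predict(affinity)
```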

Dimensionality-reduced subspace clustering

no code implementations25 Jul 2015 Reinhard Heckel, Michael Tschannen, Helmut Bölcskei

Subspace clustering refers to the problem of clustering unlabeled high-dimensional data points into a union of low-dimensional linear subspaces, whose number, orientations, and dimensions are all unknown.

Clustering Dimensionality Reduction

Pursuits in Structured Non-Convex Matrix Factorizations

no code implementations12 Feb 2016 Rajiv Khanna, Michael Tschannen, Martin Jaggi

Efficiently representing real world data in a succinct and parsimonious manner is of central importance in many fields.

Discrete Deep Feature Extraction: A Theory and New Architectures

no code implementations26 May 2016 Thomas Wiatowski, Michael Tschannen, Aleksandar Stanić, Philipp Grohs, Helmut Bölcskei

First steps towards a mathematical theory of deep convolutional neural networks for feature extraction were made, for the continuous-time case, in Mallat, 2012, and Wiatowski and Bölcskei, 2015.

Facial Landmark Detection Feature Importance +2

Deep Structured Features for Semantic Segmentation

no code implementations26 Sep 2016 Michael Tschannen, Lukas Cavigelli, Fabian Mentzer, Thomas Wiatowski, Luca Benini

We propose a highly structured neural network architecture for semantic segmentation with an extremely small model size, suitable for low-power embedded and mobile platforms.

General Classification Segmentation +1

Robust nonparametric nearest neighbor random process clustering

no code implementations4 Dec 2016 Michael Tschannen, Helmut Bölcskei

We consider the problem of clustering noisy finite-length observations of stationary ergodic random processes according to their generative models without prior knowledge of the model statistics and the number of generative models.

Clustering

Noisy subspace clustering via matching pursuits

no code implementations11 Dec 2016 Michael Tschannen, Helmut Bölcskei

The clustering conditions we obtain for SSC-OMP and SSC-MP are similar to those for SSC and for the thresholding-based subspace clustering (TSC) algorithm due to Heckel and Bölcskei.

Clustering
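
SSC-OMP, mentioned above, builds a sparse self-representation of each data point from the other points via orthogonal matching pursuit and then clusters the resulting graph. A rough sketch using scikit-learn's OMP solver; the sparsity level and the spectral-clustering step are illustrative assumptions:

```python
# Rough sketch of OMP-based sparse subspace clustering (in the spirit of
# SSC-OMP): represent each point as a sparse combination of the other points,
# then spectrally cluster the resulting affinity graph.
import numpy as np
from sklearn.linear_model import OrthogonalMatchingPursuit
from sklearn.cluster import SpectralClustering

def ssc_omp(X, n_clusters, n_nonzero_coefs=5):
    # X: data matrix of shape (n_points, ambient_dim), points as rows.
    n = X.shape[0]
    C = np.zeros((n, n))
    for i in range(n):
        others = np.delete(np.arange(n), i)
        omp = OrthogonalMatchingPursuit(n_nonzero_coefs=n_nonzero_coefs,
                                        fit_intercept=False)
        omp.fit(X[others].T, X[i])          # dictionary: the other points as columns
        C[i, others] = omp.coef_
    affinity = np.abs(C) + np.abs(C.T)      # symmetrized coefficient magnitudes
    return SpectralClustering(n_clusters=n_clusters,
                              affinity="precomputed").fit_predict(affinity)
```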

A Unified Optimization View on Generalized Matching Pursuit and Frank-Wolfe

no code implementations21 Feb 2017 Francesco Locatello, Rajiv Khanna, Michael Tschannen, Martin Jaggi

Two of the most fundamental prototypes of greedy optimization are the matching pursuit and Frank-Wolfe algorithms.
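
For reference, a plain matching pursuit iteration, the textbook prototype mentioned above; this is not the paper's generalized formulation:

```python
# Plain matching pursuit: greedily pick the dictionary atom most correlated
# with the current residual and peel off its contribution.
import numpy as np

def matching_pursuit(D, y, n_iters=10):
    # D: dictionary with unit-norm atoms as columns, shape (dim, n_atoms)
    # y: target signal, shape (dim,)
    residual = y.astype(float).copy()
    coef = np.zeros(D.shape[1])
    for _ in range(n_iters):
        correlations = D.T @ residual
        j = int(np.argmax(np.abs(correlations)))   # best-matching atom
        coef[j] += correlations[j]
        residual -= correlations[j] * D[:, j]      # remove its contribution
    return coef, residual
```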

Greedy Algorithms for Cone Constrained Optimization with Convergence Guarantees

no code implementations NeurIPS 2017 Francesco Locatello, Michael Tschannen, Gunnar Rätsch, Martin Jaggi

Greedy optimization methods such as Matching Pursuit (MP) and Frank-Wolfe (FW) algorithms regained popularity in recent years due to their simplicity, effectiveness and theoretical guarantees.

Convolutional Recurrent Neural Networks for Electrocardiogram Classification

1 code implementation17 Oct 2017 Martin Zihlmann, Dmytro Perekrestenko, Michael Tschannen

We propose two deep neural network architectures for classification of arbitrary-length electrocardiogram (ECG) recordings and evaluate them on the atrial fibrillation (AF) classification data set provided by the PhysioNet/CinC Challenge 2017.

Classification Data Augmentation +1
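
A hedged PyTorch sketch of a convolutional-recurrent classifier for variable-length 1-D signals in the spirit of the architectures above; the layer sizes, kernel widths, and the use of the last LSTM state are illustrative assumptions, not the paper's exact configuration:

```python
import torch
import torch.nn as nn

class ConvRecurrentClassifier(nn.Module):
    def __init__(self, n_classes=4, hidden=128):
        super().__init__()
        self.features = nn.Sequential(     # strided 1-D convs as feature extractor
            nn.Conv1d(1, 32, kernel_size=7, stride=2, padding=3), nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=7, stride=2, padding=3), nn.ReLU(),
        )
        self.rnn = nn.LSTM(64, hidden, batch_first=True)
        self.classifier = nn.Linear(hidden, n_classes)

    def forward(self, x):
        # x: (batch, 1, time); arbitrary lengths are aggregated by the LSTM.
        h = self.features(x).transpose(1, 2)       # (batch, time', 64)
        _, (h_last, _) = self.rnn(h)
        return self.classifier(h_last[-1])         # class logits
```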

StrassenNets: Deep Learning with a Multiplication Budget

1 code implementation ICML 2018 Michael Tschannen, Aran Khanna, Anima Anandkumar

A large fraction of the arithmetic operations required to evaluate deep neural networks (DNNs) consists of matrix multiplications, in both convolution and fully connected layers.

Image Classification Knowledge Distillation +2

Conditional Probability Models for Deep Image Compression

1 code implementation CVPR 2018 Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

During training, the auto-encoder makes use of the context model to estimate the entropy of its representation, and the context model is concurrently updated to learn the dependencies between the symbols in the latent representation.

Image Compression MS-SSIM +3
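
The rate term described above can be sketched as the cross-entropy of the quantized symbols under an autoregressive context model. A simplified single-channel PyTorch illustration; the masked convolution and the 8-symbol alphabet are assumptions, not the paper's exact model:

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class MaskedConv2d(nn.Conv2d):
    """3x3 convolution that only sees already-decoded (causal) positions."""
    def __init__(self, in_ch, out_ch):
        super().__init__(in_ch, out_ch, kernel_size=3, padding=1)
        mask = torch.ones_like(self.weight)
        mask[:, :, 1, 1:] = 0      # hide the current position and those right of it
        mask[:, :, 2:, :] = 0      # hide all rows below
        self.register_buffer("mask", mask)

    def forward(self, x):
        return F.conv2d(x, self.weight * self.mask, self.bias, padding=1)

def estimated_bits(symbols, context_model):
    # symbols: integer-quantized latent, shape (batch, 1, H, W)
    logits = context_model(symbols.float())        # (batch, n_symbols, H, W)
    nats = F.cross_entropy(logits, symbols.squeeze(1).long(), reduction="sum")
    return nats / math.log(2.0)                    # total estimated bits

context_model = MaskedConv2d(1, 8)                 # one-layer toy context model
```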

Towards Image Understanding from Deep Compression without Decoding

1 code implementation ICLR 2018 Robert Torfason, Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

Motivated by recent work on deep neural network (DNN)-based image compression methods showing potential improvements in image quality, savings in storage, and bandwidth reduction, we propose to perform image understanding tasks such as classification and segmentation directly on the compressed representations produced by these compression methods.

Classification General Classification +2

Born Again Neural Networks

2 code implementations ICML 2018 Tommaso Furlanello, Zachary C. Lipton, Michael Tschannen, Laurent Itti, Anima Anandkumar

Knowledge distillation (KD) consists of transferring knowledge from one machine learning model (the teacher) to another (the student).

Image Classification Knowledge Distillation
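
Born-Again training distills a teacher into a student of the same architecture; the generic temperature-scaled distillation loss below is the standard recipe such work builds on, with the temperature and weighting as illustrative values rather than the paper's exact schedule:

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # Soft target term: student matches the teacher's softened distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)                                    # rescale gradients for temperature
    hard = F.cross_entropy(student_logits, labels) # usual supervised term
    return alpha * soft + (1.0 - alpha) * hard
```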

Practical Full Resolution Learned Lossless Image Compression

3 code implementations CVPR 2019 Fabian Mentzer, Eirikur Agustsson, Michael Tschannen, Radu Timofte, Luc van Gool

We propose the first practical learned lossless image compression system, L3C, and show that it outperforms the popular engineered codecs, PNG, WebP and JPEG 2000.

Image Compression

Recent Advances in Autoencoder-Based Representation Learning

no code implementations12 Dec 2018 Michael Tschannen, Olivier Bachem, Mario Lucic

Finally, we provide an analysis of autoencoder-based representation learning through the lens of rate-distortion theory and identify a clear tradeoff between the amount of prior knowledge available about the downstream tasks, and how useful the representation is for this task.

Disentanglement
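
In the rate-distortion view mentioned above, a VAE-style objective splits into a distortion (reconstruction) term and a rate (KL) term whose weight controls the tradeoff. A minimal sketch under Gaussian posterior/prior and MSE-distortion assumptions, with the beta weight purely illustrative:

```python
import torch
import torch.nn.functional as F

def rate_distortion_terms(x, x_recon, mu, log_var, beta=4.0):
    distortion = F.mse_loss(x_recon, x, reduction="sum")                 # reconstruction cost
    rate = -0.5 * torch.sum(1.0 + log_var - mu.pow(2) - log_var.exp())   # KL to N(0, I)
    return distortion + beta * rate, distortion, rate
```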

Disentangling Factors of Variation Using Few Labels

no code implementations3 May 2019 Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem

Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not make it possible to consistently learn disentangled representations.

Disentanglement Model Selection

On Mutual Information Maximization for Representation Learning

2 code implementations ICLR 2020 Michael Tschannen, Josip Djolonga, Paul K. Rubenstein, Sylvain Gelly, Mario Lucic

Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data.

Inductive Bias Representation Learning +1
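
One common MI estimator in this family is InfoNCE, among those studied above. A minimal two-view sketch; the cosine similarity and temperature value are assumptions:

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, temperature=0.1):
    # z1, z2: embeddings of two views of the same batch, shape (batch, dim)
    z1 = F.normalize(z1, dim=-1)
    z2 = F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / temperature             # pairwise similarities
    targets = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, targets)        # matched pairs are the positives
```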

Semantic Bottleneck Scene Generation

2 code implementations26 Nov 2019 Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic

For the former, we use an unconditional progressive segmentation generation network that captures the distribution of realistic semantic scene layouts.

Conditional Image Generation Image-to-Image Translation +2

Self-Supervised Learning of Video-Induced Visual Invariances

no code implementations CVPR 2020 Michael Tschannen, Josip Djolonga, Marvin Ritter, Aravindh Mahendran, Xiaohua Zhai, Neil Houlsby, Sylvain Gelly, Mario Lucic

We propose a general framework for self-supervised learning of transferable visual representations based on Video-Induced Visual Invariances (VIVI).

Ranked #15 on Image Classification on VTAB-1k (using extra training data)

Image Classification Self-Supervised Learning +1

Weakly-Supervised Disentanglement Without Compromises

3 code implementations ICML 2020 Francesco Locatello, Ben Poole, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem, Michael Tschannen

Third, we perform a large-scale empirical study and show that such pairs of observations are sufficient to reliably learn disentangled representations on several benchmark data sets.

Disentanglement Fairness

Automatic Shortcut Removal for Self-Supervised Representation Learning

no code implementations ICML 2020 Matthias Minderer, Olivier Bachem, Neil Houlsby, Michael Tschannen

In self-supervised visual representation learning, a feature extractor is trained on a "pretext task" for which labels can be generated cheaply, without human annotation.

Representation Learning
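
A pretext task in this sense can be as simple as predicting image rotations; the generic, hypothetical example below only illustrates what a pretext task is and is not the shortcut-removal method the paper proposes:

```python
import torch

def rotation_pretext_batch(images):
    # images: (batch, C, H, W); returns rotated copies and rotation labels.
    rotated, labels = [], []
    for k in range(4):                                       # 0, 90, 180, 270 degrees
        rotated.append(torch.rot90(images, k, dims=(2, 3)))
        labels.append(torch.full((images.size(0),), k, dtype=torch.long))
    return torch.cat(rotated), torch.cat(labels)             # train a 4-way classifier
```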

Learning Better Lossless Compression Using Lossy Compression

1 code implementation CVPR 2020 Fabian Mentzer, Luc van Gool, Michael Tschannen

We leverage the powerful lossy image compression algorithm BPG to build a lossless image compression system.

Image Compression

Disentangling Factors of Variations Using Few Labels

no code implementations ICLR Workshop LLD 2019 Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem

Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not make it possible to consistently learn disentangled representations.

Disentanglement Model Selection

High-Fidelity Generative Image Compression

4 code implementations NeurIPS 2020 Fabian Mentzer, George Toderici, Michael Tschannen, Eirikur Agustsson

We extensively study how to combine Generative Adversarial Networks and learned compression to obtain a state-of-the-art generative lossy compression system.

Image Compression Vocal Bursts Intensity Prediction

Unconditional Synthesis of Complex Scenes Using a Semantic Bottleneck

no code implementations1 Jan 2021 Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic

Coupling the high-fidelity generation capabilities of label-conditional image synthesis methods with the flexibility of unconditional generative models, we propose a semantic bottleneck GAN model for unconditional synthesis of complex scenes.

Image Generation Segmentation

Neural Face Video Compression using Multiple Views

no code implementations29 Mar 2022 Anna Volokitin, Stefan Brugger, Ali Benlalah, Sebastian Martin, Brian Amberg, Michael Tschannen

Recent advances in deep generative models led to the development of neural face video compression codecs that use an order of magnitude less bandwidth than engineered codecs.

Video Compression

CLIPPO: Image-and-Language Understanding from Pixels Only

1 code implementation CVPR 2023 Michael Tschannen, Basil Mustafa, Neil Houlsby

Multimodal models are becoming increasingly effective, in part due to unified components, such as the Transformer architecture.

Contrastive Learning Image Classification +7

M2T: Masking Transformers Twice for Faster Decoding

no code implementations ICCV 2023 Fabian Mentzer, Eirikur Agustsson, Michael Tschannen

We show how bidirectional transformers trained for masked token prediction can be applied to neural image compression to achieve state-of-the-art results.

Image Compression Image Generation

Image Captioners Are Scalable Vision Learners Too

1 code implementation NeurIPS 2023 Michael Tschannen, Manoj Kumar, Andreas Steiner, Xiaohua Zhai, Neil Houlsby, Lucas Beyer

We further analyze the effect of the model architecture and scale, as well as the pretraining data on the representation quality, and find that captioning exhibits the same or better scaling behavior along these axes.

Image Captioning

Finite Scalar Quantization: VQ-VAE Made Simple

3 code implementations27 Sep 2023 Fabian Mentzer, David Minnen, Eirikur Agustsson, Michael Tschannen

Each dimension is quantized to a small set of fixed values, leading to an (implicit) codebook given by the product of these sets.

Colorization Depth Estimation +4
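
A hedged sketch of the quantizer described above: each latent dimension is squashed and rounded onto a small fixed grid with a straight-through gradient, so the implicit codebook is the product of the per-dimension level sets. Odd level counts and this particular level tuple are simplifying assumptions:

```python
import torch

def fsq(z, levels=(7, 7, 7, 5, 5)):
    # z: latents of shape (..., len(levels)); implicit codebook size is prod(levels).
    half = torch.tensor([(l - 1) / 2 for l in levels], dtype=z.dtype, device=z.device)
    bounded = torch.tanh(z) * half                 # each dimension lies in [-half, half]
    quantized = torch.round(bounded)               # snap to the fixed per-dimension levels
    # Straight-through: rounded values forward, smooth tanh gradient backward.
    return bounded + (quantized - bounded).detach()
```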

GIVT: Generative Infinite-Vocabulary Transformers

1 code implementation4 Dec 2023 Michael Tschannen, Cian Eastwood, Fabian Mentzer

We introduce generative infinite-vocabulary transformers (GIVT) which generate vector sequences with real-valued entries, instead of discrete tokens from a finite vocabulary.

Conditional Image Generation Depth Estimation +1
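
A rough sketch of a continuous ("infinite-vocabulary") output head in the spirit of the above: instead of a softmax over a finite vocabulary, the hidden state parameterizes a small Gaussian mixture over the next real-valued vector. The mixture size and diagonal covariance are assumptions, not the paper's exact head:

```python
import torch
import torch.nn as nn
import torch.distributions as D

class GMMHead(nn.Module):
    def __init__(self, hidden_dim, token_dim, n_components=8):
        super().__init__()
        self.k, self.d = n_components, token_dim
        self.proj = nn.Linear(hidden_dim, n_components * (1 + 2 * token_dim))

    def forward(self, h):
        # h: (batch, hidden_dim) -> distribution over the next token in R^token_dim
        logits, loc, log_scale = self.proj(h).split(
            [self.k, self.k * self.d, self.k * self.d], dim=-1)
        loc = loc.view(-1, self.k, self.d)
        scale = log_scale.view(-1, self.k, self.d).exp()
        mixture = D.Categorical(logits=logits)
        components = D.Independent(D.Normal(loc, scale), 1)
        return D.MixtureSameFamily(mixture, components)

# Training minimizes -head(h).log_prob(next_token); sampling uses .sample().
```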

Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers

no code implementations3 Jan 2024 Aleksandar Stanić, Sergi Caelles, Michael Tschannen

Recently, these models achieved great performance on tasks such as compositional visual question answering, visual grounding, and video temporal reasoning.

Question Answering Visual Grounding +2
