Search Results for author: Yuki M. Asano

Found 37 papers, 21 papers with code

The Common Stability Mechanism behind most Self-Supervised Learning Approaches

1 code implementation • 22 Feb 2024 • Abhishek Jha, Matthew B. Blaschko, Yuki M. Asano, Tinne Tuytelaars

Last couple of years have witnessed a tremendous progress in self-supervised learning (SSL), the success of which can be attributed to the introduction of useful inductive biases in the learning process to learn meaningful visual representations while avoiding collapse.

Self-Supervised Learning

Paper
Code

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

no code implementations • 13 Feb 2024 • Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek, Yuki M. Asano

Vision-Language Models (VLMs), such as Flamingo and GPT-4V, have shown immense potential by integrating large language models with vision systems.

Paper
Add Code

Object-Centric Diffusion for Efficient Video Editing

no code implementations • 11 Jan 2024 • Kumara Kahatapitiya, Adil Karjauv, Davide Abati, Fatih Porikli, Yuki M. Asano, Amirhossein Habibian

Diffusion-based video editing have reached impressive quality and can transform either the global style, local structure, and attributes of given video inputs, following textual edit prompts.

Object Video Editing

Paper
Add Code

The LLM Surgeon

1 code implementation • 28 Dec 2023 • Tycho F. A. van der Ouderaa, Markus Nagel, Mart van Baalen, Yuki M. Asano, Tijmen Blankevoort

Experimentally, our method can prune rows and columns from a range of OPT models and Llamav2-7B by 20%-30%, with a negligible loss in performance, and achieve state-of-the-art results in unstructured and semi-structured pruning of large language models.

Paper
Code

Protect Your Score: Contact Tracing With Differential Privacy Guarantees

no code implementations • 18 Dec 2023 • Rob Romijnders, Christos Louizos, Yuki M. Asano, Max Welling

The pandemic in 2020 and 2021 had enormous economic and societal consequences, and studies show that contact tracing algorithms can be key in the early containment of the virus.

Paper
Add Code

Guided Diffusion from Self-Supervised Diffusion Features

no code implementations • 14 Dec 2023 • Vincent Tao Hu, Yunlu Chen, Mathilde Caron, Yuki M. Asano, Cees G. M. Snoek, Bjorn Ommer

However, recent studies have revealed that the feature representation derived from diffusion model itself is discriminative for numerous downstream tasks as well, which prompts us to propose a framework to extract guidance from, and specifically for, diffusion models.

Self-Supervised Learning

Paper
Add Code

VaLID: Variable-Length Input Diffusion for Novel View Synthesis

no code implementations • 14 Dec 2023 • Shijie Li, Farhad G. Zanjani, Haitam Ben Yahia, Yuki M. Asano, Juergen Gall, Amirhossein Habibian

This is because the source-view images and corresponding poses are processed separately and injected into the model at different stages.

Image Generation Novel View Synthesis +1

Paper
Add Code

VeRA: Vector-based Random Matrix Adaptation

no code implementations • 17 Oct 2023 • Dawid J. Kopiczko, Tijmen Blankevoort, Yuki M. Asano

Low-rank adapation (LoRA) is a popular method that reduces the number of trainable parameters when finetuning large language models, but still faces acute storage challenges when scaling to even larger models or deploying numerous per-user or per-task adapted models.

Image Classification Instruction Following

Paper
Add Code

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

no code implementations • 12 Oct 2023 • Shashanka Venkataramanan, Mamshad Nayeem Rizve, João Carreira, Yuki M. Asano, Yannis Avrithis

But are we making the best use of data?

Self-Supervised Learning

Paper
Add Code

Self-Supervised Open-Ended Classification with Small Visual Language Models

no code implementations • 30 Sep 2023 • Mohammad Mahdi Derakhshani, Ivona Najdenkoska, Cees G. M. Snoek, Marcel Worring, Yuki M. Asano

We present Self-Context Adaptation (SeCAt), a self-supervised approach that unlocks few-shot abilities for open-ended classification with small visual language models.

Few-Shot Learning Image Captioning

Paper
Add Code

Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

1 code implementation • ICCV 2023 • Mohammadreza Salehi, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano

Our paper aims to address this gap by proposing a novel approach that incorporates temporal consistency in dense self-supervised learning.

Self-Supervised Learning Unsupervised Semantic Segmentation

Paper
Code

Efficient Neural PDE-Solvers using Quantization Aware Training

no code implementations • 14 Aug 2023 • Winfried van den Dool, Tijmen Blankevoort, Max Welling, Yuki M. Asano

In the past years, the application of neural networks as an alternative to classical numerical methods to solve Partial Differential Equations has emerged as a potential paradigm shift in this century-old mathematical field.

Quantization

Paper
Add Code

Learning to Count without Annotations

1 code implementation • 17 Jul 2023 • Lukas Knobel, Tengda Han, Yuki M. Asano

While recent supervised methods for reference-based object counting continue to improve the performance on benchmark datasets, they have to rely on small datasets due to the cost associated with manually annotating dozens of objects in images.

Object Counting

Paper
Code

BISCUIT: Causal Representation Learning from Binary Interactions

1 code implementation • 16 Jun 2023 • Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

Identifying the causal variables of an environment and how to intervene on them is of core value in applications such as robotics and embodied AI.

Causal Discovery Causal Identification +1

Paper
Code

Self-Ordering Point Clouds

no code implementations • ICCV 2023 • Pengwan Yang, Cees G. M. Snoek, Yuki M. Asano

In this paper we address the task of finding representative subsets of points in a 3D point cloud by means of a point-wise ordering.

Paper
Add Code

Towards Label-Efficient Incremental Learning: A Survey

1 code implementation • 1 Feb 2023 • Mert Kilickaya, Joost Van de Weijer, Yuki M. Asano

The current dominant paradigm when building a machine learning model is to iterate over a dataset over and over until convergence.

Incremental Learning Self-Supervised Learning

Paper
Code

Skip-Attention: Improving Vision Transformers by Paying Less Attention

no code implementations • 5 Jan 2023 • Shashanka Venkataramanan, Amir Ghodrati, Yuki M. Asano, Fatih Porikli, Amirhossein Habibian

This work aims to improve the efficiency of vision transformers (ViT).

Image Classification Image Denoising +3

Paper
Add Code

VTC: Improving Video-Text Retrieval with User Comments

1 code implementation • 19 Oct 2022 • Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht

In this paper, we a) introduce a new dataset of videos, titles and comments; b) present an attention-based mechanism that allows the model to learn from sometimes irrelevant data such as comments; c) show that by using comments, our method is able to learn better, more contextualised, representations for image, video and audio representations.

Representation Learning Retrieval +3

Paper
Code

Self-Guided Diffusion Models

1 code implementation • CVPR 2023 • Vincent Tao Hu, David W Zhang, Yuki M. Asano, Gertjan J. Burghouts, Cees G. M. Snoek

Diffusion models have demonstrated remarkable progress in image generation quality, especially when guidance is used to control the generative process.

Image Generation

Paper
Code

Prompt Generation Networks for Input-based Adaptation of Frozen Vision Transformers

1 code implementation • 12 Oct 2022 • Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano

With the introduction of the transformer architecture in computer vision, increasing model scale has been demonstrated as a clear path to achieving performance and robustness gains.

Transfer Learning

Paper
Code

Measuring the Interpretability of Unsupervised Representations via Quantized Reverse Probing

no code implementations • 7 Sep 2022 • Iro Laina, Yuki M. Asano, Andrea Vedaldi

Self-supervised visual representation learning has recently attracted significant research interest.

Representation Learning

Paper
Add Code

Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems

1 code implementation • 13 Jun 2022 • Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

To address this issue, we propose iCITRIS, a causal representation learning method that allows for instantaneous effects in intervened temporal sequences when intervention targets can be observed, e. g., as actions of an agent.

Causal Discovery Representation Learning +1

Paper
Code

Looking for a Handsome Carpenter! Debiasing GPT-3 Job Advertisements

1 code implementation • NAACL (GeBNLP) 2022 • Conrad Borchers, Dalia Sara Gala, Benjamin Gilburt, Eduard Oravkin, Wilfried Bounsi, Yuki M. Asano, Hannah Rose Kirk

The growing capability and availability of generative language models has enabled a wide range of new downstream tasks.

Language Modelling Prompt Engineering

Paper
Code

Self-Supervised Learning of Object Parts for Semantic Segmentation

1 code implementation • CVPR 2022 • Adrian Ziegler, Yuki M. Asano

However, learning dense representations is challenging, as in the unsupervised context it is not clear how to guide the model to learn representations that correspond to various potential object categories.

Ranked #5 on Unsupervised Semantic Segmentation on PASCAL VOC 2012 val (using extra training data)

Community Detection Image Segmentation +6

Paper
Code

Less than Few: Self-Shot Video Instance Segmentation

no code implementations • 19 Apr 2022 • Pengwan Yang, Yuki M. Asano, Pascal Mettes, Cees G. M. Snoek

The goal of this paper is to bypass the need for labelled examples in few-shot video understanding at run time.

Few-Shot Learning Instance Segmentation +5

Paper
Add Code

CITRIS: Causal Identifiability from Temporal Intervened Sequences

1 code implementation • 7 Feb 2022 • Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, Efstratios Gavves

Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments.

Representation Learning Temporal Sequences

Paper
Code

The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image

1 code implementation • 1 Dec 2021 • Yuki M. Asano, Aaqib Saeed

What can neural networks learn about the visual world when provided with only a single image as input?

Knowledge Distillation

Paper
Code

PASS: An ImageNet replacement for self-supervised pretraining without humans

1 code implementation • NeurIPS Workshop ImageNet_PPF 2021 • Yuki M. Asano, Christian Rupprecht, Andrew Zisserman, Andrea Vedaldi

On the other hand, state-of-the-art pretraining is nowadays obtained with unsupervised methods, meaning that labelled datasets such as ImageNet may not be necessary, or perhaps not even optimal, for model pretraining.

Benchmarking Ethics +2

258

Paper
Code

Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset

no code implementations • ACL (WOAH) 2021 • Hannah Rose Kirk, Yennie Jun, Paulius Rauba, Gal Wachtel, Ruining Li, Xingjian Bai, Noah Broestl, Martin Doff-Sotta, Aleksandar Shtedritski, Yuki M. Asano

In this paper, we collect hateful and non-hateful memes from Pinterest to evaluate out-of-sample performance on models pre-trained on the Facebook dataset.

Optical Character Recognition (OCR)

Paper
Add Code

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers

2 code implementations • NeurIPS 2021 • Mandela Patrick, Dylan Campbell, Yuki M. Asano, Ishan Misra, Florian Metze, Christoph Feichtenhofer, Andrea Vedaldi, João F. Henriques

In video transformers, the time dimension is often treated in the same way as the two spatial dimensions.

Ranked #15 on Action Recognition on EPIC-KITCHENS-100 (using extra training data)

Action Classification Action Recognition +1

7,508

Paper
Code

Self-supervised object detection from audio-visual correspondence

no code implementations • CVPR 2022 • Triantafyllos Afouras, Yuki M. Asano, Francois Fagan, Andrea Vedaldi, Florian Metze

We tackle the problem of learning object detectors without supervision.

Object object-detection +1

Paper
Add Code

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning

1 code implementation • ICCV 2021 • Mandela Patrick, Yuki M. Asano, Bernie Huang, Ishan Misra, Florian Metze, Joao Henriques, Andrea Vedaldi

First, for space, we show that spatial augmentations such as cropping do work well for videos too, but that previous implementations, due to the high processing and memory cost, could not do this at a scale sufficient for it to work well.

Representation Learning Self-Supervised Learning

Paper
Code

Privacy-preserving Object Detection

no code implementations • 11 Mar 2021 • Peiyang He, Charlie Griffin, Krzysztof Kacprzyk, Artjom Joosen, Michael Collyer, Aleksandar Shtedritski, Yuki M. Asano

Privacy considerations and bias in datasets are quickly becoming high-priority issues that the computer vision community needs to face.

Object object-detection +2

Paper
Add Code

Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models

1 code implementation • NeurIPS 2021 • Hannah Kirk, Yennie Jun, Haider Iqbal, Elias Benussi, Filippo Volpin, Frederic A. Dreyer, Aleksandar Shtedritski, Yuki M. Asano

Using a template-based data collection pipeline, we collect 396K sentence completions made by GPT-2 and find: (i) The machine-predicted jobs are less diverse and more stereotypical for women than for men, especially for intersections; (ii) Intersectional interactions are highly relevant for occupational associations, which we quantify by fitting 262 logistic models; (iii) For most occupations, GPT-2 reflects the skewed gender and ethnicity distribution found in US Labor Bureau data, and even pulls the societally-skewed distribution towards gender parity in cases where its predictions deviate from real labor market observations.

Language Modelling Sentence +1

Paper
Code

Labelling unlabelled videos from scratch with multi-modal self-supervision

1 code implementation • NeurIPS 2020 • Yuki M. Asano, Mandela Patrick, Christian Rupprecht, Andrea Vedaldi

A large part of the current success of deep learning lies in the effectiveness of data -- more precisely: labelled data.

Benchmarking Clustering

114

Paper
Code

On Compositions of Transformations in Contrastive Self-Supervised Learning

1 code implementation • ICCV 2021 • Mandela Patrick, Yuki M. Asano, Polina Kuznetsova, Ruth Fong, João F. Henriques, Geoffrey Zweig, Andrea Vedaldi

In the image domain, excellent representations can be learned by inducing invariance to content-preserving transformations via noise contrastive learning.

Action Recognition Audio Classification +3

Paper
Code

A critical analysis of self-supervision, or what we can learn from a single image

2 code implementations • ICLR 2020 • Yuki M. Asano, Christian Rupprecht, Andrea Vedaldi

We look critically at popular self-supervision techniques for learning deep convolutional neural networks without manual labels.

Data Augmentation Representation Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.