no code implementations • 23 Mar 2023 • Relja Arandjelović, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman
The core problem in zero-shot open vocabulary detection is how to align visual and text features, so that the detector performs well on unseen classes.
1 code implementation • 16 Mar 2022 • Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelović
The promise of self-supervised learning (SSL) is to leverage large amounts of unlabeled data to solve complex tasks.
no code implementations • CVPR 2022 • Wang Yifan, Carl Doersch, Relja Arandjelović, João Carreira, Andrew Zisserman
Much of the recent progress in 3D vision has been driven by the development of specialized architectures that incorporate geometrical inductive biases.
no code implementations • 9 Jun 2021 • Relja Arandjelović, Andrew Zisserman
In this work we address a clear limitation of the vanilla coarse-to-fine approach -- that it is based on a heuristic and not trained end-to-end for the task at hand.
1 code implementation • NeurIPS 2020 • Jean-Baptiste Alayrac, Adrià Recasens, Rosalia Schneider, Relja Arandjelović, Jason Ramapuram, Jeffrey De Fauw, Lucas Smaira, Sander Dieleman, Andrew Zisserman
In particular, we explore how best to combine the modalities, such that fine-grained representations of the visual and audio modalities can be maintained, whilst also integrating text into a common embedding.
1 code implementation • ECCV 2020 • Ignacio Rocco, Relja Arandjelović, Josef Sivic
In this work we target the problem of estimating accurately localised correspondences between a pair of images.
no code implementations • 26 Mar 2020 • Yujie Zhong, Relja Arandjelović, Andrew Zisserman
The objective of this work is to learn a compact embedding of a set of descriptors that is suitable for efficient retrieval and ranking, whilst maintaining discriminability of the individual descriptors.
no code implementations • ICCV 2019 • Jean-Baptiste Alayrac, João Carreira, Relja Arandjelović, Andrew Zisserman
The objective of this paper is to be able to separate a video into its natural layers, and to control which of the separated layers to attend to.
1 code implementation • 27 May 2019 • Relja Arandjelović, Andrew Zisserman
We tackle the problem of object discovery, where objects are segmented for a given input image, and the system is trained without using any direct supervision whatsoever.
3 code implementations • NeurIPS 2018 • Ignacio Rocco, Mircea Cimpoi, Relja Arandjelović, Akihiko Torii, Tomas Pajdla, Josef Sivic
Second, we demonstrate that the model can be trained effectively from weak supervision in the form of matching and non-matching image pairs without the need for costly manual annotation of point to point correspondences.
Ranked #2 on
Semantic correspondence
on PF-PASCAL
(PCK (weak) metric)
2 code implementations • 23 Oct 2018 • Yujie Zhong, Relja Arandjelović, Andrew Zisserman
The objective of this paper is to learn a compact representation of image sets for template-based face recognition.
Ranked #3 on
Face Verification
on IJB-A
2 code implementations • CVPR 2018 • Ignacio Rocco, Relja Arandjelović, Josef Sivic
We tackle the task of semantic alignment where the goal is to compute dense semantic correspondence aligning two images depicting objects of the same category.
no code implementations • ECCV 2018 • Relja Arandjelović, Andrew Zisserman
We make the following contributions: (i) show that audio and visual embeddings can be learnt that enable both within-mode (e. g. audio-to-audio) and between-mode retrieval; (ii) explore various architectures for the AVC task, including those for the visual stream that ingest a single image, or multiple images, or a single image and multi-frame optical flow; (iii) show that the semantic object that sounds within an image can be localized (using only the sound, no motion or flow information); and (iv) give a cautionary tale on how to avoid undesirable shortcuts in the data preparation.
1 code implementation • ICCV 2017 • Relja Arandjelović, Andrew Zisserman
We consider the question: what can be learnt by looking at and listening to a large number of unlabelled videos?
Ranked #27 on
Audio Classification
on ESC-50
5 code implementations • CVPR 2017 • Ignacio Rocco, Relja Arandjelović, Josef Sivic
We address the problem of determining correspondences between two images in agreement with a geometric model such as an affine or thin-plate spline transformation, and estimating its parameters.
no code implementations • 5 Jun 2016 • Artem Babenko, Relja Arandjelović, Victor Lempitsky
The proposed approach proceeds by finding a linear transformation of the data that effectively reduces the minimization of the pairwise distortions to the minimization of individual reconstruction errors.
15 code implementations • CVPR 2016 • Relja Arandjelović, Petr Gronat, Akihiko Torii, Tomas Pajdla, Josef Sivic
We tackle the problem of large scale visual place recognition, where the task is to quickly and accurately recognize the location of a given query photograph.
Ranked #3 on
Visual Place Recognition
on Mid-Atlantic Ridge
1 code implementation • CVPR 2012 • Relja Arandjelović, Andrew Zisserman
The objective of this work is object retrieval in large scale image datasets, where the object is specified by an image query and retrieval should be immediate at run time in the manner of Video Google [28].
Ranked #6 on
Image Matching
on IMC PhotoTourism
(using extra training data)