no code implementations • 19 Sep 2023 • Laura Hanu, Anita L. Verő, James Thewlis
Despite an exciting new wave of multimodal machine learning models, current approaches still struggle to interpret the complex contextual relationships between the different modalities present in videos.
1 code implementation • 19 Oct 2022 • Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht
In this paper, we a) introduce a new dataset of videos, titles and comments; b) present an attention-based mechanism that allows the model to learn from sometimes irrelevant data such as comments; c) show that by using comments, our method is able to learn better, more contextualised, representations for image, video and audio representations.
no code implementations • 29 Sep 2021 • Laura Hanu, Yuki M Asano, James Thewlis, Christian Rupprecht
Learning strong representations for multi-modal retrieval is an important problem for many applications, such as recommendation and search.
1 code implementation • ICCV 2019 • James Thewlis, Samuel Albanie, Hakan Bilen, Andrea Vedaldi
Equivariance to random image transformations is an effective method to learn landmarks of object categories, such as the eyes and the nose in faces, without manual supervision.
Ranked #1 on Unsupervised Facial Landmark Detection on 300W
no code implementations • CVPR 2019 • Natalia Neverova, James Thewlis, Riza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi
DensePose supersedes traditional landmark detectors by densely mapping image pixels to body surface coordinates.
no code implementations • 1 Apr 2019 • Samuel Albanie, James Thewlis, Sebastien Ehrhardt, Joao Henriques
The theory of deep learning is now considered largely solved, and is well understood by researchers and influencers alike.
no code implementations • NeurIPS 2018 • James Thewlis, Hakan Bilen, Andrea Vedaldi
We propose a new approach to model and learn, without manual supervision, the symmetries of natural objects, such as faces or flowers, given only images as input.
no code implementations • 15 Jul 2018 • Aravindh Mahendran, James Thewlis, Andrea Vedaldi
We propose a novel method for learning convolutional neural image representations without manual supervision.
1 code implementation • 1 Apr 2018 • Samuel Albanie, James Thewlis, Joao F. Henriques
Learning through experience is time-consuming, inefficient and often bad for your cortisol levels.
no code implementations • NeurIPS 2017 • James Thewlis, Hakan Bilen, Andrea Vedaldi
One of the key challenges of visual perception is to extract abstract models of 3D objects and object categories from visual measurements, which are affected by complex nuisance factors such as viewpoint, occlusion, motion, and deformations.
Ranked #3 on Unsupervised Facial Landmark Detection on AFLW-MTFL
1 code implementation • ICCV 2017 • James Thewlis, Hakan Bilen, Andrea Vedaldi
Learning automatically the structure of object categories remains an important open problem in computer vision.
Ranked #2 on Unsupervised Facial Landmark Detection on AFLW-MTFL
1 code implementation • 12 Sep 2016 • James Thewlis, Shuai Zheng, Philip H. S. Torr, Andrea Vedaldi
Deep Matching (DM) is a popular high-quality method for quasi-dense image matching.