Search Results for author: James Thewlis

Found 12 papers, 5 papers with code

Language as the Medium: Multimodal Video Classification through text only

no code implementations • 19 Sep 2023 • Laura Hanu, Anita L. Verő, James Thewlis

Despite an exciting new wave of multimodal machine learning models, current approaches still struggle to interpret the complex contextual relationships between the different modalities present in videos.

Action Recognition Video Classification +1

Paper
Add Code

VTC: Improving Video-Text Retrieval with User Comments

1 code implementation • 19 Oct 2022 • Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht

In this paper, we a) introduce a new dataset of videos, titles and comments; b) present an attention-based mechanism that allows the model to learn from sometimes irrelevant data such as comments; c) show that by using comments, our method is able to learn better, more contextualised, representations for image, video and audio representations.

Representation Learning Retrieval +3

Paper
Code

Learning Context-Adapted Video-Text Retrieval by Attending to User Comments

no code implementations • 29 Sep 2021 • Laura Hanu, Yuki M Asano, James Thewlis, Christian Rupprecht

Learning strong representations for multi-modal retrieval is an important problem for many applications, such as recommendation and search.

Retrieval Text Retrieval +1

Paper
Add Code

Unsupervised Learning of Landmarks by Descriptor Vector Exchange

1 code implementation • ICCV 2019 • James Thewlis, Samuel Albanie, Hakan Bilen, Andrea Vedaldi

Equivariance to random image transformations is an effective method to learn landmarks of object categories, such as the eyes and the nose in faces, without manual supervision.

Ranked #1 on Unsupervised Facial Landmark Detection on 300W

Object Unsupervised Facial Landmark Detection

Paper
Code

Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues

no code implementations • CVPR 2019 • Natalia Neverova, James Thewlis, Riza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi

DensePose supersedes traditional landmark detectors by densely mapping image pixels to body surface coordinates.

Pose Estimation

Paper
Add Code

Deep Industrial Espionage

no code implementations • 1 Apr 2019 • Samuel Albanie, James Thewlis, Sebastien Ehrhardt, Joao Henriques

The theory of deep learning is now considered largely solved, and is well understood by researchers and influencers alike.

Paper
Add Code

Modelling and unsupervised learning of symmetric deformable object categories

no code implementations • NeurIPS 2018 • James Thewlis, Hakan Bilen, Andrea Vedaldi

We propose a new approach to model and learn, without manual supervision, the symmetries of natural objects, such as faces or flowers, given only images as input.

Object

Paper
Add Code

Cross Pixel Optical Flow Similarity for Self-Supervised Learning

no code implementations • 15 Jul 2018 • Aravindh Mahendran, James Thewlis, Andrea Vedaldi

We propose a novel method for learning convolutional neural image representations without manual supervision.

Image Classification Image Segmentation +4

Paper
Add Code

Substitute Teacher Networks: Learning with Almost No Supervision

1 code implementation • 1 Apr 2018 • Samuel Albanie, James Thewlis, Joao F. Henriques

Learning through experience is time-consuming, inefficient and often bad for your cortisol levels.

Paper
Code

Unsupervised learning of object frames by dense equivariant image labelling

no code implementations • NeurIPS 2017 • James Thewlis, Hakan Bilen, Andrea Vedaldi

One of the key challenges of visual perception is to extract abstract models of 3D objects and object categories from visual measurements, which are affected by complex nuisance factors such as viewpoint, occlusion, motion, and deformations.

Ranked #3 on Unsupervised Facial Landmark Detection on AFLW-MTFL