no code implementations • 29 Jun 2022 • Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel, Damien Teney
In a simple setting similar to CLEVR, we find that CL representations also improve systematic generalization, and even match the performance of representations from a larger, supervised, ImageNet-pretrained model.
no code implementations • EACL (LANTERN) 2021 • Violetta Shevchenko, Damien Teney, Anthony Dick, Anton Van Den Hengel
The technique brings clear benefits to knowledge-demanding question answering tasks (OK-VQA, FVQA) by capturing semantic and relational knowledge absent from existing models.
no code implementations • 4 May 2020 • Violetta Shevchenko, Damien Teney, Anthony Dick, Anton Van Den Hengel
We present a novel mechanism to embed prior knowledge in a model for visual question answering.
no code implementations • 13 Sep 2017 • S. Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Anthony Dick, Ian Reid
We present a novel approach for learning to predict sets using deep learning.
no code implementations • CVPR 2018 • Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid
In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.
no code implementations • 17 Jun 2017 • M. Ehsan Abbasnejad, Qinfeng Shi, Iman Abbasnejad, Anton Van Den Hengel, Anthony Dick
Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input $z$ to a sample $\mathbf{x}$ that the discriminator seeks to distinguish.
no code implementations • ICCV 2017 • S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid
This paper addresses the task of set prediction using deep learning.
no code implementations • CVPR 2017 • Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel
This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data.
1 code implementation • 20 Jul 2016 • Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel
Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities.
no code implementations • 17 Jun 2016 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick
We evaluate several baseline models on the FVQA dataset, and describe a novel model which is capable of reasoning about an image on the basis of supporting facts.
Ranked #2 on Visual Question Answering (VQA) on F-VQA
no code implementations • CVPR 2016 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid
Matching between two sets of objects is typically approached by finding the object pairs that collectively maximize the joint matching score.
no code implementations • 13 Apr 2016 • Anton Milan, Seyed Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler
Here, we propose for the first time, an end-to-end learning approach for online multi-target tracking.
no code implementations • 9 Mar 2016 • Qi Wu, Chunhua Shen, Anton Van Den Hengel, Peng Wang, Anthony Dick
Much recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).
1 code implementation • ICCV 2015 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid
In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding the m-best solutions to an integer linear program.
no code implementations • CVPR 2016 • Qi Wu, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel
Priming a recurrent neural network with this combined information, and the submitted question, leads to a very flexible visual question answering approach.
no code implementations • 9 Nov 2015 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick
We describe a method for visual question answering which is capable of reasoning about contents of an image on the basis of information extracted from a large-scale knowledge base.
no code implementations • 21 Jul 2015 • Xi Li, Chunhua Shen, Anthony Dick, Zhongfei Zhang, Yueting Zhuang
Object identification results for an entire video sequence are achieved by systematically combining the tracking information and visual recognition at each frame.
1 code implementation • CVPR 2016 • Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony Dick, Anton Van Den Hengel
Much of the recent progress in Vision-to-Language (V2L) problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).
no code implementations • CVPR 2015 • Anton van den Hengel, Chris Russell, Anthony Dick, John Bastian, Daniel Pooley, Lachlan Fleming, Lourdes Agapito
We propose a method to recover the structure of a compound scene from multiple silhouettes.
no code implementations • 26 Feb 2014 • Anton van den Hengel, John Bastian, Anthony Dick, Lachlan Fleming
We propose a method to recover the structure of a compound object from multiple silhouettes.
no code implementations • 4 Jan 2014 • Xi Li, Weiming Hu, Chunhua Shen, Anthony Dick, Zhongfei Zhang
Using both CAHSM and DHPC, a robust spectral clustering algorithm is developed.
no code implementations • 22 Oct 2013 • Xi Li, Yao Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel
In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions.
no code implementations • CVPR 2013 • Xi Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel
A key problem in visual tracking is to represent the appearance of an object in a way that is robust to visual changes.