Search Results for author: Anthony Dick

Found 23 papers, 3 papers with code

EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering

no code implementations29 Jun 2022 Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel, Damien Teney

In a simple setting similar to CLEVR, we find that CL representations also improve systematic generalization, and even match the performance of representations from a larger, supervised, ImageNet-pretrained model.

Contrastive Learning Out of Distribution (OOD) Detection +4

Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge

no code implementations EACL (LANTERN) 2021 Violetta Shevchenko, Damien Teney, Anthony Dick, Anton Van Den Hengel

The technique brings clear benefits to knowledge-demanding question answering tasks (OK-VQA, FVQA) by capturing semantic and relational knowledge absent from existing models.

Question Answering Visual Question Answering (VQA) +1

Visual Question Answering with Memory-Augmented Networks

no code implementations CVPR 2018 Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid

In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.

Question Answering Visual Question Answering

Bayesian Conditional Generative Adverserial Networks

no code implementations17 Jun 2017 M. Ehsan Abbasnejad, Qinfeng Shi, Iman Abbasnejad, Anton Van Den Hengel, Anthony Dick

Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input $z$ to a sample $\mathbf{x}$ that the discriminator seeks to distinguish.

Infinite Variational Autoencoder for Semi-Supervised Learning

no code implementations CVPR 2017 Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel

This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data.

Visual Question Answering: A Survey of Methods and Datasets

1 code implementation20 Jul 2016 Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities.

General Knowledge Visual Question Answering

FVQA: Fact-based Visual Question Answering

no code implementations17 Jun 2016 Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We evaluate several baseline models on the FVQA dataset, and describe a novel model which is capable of reasoning about an image on the basis of supporting facts.

Common Sense Reasoning Question Answering +1

Joint Probabilistic Matching Using m-Best Solutions

no code implementations CVPR 2016 Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

Matching between two sets of objects is typically approached by finding the object pairs that collectively maximize the joint matching score.

Person Re-Identification

Online Multi-Target Tracking Using Recurrent Neural Networks

no code implementations13 Apr 2016 Anton Milan, Seyed Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler

Here, we propose for the first time, an end-to-end learning approach for online multi-target tracking.

Joint Probabilistic Data Association Revisited

1 code implementation ICCV 2015 Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding the m-best solutions to an integer linear program.

Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources

no code implementations CVPR 2016 Qi Wu, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

Priming a recurrent neural network with this combined information, and the submitted question, leads to a very flexible visual question answering approach.

General Knowledge Question Answering +1

Explicit Knowledge-based Reasoning for Visual Question Answering

no code implementations9 Nov 2015 Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We describe a method for visual question answering which is capable of reasoning about contents of an image on the basis of information extracted from a large-scale knowledge base.

Question Answering Visual Question Answering

Online Metric-Weighted Linear Representations for Robust Visual Tracking

no code implementations21 Jul 2015 Xi Li, Chunhua Shen, Anthony Dick, Zhongfei Zhang, Yueting Zhuang

Object identification results for an entire video sequence are achieved by systematically combining the tracking information and visual recognition at each frame.

Metric Learning Object +2

What value do explicit high level concepts have in vision to language problems?

1 code implementation CVPR 2016 Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony Dick, Anton Van Den Hengel

Much of the recent progress in Vision-to-Language (V2L) problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

Image Captioning Question Answering +1

Part-Based Modelling of Compound Scenes From Images

no code implementations CVPR 2015 Anton van den Hengel, Chris Russell, Anthony Dick, John Bastian, Daniel Pooley, Lachlan Fleming, Lourdes Agapito

We propose a method to recover the structure of a compound scene from multiple silhouettes.

Deconstruction of compound objects from image sets

no code implementations26 Feb 2014 Anton van den Hengel, John Bastian, Anthony Dick, Lachlan Fleming

We propose a method to recover the structure of a compound object from multiple silhouettes.

Contextual Hypergraph Modelling for Salient Object Detection

no code implementations22 Oct 2013 Xi Li, Yao Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions.

Object object-detection +2

Learning Compact Binary Codes for Visual Tracking

no code implementations CVPR 2013 Xi Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

A key problem in visual tracking is to represent the appearance of an object in a way that is robust to visual changes.

Visual Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.