Search Results for author: Anthony Dick

Found 23 papers, 3 papers with code

EBMs vs. CL: Exploring Self-Supervised Visual Pretraining for Visual Question Answering

no code implementations • 29 Jun 2022 • Violetta Shevchenko, Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel, Damien Teney

In a simple setting similar to CLEVR, we find that CL representations also improve systematic generalization, and even match the performance of representations from a larger, supervised, ImageNet-pretrained model.

Contrastive Learning Out of Distribution (OOD) Detection +4

Paper
Add Code

Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge

no code implementations • EACL (LANTERN) 2021 • Violetta Shevchenko, Damien Teney, Anthony Dick, Anton Van Den Hengel

The technique brings clear benefits to knowledge-demanding question answering tasks (OK-VQA, FVQA) by capturing semantic and relational knowledge absent from existing models.

Question Answering Visual Question Answering (VQA) +1

Paper
Add Code

Visual Question Answering with Prior Class Semantics

no code implementations • 4 May 2020 • Violetta Shevchenko, Damien Teney, Anthony Dick, Anton Van Den Hengel

We present a novel mechanism to embed prior knowledge in a model for visual question answering.

Question Answering Visual Question Answering +1

Paper
Add Code

Joint Learning of Set Cardinality and State Distribution

no code implementations • 13 Sep 2017 • S. Hamid Rezatofighi, Anton Milan, Qinfeng Shi, Anthony Dick, Ian Reid

We present a novel approach for learning to predict sets using deep learning.

Multi-Label Image Classification valid

Paper
Add Code

Visual Question Answering with Memory-Augmented Networks

no code implementations • CVPR 2018 • Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton Van Den Hengel, Ian Reid

In this paper, we exploit a memory-augmented neural network to predict accurate answers to visual questions, even when those answers occur rarely in the training set.

Question Answering Visual Question Answering

Paper
Add Code

Bayesian Conditional Generative Adverserial Networks

no code implementations • 17 Jun 2017 • M. Ehsan Abbasnejad, Qinfeng Shi, Iman Abbasnejad, Anton Van Den Hengel, Anthony Dick

Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input $z$ to a sample $\mathbf{x}$ that the discriminator seeks to distinguish.

Paper
Add Code

DeepSetNet: Predicting Sets with Deep Neural Networks

no code implementations • ICCV 2017 • S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid

This paper addresses the task of set prediction using deep learning.

Image Classification Object Counting +3

Paper
Add Code

Infinite Variational Autoencoder for Semi-Supervised Learning

no code implementations • CVPR 2017 • Ehsan Abbasnejad, Anthony Dick, Anton Van Den Hengel

This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data.

Paper
Add Code

Visual Question Answering: A Survey of Methods and Datasets

1 code implementation • 20 Jul 2016 • Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities.

General Knowledge Visual Question Answering

436

Paper
Code

FVQA: Fact-based Visual Question Answering

no code implementations • 17 Jun 2016 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We evaluate several baseline models on the FVQA dataset, and describe a novel model which is capable of reasoning about an image on the basis of supporting facts.

Ranked #2 on Visual Question Answering (VQA) on F-VQA

Common Sense Reasoning Question Answering +1

Paper
Add Code

Joint Probabilistic Matching Using m-Best Solutions

no code implementations • CVPR 2016 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

Matching between two sets of objects is typically approached by finding the object pairs that collectively maximize the joint matching score.

Person Re-Identification

Paper
Add Code

Online Multi-Target Tracking Using Recurrent Neural Networks

no code implementations • 13 Apr 2016 • Anton Milan, Seyed Hamid Rezatofighi, Anthony Dick, Ian Reid, Konrad Schindler

Here, we propose for the first time, an end-to-end learning approach for online multi-target tracking.

Paper
Add Code

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

no code implementations • 9 Mar 2016 • Qi Wu, Chunhua Shen, Anton Van Den Hengel, Peng Wang, Anthony Dick

Much recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

Ranked #9 on Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 open ended

General Knowledge Image Captioning +2

Paper
Add Code

Joint Probabilistic Data Association Revisited

1 code implementation • ICCV 2015 • Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony Dick, Ian Reid

In this paper, we revisit the joint probabilistic data association (JPDA) technique and propose a novel solution based on recent developments in finding the m-best solutions to an integer linear program.

203

Paper
Code

Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources

no code implementations • CVPR 2016 • Qi Wu, Peng Wang, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

Priming a recurrent neural network with this combined information, and the submitted question, leads to a very flexible visual question answering approach.

General Knowledge Question Answering +1

Paper
Add Code

Explicit Knowledge-based Reasoning for Visual Question Answering

no code implementations • 9 Nov 2015 • Peng Wang, Qi Wu, Chunhua Shen, Anton Van Den Hengel, Anthony Dick

We describe a method for visual question answering which is capable of reasoning about contents of an image on the basis of information extracted from a large-scale knowledge base.

Question Answering Visual Question Answering

Paper
Add Code

Online Metric-Weighted Linear Representations for Robust Visual Tracking

no code implementations • 21 Jul 2015 • Xi Li, Chunhua Shen, Anthony Dick, Zhongfei Zhang, Yueting Zhuang

Object identification results for an entire video sequence are achieved by systematically combining the tracking information and visual recognition at each frame.

Metric Learning Object +2

Paper
Add Code

What value do explicit high level concepts have in vision to language problems?

1 code implementation • CVPR 2016 • Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony Dick, Anton Van Den Hengel

Much of the recent progress in Vision-to-Language (V2L) problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

Image Captioning Question Answering +1

Paper
Code

Part-Based Modelling of Compound Scenes From Images

no code implementations • CVPR 2015 • Anton van den Hengel, Chris Russell, Anthony Dick, John Bastian, Daniel Pooley, Lachlan Fleming, Lourdes Agapito

We propose a method to recover the structure of a compound scene from multiple silhouettes.

Paper
Add Code

Deconstruction of compound objects from image sets

no code implementations • 26 Feb 2014 • Anton van den Hengel, John Bastian, Anthony Dick, Lachlan Fleming

We propose a method to recover the structure of a compound object from multiple silhouettes.

Paper
Add Code

Context-Aware Hypergraph Construction for Robust Spectral Clustering

no code implementations • 4 Jan 2014 • Xi Li, Weiming Hu, Chunhua Shen, Anthony Dick, Zhongfei Zhang

Using both CAHSM and DHPC, a robust spectral clustering algorithm is developed.

Clustering hypergraph partitioning

Paper
Add Code

Contextual Hypergraph Modelling for Salient Object Detection

no code implementations • 22 Oct 2013 • Xi Li, Yao Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions.

Object object-detection +2

Paper
Add Code

Learning Compact Binary Codes for Visual Tracking

no code implementations • CVPR 2013 • Xi Li, Chunhua Shen, Anthony Dick, Anton Van Den Hengel

A key problem in visual tracking is to represent the appearance of an object in a way that is robust to visual changes.

Visual Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.