Search Results for author: Philip H. S. Torr

Found 207 papers, 116 papers with code

GDumb: A Simple Approach that Questions Our Progress in Continual Learning

2 code implementations • ECCV 2020 • Ameya Prabhu, Philip H. S. Torr, Puneet K. Dokania

We discuss a general formulation for the Continual Learning (CL) problem for classification---a learning task where a stream provides samples to a learner and the goal of the learner, depending on the samples it receives, is to continually upgrade its knowledge about the old classes and learn new ones.

Class Incremental Learning Open Set Learning

1,657

Paper
Code

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

1 code implementation • 4 Apr 2024 • Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge

Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation.

Benchmarking Image Generation +1

Paper
Code

NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning

1 code implementation • 2 Mar 2024 • Linsheng Chen, Guangrun Wang, Liuchun Yuan, Keze Wang, Ken Deng, Philip H. S. Torr

Furthermore, the cascading learning of NeRF-VPT introduces adaptability to scenarios with sparse inputs, resulting in a significant enhancement of accuracy for sparse-view novel view synthesis.

Novel View Synthesis

Paper
Code

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

1 code implementation • 26 Feb 2024 • Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogez

In our experiments, we present different anomaly segmentation datasets based on POC-generated data and show that POC can improve the performance of recent state-of-the-art anomaly fine-tuning methods in several standardized benchmarks.

Segmentation Semantic Segmentation

Paper
Code

Prompting a Pretrained Transformer Can Be a Universal Approximator

no code implementations • 22 Feb 2024 • Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

Despite the widespread adoption of prompting, prompt tuning and prefix-tuning of transformer models, our theoretical understanding of these fine-tuning methods remains limited.

Paper
Add Code

Self-consistent Validation for Machine Learning Electronic Structure

no code implementations • 15 Feb 2024 • Gengyuan Hu, Gengchen Wei, Zekun Lou, Philip H. S. Torr, Wanli Ouyang, Han-sen Zhong, Chen Lin

Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems.

Active Learning

Paper
Add Code

Revealing Decurve Flows for Generalized Graph Propagation

no code implementations • 13 Feb 2024 • Chen Lin, Liheng Ma, Yiyang Chen, Wanli Ouyang, Michael M. Bronstein, Philip H. S. Torr

\textbf{Secondly}, we propose the {\em Continuous Unified Ricci Curvature} (\textbf{CURC}), an extension of celebrated {\em Ollivier-Ricci Curvature} for directed and weighted graphs.

Graph Learning

Paper
Add Code

RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

1 code implementation • 13 Feb 2024 • Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

Our investigation is both surprising and alarming as it questions our understanding of how to effectively design and train models that require efficient continual representation learning, and necessitates a principled reinvestigation of the widely explored problem formulation itself.

Continual Learning Representation Learning

Paper
Code

Secret Collusion Among Generative AI Agents

no code implementations • 12 Feb 2024 • Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H. S. Torr, Lewis Hammond, Christian Schroeder de Witt

In this paper, we comprehensively formalise the problem of secret collusion in systems of generative AI agents by drawing on relevant concepts from both the AI and security literature.

Paper
Add Code

From Categories to Classifier: Name-Only Continual Learning by Exploring the Web

no code implementations • 19 Nov 2023 • Ameya Prabhu, Hasan Abed Al Kader Hammoud, Ser-Nam Lim, Bernard Ghanem, Philip H. S. Torr, Adel Bibi

Continual Learning (CL) often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice.

Continual Learning Image Classification +1

Paper
Add Code

When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

1 code implementation • 30 Oct 2023 • Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity due to their ability to often match the performance of full fine-tuning with a fraction of the parameters.

In-Context Learning

Paper
Code

Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation

no code implementations • 20 Oct 2023 • Francisco Eiras, Kemal Oksuz, Adel Bibi, Philip H. S. Torr, Puneet K. Dokania

Referring Image Segmentation (RIS) - the problem of identifying objects in images through natural language sentences - is a challenging task currently mostly solved through supervised learning.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Fine-tuning can cripple your foundation model; preserving features may be the solution

no code implementations • 25 Aug 2023 • Jishnu Mukhoti, Yarin Gal, Philip H. S. Torr, Puneet K. Dokania

This is an undesirable effect of fine-tuning as a substantial amount of resources was used to learn these pre-trained concepts in the first place.

Continual Learning

Paper
Add Code

Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

1 code implementation • ICCV 2023 • Guangyi Chen, Xiao Liu, Guangrun Wang, Kun Zhang, Philip H. S. Torr, Xiao-Ping Zhang, Yansong Tang

To bridge these gaps, in this paper, we propose Tem-Adapter, which enables the learning of temporal dynamics and complex semantics by a visual Temporal Aligner and a textual Semantic Aligner.

Ranked #1 on Video Question Answering on SUTD-TrafficQA

Question Answering Video Question Answering

Paper
Code

Faithful Knowledge Distillation

no code implementations • 7 Jun 2023 • Tom A. Lamb, Rudy Brunel, Krishnamurthy Dj Dvijotham, M. Pawan Kumar, Philip H. S. Torr, Francisco Eiras

To address these questions, we introduce a faithful imitation framework to discuss the relative calibration of confidences and provide empirical and certified methods to evaluate the relative calibration of a student w. r. t.

Adversarial Robustness Knowledge Distillation

Paper
Add Code

Language Model Tokenizers Introduce Unfairness Between Languages

1 code implementation • NeurIPS 2023 • Aleksandar Petrov, Emanuele La Malfa, Philip H. S. Torr, Adel Bibi

Recent language models have shown impressive multilingual performance, even when not explicitly trained for it.

Language Modelling

Paper
Code

Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

1 code implementation • ICCV 2023 • Hasan Abed Al Kader Hammoud, Ameya Prabhu, Ser-Nam Lim, Philip H. S. Torr, Adel Bibi, Bernard Ghanem

We revisit the common practice of evaluating adaptation of Online Continual Learning (OCL) algorithms through the metric of online accuracy, which measures the accuracy of the model on the immediate next few samples.

Continual Learning

Paper
Code

Certifying Ensembles: A General Certification Theory with S-Lipschitzness

no code implementations • 25 Apr 2023 • Aleksandar Petrov, Francisco Eiras, Amartya Sanyal, Philip H. S. Torr, Adel Bibi

Improving and guaranteeing the robustness of deep learning models has been a topic of intense research.

Paper
Add Code

Fairness in AI and Its Long-Term Implications on Society

no code implementations • 16 Apr 2023 • Ondrej Bohdal, Timothy Hospedales, Philip H. S. Torr, Fazl Barez

Successful deployment of artificial intelligence (AI) in various settings has led to numerous positive outcomes for individuals and society.

Decision Making Fairness

Paper
Add Code

Don't FREAK Out: A Frequency-Inspired Approach to Detecting Backdoor Poisoned Samples in DNNs

no code implementations • 23 Mar 2023 • Hasan Abed Al Kader Hammoud, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

In this paper we investigate the frequency sensitivity of Deep Neural Networks (DNNs) when presented with clean samples versus poisoned samples.

Paper
Add Code

Computationally Budgeted Continual Learning: What Does Matter?

1 code implementation • CVPR 2023 • Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi

Our conclusions are consistent in a different number of stream time steps, e. g., 20 to 200, and under several computational budgets.

Continual Learning

Paper
Code

Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation

no code implementations • 11 Mar 2023 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr

Referring image segmentation segments an image from a language expression.

Image Segmentation Object +1

Paper
Add Code

MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices

1 code implementation • CVPR 2023 • Kejie Li, Jia-Wang Bian, Robert Castle, Philip H. S. Torr, Victor Adrian Prisacariu

The distinct data modality offered by high-resolution RGB images and low-resolution depth maps captured on a mobile device, when combined with precise 3D geometry annotations, presents a unique opportunity for future research on high-fidelity 3D reconstruction.

3D Object Reconstruction 3D Reconstruction +1

133

Paper
Code

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation • ICCV 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Object Segmentation +3

291

Paper
Code

Real-Time Evaluation in Online Continual Learning: A New Hope

1 code implementation • CVPR 2023 • Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip H. S. Torr, Bernard Ghanem

We show that a simple baseline outperforms state-of-the-art CL methods under this evaluation, questioning the applicability of existing methods in realistic settings.

Continual Learning

Paper
Code

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

no code implementations • CVPR 2023 • Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility function for CLIP's contrastive loss, intending to train an alignment between the patch tokens of the vision encoder and the CLS token of the text encoder.

Ranked #1 on Open Vocabulary Semantic Segmentation on Cityscape-171

Contrastive Learning Image Classification +5

Paper
Add Code

Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs

no code implementations • 27 Nov 2022 • Guangrun Wang, Philip H. S. Torr

Proving that classifiers have learned the data distribution and are ready for image generation has far-reaching implications, for classifiers are much easier to train than generative models like DDPMs and GANs.

Text-to-Image Generation

Paper
Add Code

Label Alignment Regularization for Distribution Shift

no code implementations • 27 Nov 2022 • Ehsan Imani, Guojun Zhang, Runjia Li, Jun Luo, Pascal Poupart, Philip H. S. Torr, Yangchen Pan

Recent work has highlighted the label alignment property (LAP) in supervised learning, where the vector of all labels in the dataset is mostly in the span of the top few singular vectors of the data matrix.

Representation Learning Sentiment Analysis +1

Paper
Add Code

Structure-Preserving 3D Garment Modeling with Neural Sewing Machines

no code implementations • 12 Nov 2022 • Xipeng Chen, Guangrun Wang, Dizhong Zhu, Xiaodan Liang, Philip H. S. Torr, Liang Lin

In this paper, we propose a novel Neural Sewing Machine (NSM), a learning-based framework for structure-preserving 3D garment modeling, which is capable of learning representations for garments with diverse shapes and topologies and is successfully applied to 3D garment reconstruction and controllable manipulation.

Garment Reconstruction Representation Learning

Paper
Add Code

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis

1 code implementation • 12 Nov 2022 • Hao Tang, Ling Shao, Philip H. S. Torr, Nicu Sebe

To further capture the change in pose of each part more precisely, we propose a novel part-aware bipartite graph reasoning (PBGR) block to decompose the task of reasoning the global structure transformation with a bipartite graph into learning different local transformations for different semantic body/face parts.

Generative Adversarial Network Image Generation

128

Paper
Code

Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning

1 code implementation • 24 Oct 2022 • Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

This article presents Holistically-Attracted Wireframe Parsing (HAWP), a method for geometric analysis of 2D images containing wireframes formed by line segments and junctions.

Self-Supervised Learning Wireframe Parsing

276

Paper
Code

Raising the Bar on the Evaluation of Out-of-Distribution Detection

no code implementations • 24 Sep 2022 • Jishnu Mukhoti, Tsung-Yu Lin, Bor-Chun Chen, Ashish Shah, Philip H. S. Torr, Puneet K. Dokania, Ser-Nam Lim

In this paper, we define 2 categories of OoD data using the subtly different concepts of perceptual/visual and semantic similarity to in-distribution (iD) data.

Out-of-Distribution Detection Out of Distribution (OOD) Detection +2

Paper
Add Code

Learn what matters: cross-domain imitation learning with task-relevant embeddings

no code implementations • 24 Sep 2022 • Tim Franzmeyer, Philip H. S. Torr, João F. Henriques

We study how an autonomous agent learns to perform a task from demonstrations in a different domain, such as a different environment or different agent.

Imitation Learning

Paper
Add Code

Dynamic Graph Message Passing Networks for Visual Recognition

2 code implementations • 20 Sep 2022 • Li Zhang, Mohan Chen, Anurag Arnab, xiangyang xue, Philip H. S. Torr

A fully-connected graph, such as the self-attention operation in Transformers, is beneficial for such modelling, however, its computational overhead is prohibitive.

Image Classification object-detection +3

Paper
Code

Memory-Driven Text-to-Image Generation

no code implementations • 15 Aug 2022 • Bowen Li, Philip H. S. Torr, Thomas Lukasiewicz

We introduce a memory-driven semi-parametric approach to text-to-image generation, which is based on both parametric and non-parametric techniques.

Generative Adversarial Network Text-to-Image Generation

Paper
Add Code

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation

1 code implementation • 25 Jul 2022 • Jiaming Zhang, Kailun Yang, Hao Shi, Simon Reiß, Kunyu Peng, Chaoxiang Ma, Haodong Fu, Philip H. S. Torr, Kaiwei Wang, Rainer Stiefelhagen

In this paper, we address panoramic semantic segmentation which is under-explored due to two critical challenges: (1) image distortions and object deformations on panoramas; (2) lack of semantic annotations in the 360-degree imagery.

Ranked #1 on Semantic Segmentation on SynPASS

Pseudo Label Segmentation +2

Paper
Code

An Impartial Take to the CNN vs Transformer Robustness Contest

no code implementations • 22 Jul 2022 • Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania

Following the surge of popularity of Transformers in Computer Vision, several studies have attempted to determine whether they could be more robust to distribution shifts and provide better uncertainty estimates than Convolutional Neural Networks (CNNs).

Paper
Add Code

Illusory Attacks: Detectability Matters in Adversarial Attacks on Sequential Decision-Makers

no code implementations • 20 Jul 2022 • Tim Franzmeyer, Stephen Mcaleer, João F. Henriques, Jakob N. Foerster, Philip H. S. Torr, Adel Bibi, Christian Schroeder de Witt

Autonomous agents deployed in the real world need to be robust against adversarial attacks on sensory inputs.

Adversarial Attack Adversarial Robustness

Paper
Add Code

Vision Transformers: From Semantic Segmentation to Dense Prediction

3 code implementations • 19 Jul 2022 • Li Zhang, Jiachen Lu, Sixiao Zheng, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng, Philip H. S. Torr

In this work, for the first time we explore the global context learning potentials of ViTs for dense visual prediction (e. g., semantic segmentation).

Image Classification Instance Segmentation +5

1,012

Paper
Code

Sample-dependent Adaptive Temperature Scaling for Improved Calibration

1 code implementation • 13 Jul 2022 • Tom Joy, Francesco Pinto, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

The most common post-hoc approach to compensate for this is to perform temperature scaling, which adjusts the confidences of the predictions on any input by scaling the logits by a fixed value.

Out of Distribution (OOD) Detection

Paper
Code

SiamMask: A Framework for Fast Online Object Tracking and Segmentation

no code implementations • 5 Jul 2022 • Weiming Hu, Qiang Wang, Li Zhang, Luca Bertinetto, Philip H. S. Torr

In this paper we introduce SiamMask, a framework to perform both visual object tracking and video object segmentation, in real-time, with the same simple method.

Multiple Object Tracking Object +5

Paper
Add Code

RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness

2 code implementations • 29 Jun 2022 • Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

We show that the effectiveness of the well celebrated Mixup [Zhang et al., 2018] can be further improved if instead of using it as the sole learning objective, it is utilized as an additional regularizer to the standard cross-entropy loss.

Out-of-Distribution Detection

174

Paper
Code

How Robust is Unsupervised Representation Learning to Distribution Shift?

no code implementations • 17 Jun 2022 • Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H. S. Torr, Amartya Sanyal

As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift.

Representation Learning Self-Supervised Learning

Paper
Add Code

Catastrophic overfitting can be induced with discriminative non-robust features

1 code implementation • 16 Jun 2022 • Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H. S. Torr

Through extensive experiments we analyze this novel phenomenon and discover that the presence of these easy features induces a learning shortcut that leads to CO. Our findings provide new insights into the mechanisms of CO and improve our understanding of the dynamics of AT.

Robust classification

Paper
Code

Zero-Shot Logit Adjustment

1 code implementation • 25 Apr 2022 • Dubing Chen, Yuming Shen, Haofeng Zhang, Philip H. S. Torr

As a consequence of our derivation, the aforementioned two properties are incorporated into the classifier training as seen-unseen priors via logit adjustment.

Ranked #1 on Generalized Zero-Shot Learning on AwA2 (Accuracy Unseen metric)

Bayesian Inference Generalized Zero-Shot Learning +1

Paper
Code

Deconstructed Generation-Based Zero-Shot Model

1 code implementation • 24 Apr 2022 • Dubing Chen, Yuming Shen, Haofeng Zhang, Philip H. S. Torr

Recent research on Generalized Zero-Shot Learning (GZSL) has focused primarily on generation-based methods.

Attribute Generalized Zero-Shot Learning

Paper
Code

BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion

1 code implementation • CVPR 2022 • Kejie Li, Yansong Tang, Victor Adrian Prisacariu, Philip H. S. Torr

Dense 3D reconstruction from a stream of depth images is the key to many mixed reality and robotic applications.

3D Reconstruction Mixed Reality +1

136

Paper
Code

Local and Global GANs with Semantic-Aware Upsampling for Image Generation

1 code implementation • 28 Feb 2022 • Hao Tang, Ling Shao, Philip H. S. Torr, Nicu Sebe

To learn more discriminative class-specific feature representations for the local generation, we also propose a novel classification module.

Feature Upsampling Image Generation

145

Paper
Code

Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

1 code implementation • 2 Feb 2022 • Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip H. S. Torr, Grégory Rogez, Puneet K. Dokania

Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks.

Paper
Code

On the Robustness of Quality Measures for GANs

1 code implementation • 31 Jan 2022 • Motasem Alfarra, Juan C. Pérez, Anna Frühstück, Philip H. S. Torr, Peter Wonka, Bernard Ghanem

Finally, we show that the FID can be robustified by simply replacing the standard Inception with a robust Inception.

Paper
Code

Adversarial Masking for Self-Supervised Learning

1 code implementation • 31 Jan 2022 • Yuge Shi, N. Siddharth, Philip H. S. Torr, Adam R. Kosiorek

We propose ADIOS, a masked image model (MIM) framework for self-supervised learning, which simultaneously learns a masking function and an image encoder using an adversarial objective.

Representation Learning Self-Supervised Learning +1

Paper
Code

Learning to Hash Naturally Sorts

no code implementations • 31 Jan 2022 • Jiaguo Yu, Yuming Shen, Menghan Wang, Haofeng Zhang, Philip H. S. Torr

In this paper, we tackle this problem by introducing Naturally-Sorted Hashing (NSH).

Contrastive Learning Deep Hashing

Paper
Add Code

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

1 code implementation • CVPR 2022 • Zhao Yang, Jiaqi Wang, Yansong Tang, Kai Chen, Hengshuang Zhao, Philip H. S. Torr

Referring image segmentation is a fundamental vision-language task that aims to segment out an object referred to by a natural language expression from an image.

Ranked #3 on Generalized Referring Expression Segmentation on gRefCOCO

Generalized Referring Expression Segmentation Image Segmentation +2

169

Paper
Code

Fixed Points in Cyber Space: Rethinking Optimal Evasion Attacks in the Age of AI-NIDS

no code implementations • 23 Nov 2021 • Christian Schroeder de Witt, Yongchao Huang, Philip H. S. Torr, Martin Strohmeier

We then argue that attacker-defender fixed points are themselves general-sum games with complex phase transitions, and introduce a temporally extended multi-agent reinforcement learning framework in which the resultant dynamics can be studied.

Continual Learning Multi-agent Reinforcement Learning +2

Paper
Add Code

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge

no code implementations • 15 Nov 2021 • Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

To promote the development of occlusion understanding, we collect a large-scale dataset called OVIS for video instance segmentation in the occluded scenario.

Instance Segmentation Object Recognition +3

Paper
Add Code

Deep Deterministic Uncertainty for Semantic Segmentation

no code implementations • 29 Oct 2021 • Jishnu Mukhoti, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

We extend Deep Deterministic Uncertainty (DDU), a method for uncertainty estimation using feature space densities, to semantic segmentation.

Segmentation Semantic Segmentation

Paper
Add Code

Detecting and Quantifying Malicious Activity with Simulation-based Inference

no code implementations • 6 Oct 2021 • Andrew Gambardella, Bogdan State, Naeemullah Khan, Leo Tsourides, Philip H. S. Torr, Atılım Güneş Baydin

We propose the use of probabilistic programming techniques to tackle the malicious user identification problem in a recommendation algorithm.

Probabilistic Programming

Paper
Add Code

RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

1 code implementation • 11 Sep 2021 • Shiyu Tang, Ruihao Gong, Yan Wang, Aishan Liu, Jiakai Wang, Xinyun Chen, Fengwei Yu, Xianglong Liu, Dawn Song, Alan Yuille, Philip H. S. Torr, DaCheng Tao

Thus, we propose RobustART, the first comprehensive Robustness investigation benchmark on ImageNet regarding ARchitecture design (49 human-designed off-the-shelf architectures and 1200+ networks from neural architecture search) and Training techniques (10+ techniques, e. g., data augmentation) towards diverse noises (adversarial, natural, and system noises).

Adversarial Robustness Benchmarking +2

143

Paper
Code

ANCER: Anisotropic Certification via Sample-wise Volume Maximization

1 code implementation • 9 Jul 2021 • Francisco Eiras, Motasem Alfarra, M. Pawan Kumar, Philip H. S. Torr, Puneet K. Dokania, Bernard Ghanem, Adel Bibi

Randomized smoothing has recently emerged as an effective tool that enables certification of deep neural network classifiers at scale.

Paper
Code

Do Different Tracking Tasks Require Different Appearance Models?

1 code implementation • NeurIPS 2021 • Zhongdao Wang, Hengshuang Zhao, Ya-Li Li, Shengjin Wang, Philip H. S. Torr, Luca Bertinetto

We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered.

Ranked #2 on Video Object Segmentation on DAVIS 2017 (mIoU metric)

Multi-Object Tracking Multi-Object Tracking and Segmentation +10

335

Paper
Code

DeformRS: Certifying Input Deformations with Randomized Smoothing

2 code implementations • 2 Jul 2021 • Motasem Alfarra, Adel Bibi, Naeemullah Khan, Philip H. S. Torr, Bernard Ghanem

Deep neural networks are vulnerable to input deformations in the form of vector fields of pixel displacements and to other parameterized geometric deformations e. g. translations, rotations, etc.

Paper
Code

Learning Multimodal VAEs through Mutual Supervision

1 code implementation • ICLR 2022 • Tom Joy, Yuge Shi, Philip H. S. Torr, Tom Rainforth, Sebastian M. Schmon, N. Siddharth

Here we introduce a novel alternative, the MEME, that avoids such explicit combinations by repurposing semi-supervised VAEs to combine information between modalities implicitly through mutual supervision.

Paper
Code

KL Guided Domain Adaptation

1 code implementation • ICLR 2022 • A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atılım Güneş Baydin

A common approach in the domain adaptation literature is to learn a representation of the input that has the same (marginal) distribution over the source and the target domain.

Domain Adaptation

Paper
Code

You Never Cluster Alone

no code implementations • NeurIPS 2021 • Yuming Shen, Ziyi Shen, Menghan Wang, Jie Qin, Philip H. S. Torr, Ling Shao

On one hand, with the corresponding assignment variables being the weight, a weighted aggregation along the data points implements the set representation of a cluster.

Clustering Contrastive Learning +1

Paper
Add Code

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

1 code implementation • 12 May 2021 • Yansong Tang, Zhenyu Jiang, Zhenda Xie, Yue Cao, Zheng Zhang, Philip H. S. Torr, Han Hu

Previous cycle-consistency correspondence learning methods usually leverage image patches for training.

Landmark Tracking Pose Tracking +3

Paper
Code

Gradient Matching for Domain Generalization

2 code implementations • ICLR 2022 • Yuge Shi, Jeffrey Seely, Philip H. S. Torr, N. Siddharth, Awni Hannun, Nicolas Usunier, Gabriel Synnaeve

We perform experiments on both the Wilds benchmark, which captures distribution shift in the real world, as well as datasets in DomainBed benchmark that focuses more on synthetic-to-real transfer.

Domain Generalization

1,327

Paper
Code

Solving Inefficiency of Self-supervised Representation Learning

1 code implementation • ICCV 2021 • Guangrun Wang, Keze Wang, Guangcong Wang, Philip H. S. Torr, Liang Lin

In this paper, we reveal two contradictory phenomena in contrastive learning that we call under-clustering and over-clustering problems, which are major obstacles to learning efficiency.

Ranked #1 on Self-Supervised Person Re-Identification on SYSU-30k

Clustering Contrastive Learning +4

Paper
Code

Improved Branch and Bound for Neural Network Verification via Lagrangian Decomposition

no code implementations • 14 Apr 2021 • Alessandro De Palma, Rudy Bunel, Alban Desmaison, Krishnamurthy Dvijotham, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

Finally, we design a BaB framework, named Branch and Dual Network Bound (BaDNB), based on our novel bounding and branching algorithms.

Adversarial Robustness

Paper
Add Code

Cloth Interactive Transformer for Virtual Try-On

1 code implementation • 12 Apr 2021 • Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Philip H. S. Torr, Nicu Sebe

In the second stage, we put forth a CIT reasoning block for establishing global mutual interactive dependencies among person representation, the warped clothing item, and the corresponding warped cloth mask.

Virtual Try-on

Paper
Code

Combating Adversaries with Anti-Adversaries

1 code implementation • ICML Workshop AML 2021 • Motasem Alfarra, Juan C. Pérez, Ali Thabet, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

Deep neural networks are vulnerable to small input perturbations known as adversarial attacks.

Paper
Code

Deep Deterministic Uncertainty: A Simple Baseline

4 code implementations • 23 Feb 2021 • Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

Reliable uncertainty from deterministic single-forward pass models is sought after because conventional methods of uncertainty quantification are computationally expensive.

Active Learning Uncertainty Quantification

115

Paper
Code

Shape-Tailored Deep Neural Networks

no code implementations • 16 Feb 2021 • Naeemullah Khan, Angira Sharma, Ganesh Sundaramoorthi, Philip H. S. Torr

We stack multiple PDE layers to generalize a deep CNN to arbitrary regions, and apply it to segmentation.

Segmentation

Paper
Add Code

Occluded Video Instance Segmentation: A Benchmark

2 code implementations • 2 Feb 2021 • Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

On the OVIS dataset, the highest AP achieved by state-of-the-art algorithms is only 16. 3, which reveals that we are still at a nascent stage for understanding objects, instances, and videos in a real-world scenario.

Ranked #39 on Video Instance Segmentation on OVIS validation

Instance Segmentation Segmentation +3

Paper
Code

Scaling the Convex Barrier with Sparse Dual Algorithms

no code implementations • ICLR 2021 • Alessandro De Palma, Harkirat Singh Behl, Rudy Bunel, Philip H. S. Torr, M. Pawan Kumar

Tight and efficient neural network bounding is crucial to the scaling of neural network verification systems.

Efficient Neural Network valid

Paper
Add Code

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

5 code implementations • CVPR 2021 • Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H. S. Torr, Li Zhang

In this paper, we aim to provide an alternative perspective by treating semantic segmentation as a sequence-to-sequence prediction task.

Ranked #2 on Semantic Segmentation on FoodSeg103 (using extra training data)

Medical Image Segmentation Segmentation +1

8,218

Paper
Code

On Batch Normalisation for Approximate Bayesian Inference

no code implementations • pproximateinference AABI Symposium 2021 • Jishnu Mukhoti, Puneet K. Dokania, Philip H. S. Torr, Yarin Gal

We study batch normalisation in the context of variational inference methods in Bayesian neural networks, such as mean-field or MC Dropout.

Bayesian Inference valid +1

Paper
Add Code

Multi-shot Temporal Event Localization: a Benchmark

1 code implementation • CVPR 2021 • Xiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr

Current developments in temporal event or action localization usually target actions captured by a single camera.

Ranked #2 on Temporal Action Localization on MUSES

Temporal Action Localization

Paper
Code

GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation

2 code implementations • 13 Dec 2020 • Xiaojuan Qi, Zhengzhe Liu, Renjie Liao, Philip H. S. Torr, Raquel Urtasun, Jiaya Jia

Note that GeoNet++ is generic and can be used in other depth/normal prediction frameworks to improve the quality of 3D reconstruction and pixel-wise accuracy of depth and surface normals.

3D Reconstruction Depth Estimation +2

119

Paper
Code

Data-Dependent Randomized Smoothing

no code implementations • 8 Dec 2020 • Motasem Alfarra, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

In this work, we revisit Gaussian randomized smoothing and show that the variance of the Gaussian distribution can be optimized at each input so as to maximize the certification radius for the construction of the smooth classifier.

Paper
Add Code

Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

6 code implementations • 18 Nov 2020 • Christian Schroeder de Witt, Tarun Gupta, Denys Makoviichuk, Viktor Makoviychuk, Philip H. S. Torr, Mingfei Sun, Shimon Whiteson

Most recently developed approaches to cooperative multi-agent reinforcement learning in the \emph{centralized training with decentralized execution} setting involve estimating a centralized, joint value function.

reinforcement-learning Reinforcement Learning (RL) +2

163

Paper
Code

Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation

1 code implementation • NeurIPS 2020 • Bowen Li, Xiaojuan Qi, Philip H. S. Torr, Thomas Lukasiewicz

To achieve this, a new word-level discriminator is proposed, which provides the generator with fine-grained training feedback at word-level, to facilitate training a lightweight generator that has a small number of parameters, but can still correctly focus on specific visual attributes of an image, and then edit them without affecting other contents that are not described in the text.

Generative Adversarial Network Image Manipulation +1

Paper
Code

Continual Learning in Low-rank Orthogonal Subspaces

1 code implementation • NeurIPS 2020 • Arslan Chaudhry, Naeemullah Khan, Puneet K. Dokania, Philip H. S. Torr

In continual learning (CL), a learner is faced with a sequence of tasks, arriving one after the other, and the goal is to remember all the tasks once the continual learning experience is finished.

Continual Learning

Paper
Code

AutoSimulate: (Quickly) Learning Synthetic Data Generation

no code implementations • ECCV 2020 • Harkirat Singh Behl, Atılım Güneş Baydin, Ran Gal, Philip H. S. Torr, Vibhav Vineet

Simulation is increasingly being used for generating large labelled datasets in many machine learning problems.

Synthetic Data Generation

Paper
Add Code

Bipartite Graph Reasoning GANs for Person Image Generation

1 code implementation • 10 Aug 2020 • Hao Tang, Song Bai, Philip H. S. Torr, Nicu Sebe

We present a novel Bipartite Graph Reasoning GAN (BiGraphGAN) for the challenging person image generation task.

Ranked #1 on Pose Transfer on Market-1501 (PCKh metric)

Pose Transfer

128

Paper
Code

XingGAN for Person Image Generation

2 code implementations • ECCV 2020 • Hao Tang, Song Bai, Li Zhang, Philip H. S. Torr, Nicu Sebe

We propose a novel Generative Adversarial Network (XingGAN or CrossingGAN) for person image generation tasks, i. e., translating the pose of a given person to a desired one.

Ranked #1 on Pose Transfer on Market-1501 (IS metric)

Generative Adversarial Network Pose Transfer

229

Paper
Code

WordCraft: An Environment for Benchmarking Commonsense Agents

1 code implementation • ICML Workshop LaReL 2020 • Minqi Jiang, Jelena Luketina, Nantas Nardelli, Pasquale Minervini, Philip H. S. Torr, Shimon Whiteson, Tim Rocktäschel

This is partly due to the lack of lightweight simulation environments that sufficiently reflect the semantics of the real world and provide knowledge sources grounded with respect to observations in an RL environment.

Benchmarking Knowledge Graphs +2

Paper
Code

How benign is benign overfitting?

no code implementations • 8 Jul 2020 • Amartya Sanyal, Puneet K. Dokania, Varun Kanade, Philip H. S. Torr

We investigate two causes for adversarial vulnerability in deep neural networks: bad data and (poorly) trained models.

Adversarial Robustness Representation Learning

Paper
Add Code

Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

no code implementations • ICLR 2021 • Yuge Shi, Brooks Paige, Philip H. S. Torr, N. Siddharth

Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language.

Paper
Add Code

STEER: Simple Temporal Regularization For Neural ODEs

no code implementations • 18 Jun 2020 • Arnab Ghosh, Harkirat Singh Behl, Emilien Dupont, Philip H. S. Torr, Vinay Namboodiri

Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive.

Time Series Time Series Analysis

Paper
Add Code

Capturing Label Characteristics in VAEs

2 code implementations • ICLR 2021 • Tom Joy, Sebastian M. Schmon, Philip H. S. Torr, N. Siddharth, Tom Rainforth

We present a principled approach to incorporating labels in VAEs that captures the rich characteristic information associated with those labels.

Paper
Code

Progressive Skeletonization: Trimming more fat from a network at initialization

1 code implementation • ICLR 2021 • Pau de Jorge, Amartya Sanyal, Harkirat S. Behl, Philip H. S. Torr, Gregory Rogez, Puneet K. Dokania

Recent studies have shown that skeletonization (pruning parameters) of networks \textit{at initialization} provides all the practical benefits of sparsity both at inference and training time, while only marginally degrading their performance.

Paper
Code

A Revised Generative Evaluation of Visual Dialogue

1 code implementation • 20 Apr 2020 • Daniela Massiceti, Viveka Kulharia, Puneet K. Dokania, N. Siddharth, Philip H. S. Torr

Evaluating Visual Dialogue, the task of answering a sequence of questions relating to a visual input, remains an open research challenge.

Paper
Code

Understanding the Effects of Data Parallelism and Sparsity on Neural Network Training

no code implementations • ICLR 2021 • Namhoon Lee, Thalaiyasingam Ajanthan, Philip H. S. Torr, Martin Jaggi

As a result, we find across various workloads of data set, network model, and optimization algorithm that there exists a general scaling trend between batch size and number of training steps to convergence for the effect of data parallelism, and further, difficulty of training under sparsity.

Network Pruning

Paper
Add Code

Cross-modal Deep Face Normals with Deactivable Skip Connections

1 code implementation • CVPR 2020 • Victoria Fernandez Abrevaya, Adnane Boukhayma, Philip H. S. Torr, Edmond Boyer

Core to our approach is a novel module that we call deactivable skip connections, which allows integrating both the auto-encoded and image-to-normal branches within the same architecture that can be trained end-to-end.

3D Face Reconstruction

Paper
Code

FACMAC: Factored Multi-Agent Centralised Policy Gradients

3 code implementations • NeurIPS 2021 • Bei Peng, Tabish Rashid, Christian A. Schroeder de Witt, Pierre-Alexandre Kamienny, Philip H. S. Torr, Wendelin Böhmer, Shimon Whiteson

We propose FACtored Multi-Agent Centralised policy gradients (FACMAC), a new method for cooperative multi-agent reinforcement learning in both discrete and continuous action spaces.

Q-Learning SMAC +2

309

Paper
Code

Holistically-Attracted Wireframe Parsing

1 code implementation • CVPR 2020 • Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

For computing line segment proposals, a novel exact dual representation is proposed which exploits a parsimonious geometric reparameterization for line segments and forms a holistic 4-dimensional attraction field map for an input image.

Ranked #4 on Line Segment Detection on York Urban Dataset

Line Segment Detection Wireframe Parsing

276

Paper
Code

Lagrangian Decomposition for Neural Network Verification

2 code implementations • 24 Feb 2020 • Rudy Bunel, Alessandro De Palma, Alban Desmaison, Krishnamurthy Dvijotham, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

Both the algorithms offer three advantages: (i) they yield bounds that are provably at least as tight as previous dual algorithms relying on Lagrangian relaxations; (ii) they are based on operations analogous to forward/backward pass of neural networks layers and are therefore easily parallelizable, amenable to GPU implementation and able to take advantage of the convolutional structure of problems; and (iii) they allow for anytime stopping while still providing valid bounds.

valid

Paper
Code

Calibrating Deep Neural Networks using Focal Loss

2 code implementations • NeurIPS 2020 • Jishnu Mukhoti, Viveka Kulharia, Amartya Sanyal, Stuart Golodetz, Philip H. S. Torr, Puneet K. Dokania

To facilitate the use of focal loss in practice, we also provide a principled approach to automatically select the hyperparameter involved in the loss function.

144

Paper
Code

Image-to-Image Translation with Text Guidance

no code implementations • 12 Feb 2020 • Bowen Li, Xiaojuan Qi, Philip H. S. Torr, Thomas Lukasiewicz

The goal of this paper is to embed controllable factors, i. e., natural language descriptions, into image-to-image translation with generative adversarial networks, which allows text descriptions to determine the visual attributes of synthetic images.

Image-to-Image Translation Part-Of-Speech Tagging +1

Paper
Add Code

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation

1 code implementation • 3 Feb 2020 • Hao Tang, Philip H. S. Torr, Nicu Sebe

In the first stage, the input image and the conditional semantic guidance are fed into a cycled semantic-guided generation network to produce initial coarse results.

Generative Adversarial Network Image-to-Image Translation +1

458

Paper
Code

Unifying Training and Inference for Panoptic Segmentation

no code implementations • CVPR 2020 • Qizhu Li, Xiaojuan Qi, Philip H. S. Torr

This panoptic submodule gives rise to a novel propagation mechanism for panoptic logits and enables the network to output a coherent panoptic segmentation map for both "stuff" and "thing" classes, without any post-processing.

Panoptic Segmentation Segmentation

Paper
Add Code

Rethinking Class Relations: Absolute-relative Supervised and Unsupervised Few-shot Learning

1 code implementation • CVPR 2021 • Hongguang Zhang, Piotr Koniusz, Songlei Jian, Hongdong Li, Philip H. S. Torr

The majority of existing few-shot learning methods describe image relations with binary labels.

Ranked #10 on Unsupervised Few-Shot Image Classification on Tiered ImageNet 5-way (5-shot)

Relation Unsupervised Few-Shot Image Classification +1

Paper
Code

Few-shot Action Recognition with Permutation-invariant Attention

1 code implementation • ECCV 2020 • Hongguang Zhang, Li Zhang, Xiaojuan Qi, Hongdong Li, Philip H. S. Torr, Piotr Koniusz

Such encoded blocks are aggregated by permutation-invariant pooling to make our approach robust to varying action lengths and long-range temporal dependencies whose patterns are unlikely to repeat even in clips of the same class.

Ranked #6 on Few Shot Action Recognition on Kinetics-100

Few-Shot action recognition Few Shot Action Recognition +3

Paper
Code

Improving Few-shot Learning by Spatially-aware Matching and CrossTransformer

no code implementations • 6 Jan 2020 • Hongguang Zhang, Philip H. S. Torr, Piotr Koniusz

In this paper, we study the impact of scale and location mismatch in the few-shot learning scenario, and propose a novel Spatially-aware Matching (SM) scheme to effectively perform matching across multiple scales and locations, and learn image relations by giving the highest weights to the best matching pairs.

Deblurring Few-Shot Learning +2

Paper
Add Code

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

2 code implementations • CVPR 2020 • Hao Tang, Dan Xu, Yan Yan, Philip H. S. Torr, Nicu Sebe

To tackle this issue, in this work we consider learning the scene generation in a local context, and correspondingly design a local class-specific generative network with semantic maps as a guidance, which separately constructs and learns sub-generators concentrating on the generation of different classes, and is able to provide more scene details.

Ranked #2 on Cross-View Image-to-Image Translation on Dayton (256×256) - aerial-to-ground

Image Generation Scene Generation

145

Paper
Code

Learning Regional Attraction for Line Segment Detection

no code implementations • 18 Dec 2019 • Nan Xue, Song Bai, Fu-Dong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang, Philip H. S. Torr

Given a line segment map, the proposed regional attraction first establishes the relationship between line segments and regions in the image lattice.

Line Segment Detection

Paper
Add Code

Lessons from reinforcement learning for biological representations of space

no code implementations • 13 Dec 2019 • Alex Muryy, Siddharth Narayanaswamy, Nantas Nardelli, Andrew Glennerster, Philip H. S. Torr

Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frames (e. g. 'head-centred', 'hand-centred' and 'world-based').

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

ManiGAN: Text-Guided Image Manipulation

3 code implementations • 12 Dec 2019 • Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H. S. Torr

The goal of our paper is to semantically edit parts of an image matching a given text that describes desired attributes (e. g., texture, colour, and background), while preserving other contents that are irrelevant to the text.

Generative Adversarial Network Image Manipulation +1

367

Paper
Code

Transflow Learning: Repurposing Flow Models Without Retraining

no code implementations • 29 Nov 2019 • Andrew Gambardella, Atılım Güneş Baydin, Philip H. S. Torr

It is well known that deep generative models have a rich latent space, and that it is possible to smoothly manipulate their outputs by traversing this latent space.

Bayesian Inference Style Transfer

Paper
Add Code

Siam R-CNN: Visual Tracking by Re-Detection

1 code implementation • CVPR 2020 • Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe

We present Siam R-CNN, a Siamese re-detection architecture which unleashes the full power of two-stage object detection approaches for visual object tracking.

Ranked #5 on Object Tracking on COESOT

Object object-detection +4

217

Paper
Code

AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks

2 code implementations • 27 Nov 2019 • Hao Tang, Hong Liu, Dan Xu, Philip H. S. Torr, Nicu Sebe

State-of-the-art methods in image-to-image translation are capable of learning a mapping from a source domain to a target domain with unpaired image data.

Ranked #1 on Facial Expression Translation on CelebA

Image-to-Image Translation Translation

621

Paper
Code

Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models

3 code implementations • NeurIPS 2019 • Yuge Shi, N. Siddharth, Brooks Paige, Philip H. S. Torr

In this work, we characterise successful learning of such models as the fulfillment of four criteria: i) implicit latent decomposition into shared and private subspaces, ii) coherent joint generation over all modalities, iii) coherent cross-generation across individual modalities, and iv) improved model learning for individual modalities through multi-modal integration.

172

Paper
Code

Anchor Diffusion for Unsupervised Video Object Segmentation

1 code implementation • ICCV 2019 • Zhao Yang, Qiang Wang, Luca Bertinetto, Weiming Hu, Song Bai, Philip H. S. Torr

Unsupervised video object segmentation has often been tackled by methods based on recurrent neural networks and optical flow.

Ranked #18 on Unsupervised Video Object Segmentation on DAVIS 2016 val

Image Segmentation Object +4

116

Paper
Code

Amortized Rejection Sampling in Universal Probabilistic Programming

1 code implementation • 20 Oct 2019 • Saeid Naderiparizi, Adam Ścibior, Andreas Munk, Mehrdad Ghadiri, Atılım Güneş Baydin, Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Philip H. S. Torr, Tom Rainforth, Yee Whye Teh, Frank Wood

Naive approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance.

Probabilistic Programming

Paper
Code

Mirror Descent View for Neural Network Quantization

1 code implementation • 18 Oct 2019 • Thalaiyasingam Ajanthan, Kartik Gupta, Philip H. S. Torr, Richard Hartley, Puneet K. Dokania

Quantizing large Neural Networks (NN) while maintaining the performance is highly desirable for resource-limited devices due to reduced memory and time complexity.

Quantization valid

Paper
Code

Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation

1 code implementation • ICCV 2019 • Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman

We propose an interactive GAN-based sketch-to-image translation method that helps novice users create images of simple objects.

Object Sketch-to-Image Translation +1

191

Paper
Code

Controllable Text-to-Image Generation

2 code implementations • NeurIPS 2019 • Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H. S. Torr

In this paper, we propose a novel controllable text-to-image generative adversarial network (ControlGAN), which can effectively synthesise high-quality images and also control parts of the image generation according to natural language descriptions.

Ranked #7 on Text-to-Image Generation on Multi-Modal-CelebA-HQ

Generative Adversarial Network Text-to-Image Generation

163

Paper
Code

Branch and Bound for Piecewise Linear Neural Network Verification

no code implementations • 14 Sep 2019 • Rudy Bunel, Jingyue Lu, Ilker Turkaslan, Philip H. S. Torr, Pushmeet Kohli, M. Pawan Kumar

We use the data sets to conduct a thorough experimental comparison of existing and new algorithms and to provide an inclusive analysis of the factors impacting the hardness of verification problems.

Paper
Add Code

Dual Graph Convolutional Network for Semantic Segmentation

6 code implementations • 13 Sep 2019 • Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr

Exploiting long-range contextual information is key for pixel-wise prediction tasks such as semantic segmentation.

Ranked #32 on Semantic Segmentation on Cityscapes test

Semantic Segmentation

342

Paper
Code

Dynamic Graph Message Passing Networks

1 code implementation • CVPR 2020 • Li Zhang, Dan Xu, Anurag Arnab, Philip H. S. Torr

We propose a dynamic graph message passing network, that significantly reduces the computational complexity compared to related works modelling a fully-connected graph.

Image Classification object-detection +3

Paper
Code

Real-Time Highly Accurate Dense Depth on a Power Budget using an FPGA-CPU Hybrid SoC

no code implementations • 17 Jul 2019 • Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Alessio Tonioni, Thomas Joy, Luigi Di Stefano, Simon Walker, Philip H. S. Torr

Obtaining highly accurate depth from stereo images in real time has many applications across computer vision and robotics, but in some contexts, upper bounds on power consumption constrain the feasible hardware to embedded platforms such as FPGAs.

Paper
Add Code

A Signal Propagation Perspective for Pruning Neural Networks at Initialization

1 code implementation • ICLR 2020 • Namhoon Lee, Thalaiyasingam Ajanthan, Stephen Gould, Philip H. S. Torr

Alternatively, a recent approach shows that pruning can be done at initialization prior to training, based on a saliency criterion called connection sensitivity.

Image Classification Network Pruning

Paper
Code

Stable Rank Normalization for Improved Generalization in Neural Networks and GANs

no code implementations • ICLR 2020 • Amartya Sanyal, Philip H. S. Torr, Puneet K. Dokania

Exciting new work on the generalization bounds for neural networks (NN) given by Neyshabur et al. , Bartlett et al. closely depend on two parameter-depenedent quantities: the Lipschitz constant upper-bound and the stable rank (a softer version of the rank operator).

Ranked #122 on Image Generation on CIFAR-10

Generalization Bounds Image Generation +1

Paper
Add Code

Hijacking Malaria Simulators with Probabilistic Programming

no code implementations • 29 May 2019 • Bradley Gram-Hansen, Christian Schröder de Witt, Tom Rainforth, Philip H. S. Torr, Yee Whye Teh, Atılım Güneş Baydin

Epidemiology simulations have become a fundamental tool in the fight against the epidemics of various infectious diseases like AIDS and malaria.

Epidemiology Probabilistic Programming

Paper
Add Code

Straight to Shapes++: Real-time Instance Segmentation Made More Accurate

1 code implementation • 27 May 2019 • Laurynas Miksys, Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

The STS model can run at 35 FPS on a high-end desktop, but its accuracy is significantly worse than that of offline state-of-the-art methods.

Autonomous Driving Data Augmentation +5

Paper
Code

Alpha MAML: Adaptive Model-Agnostic Meta-Learning

no code implementations • 17 May 2019 • Harkirat Singh Behl, Atılım Güneş Baydin, Philip H. S. Torr

Model-agnostic meta-learning (MAML) is a meta-learning technique to train a model on a multitude of learning tasks in a way that primes the model for few-shot learning of new tasks.

Few-Shot Learning General Classification

Paper
Add Code

GA-Net: Guided Aggregation Net for End-to-end Stereo Matching

3 code implementations • CVPR 2019 • Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H. S. Torr

In the stereo matching task, matching cost aggregation is crucial in both traditional methods and deep neural network models in order to accurately estimate disparities.

Ranked #4 on Stereo Depth Estimation on Spring

Stereo Depth Estimation Stereo Matching

541

Paper
Code

Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks

no code implementations • CVPR 2019 • Eunwoo Kim, Chanho Ahn, Philip H. S. Torr, Songhwai Oh

To this end, we propose a novel network architecture producing multiple networks of different configurations, termed deep virtual networks (DVNs), for different tasks.

Paper
Add Code

Learning to Adapt for Stereo

1 code implementation • CVPR 2019 • Alessio Tonioni, Oscar Rahnama, Thomas Joy, Luigi Di Stefano, Thalaiyasingam Ajanthan, Philip H. S. Torr

Real world applications of stereo depth estimation require models that are robust to dynamic variations in the environment.

Autonomous Driving Stereo Depth Estimation

Paper
Code

On Tiny Episodic Memories in Continual Learning

6 code implementations • 27 Feb 2019 • Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, Puneet K. Dokania, Philip H. S. Torr, Marc'Aurelio Ranzato

But for a successful knowledge transfer, the learner needs to remember how to perform previous tasks.

Ranked #7 on Class Incremental Learning on cifar100

Class Incremental Learning Transfer Learning

1,657

Paper
Code

Domain Partitioning Network

no code implementations • 21 Feb 2019 • Botos Csaba, Adnane Boukhayma, Viveka Kulharia, András Horváth, Philip H. S. Torr

Standard adversarial training involves two agents, namely a generator and a discriminator, playing a mini-max game.

Paper
Add Code

The StarCraft Multi-Agent Challenge

20 code implementations • 11 Feb 2019 • Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, Shimon Whiteson

In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem to fill this gap.

Ranked #6 on SMAC on SMAC 6h_vs_8z

Benchmarking Reinforcement Learning (RL) +3

1,716

Paper
Code

3D Hand Shape and Pose from Images in the Wild

2 code implementations • CVPR 2019 • Adnane Boukhayma, Rodrigo de Bem, Philip H. S. Torr

We present in this work the first end-to-end deep learning based method that predicts both 3D hand shape and pose from RGB images in the wild.

Ranked #10 on 3D Hand Pose Estimation on FreiHAND (PA-MPVPE metric)

3D Hand Pose Estimation Pose Prediction +1

403

Paper
Code

Adversarial Metric Attack and Defense for Person Re-identification

1 code implementation • 30 Jan 2019 • Song Bai, Yingwei Li, Yuyin Zhou, Qizhu Li, Philip H. S. Torr

However, our work observes the extreme vulnerability of existing distance metrics to adversarial examples, generated by simply adding human-imperceptible perturbations to person images.

Adversarial Attack Benchmarking +2

Paper
Code

Hypergraph Convolution and Hypergraph Attention

1 code implementation • 23 Jan 2019 • Song Bai, Feihu Zhang, Philip H. S. Torr

To efficiently learn deep embeddings on the high-order graph-structured data, we introduce two end-to-end trainable operators to the family of graph neural networks, i. e., hypergraph convolution and hypergraph attention.

Node Classification Representation Learning

20,054

Paper
Code

Learn to Interpret Atari Agents

1 code implementation • 29 Dec 2018 • Zhao Yang, Song Bai, Li Zhang, Philip H. S. Torr

Deep reinforcement learning (DeepRL) agents surpass human-level performance in many tasks.

Decision Making

Paper
Code

Visual Dialogue without Vision or Dialogue

2 code implementations • 16 Dec 2018 • Daniela Massiceti, Puneet K. Dokania, N. Siddharth, Philip H. S. Torr

We characterise some of the quirks and shortcomings in the exploration of Visual Dialogue - a sequential question-answering task where the questions and corresponding answers are related through given visual stimuli.

Question Answering Visual Dialog

Paper
Code

Fast Online Object Tracking and Segmentation: A Unifying Approach

3 code implementations • CVPR 2019 • Qiang Wang, Li Zhang, Luca Bertinetto, Weiming Hu, Philip H. S. Torr

In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach.

Ranked #3 on Visual Object Tracking on YouTube-VOS 2018

Object Real-Time Visual Tracking +4

3,448

Paper
Code

Proximal Mean-field for Neural Network Quantization

1 code implementation • ICCV 2019 • Thalaiyasingam Ajanthan, Puneet K. Dokania, Richard Hartley, Philip H. S. Torr

Compressing large Neural Networks (NN) by quantizing the parameters, while maintaining the performance is highly desirable due to reduced memory and time complexity.

Image Classification Quantization

Paper
Code

Meta Learning Deep Visual Words for Fast Video Object Segmentation

1 code implementation • 4 Dec 2018 • Harkirat Singh Behl, Mohammad Najafi, Anurag Arnab, Philip H. S. Torr

We address this problem by considering the task of video object segmentation.

Meta-Learning Object +5

Paper
Code

Deeper Interpretability of Deep Networks

no code implementations • 19 Nov 2018 • Tian Xu, Jiayu Zhan, Oliver G. B. Garrod, Philip H. S. Torr, Song-Chun Zhu, Robin A. A. Ince, Philippe G. Schyns

However, understanding the information represented and processed in CNNs remains in most cases challenging.

Paper
Add Code

R$^3$SGM: Real-time Raster-Respecting Semi-Global Matching for Power-Constrained Systems

no code implementations • 30 Oct 2018 • Oscar Rahnama, Tommaso Cavallari, Stuart Golodetz, Simon Walker, Philip H. S. Torr

Stereo depth estimation is used for many computer vision applications.

Stereo Depth Estimation

Paper
Add Code

Real-Time RGB-D Camera Pose Estimation in Novel Scenes using a Relocalisation Cascade

1 code implementation • 29 Oct 2018 • Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Victor A. Prisacariu, Luigi Di Stefano, Philip H. S. Torr

The adapted forests achieved relocalisation performance that was on par with that of offline forests, and our approach was able to estimate the camera pose in close to real time.

Pose Estimation

190

Paper
Code

Multi-Agent Common Knowledge Reinforcement Learning

1 code implementation • NeurIPS 2019 • Christian A. Schroeder de Witt, Jakob N. Foerster, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson

In this paper, we show that common knowledge between agents allows for complex decentralised coordination.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Code

SNIP: Single-shot Network Pruning based on Connection Sensitivity

8 code implementations • ICLR 2019 • Namhoon Lee, Thalaiyasingam Ajanthan, Philip H. S. Torr

To achieve this, we introduce a saliency criterion based on connection sensitivity that identifies structurally important connections in the network for the given task.

Image Classification Network Pruning +1

314

Paper
Code

Weakly- and Semi-Supervised Panoptic Segmentation

1 code implementation • ECCV 2018 • Qizhu Li, Anurag Arnab, Philip H. S. Torr

We present a weakly supervised model that jointly performs both semantic- and instance-segmentation -- a particularly relevant problem given the substantial cost of obtaining pixel-perfect annotation for these tasks.

Ranked #33 on Panoptic Segmentation on Cityscapes val

Instance Segmentation Panoptic Segmentation +4

160

Paper
Code

With Friends Like These, Who Needs Adversaries?

1 code implementation • NeurIPS 2018 • Saumya Jetley, Nicholas A. Lord, Philip H. S. Torr

Via a novel experimental analysis, we illustrate some facts about deep convolutional networks for image classification that shed new light on their behaviour and how it connects to the problem of adversaries.

Adversarial Attack Classification +2

Paper
Code

Value Propagation Networks

no code implementations • ICLR 2018 • Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, Nicolas Usunier

We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments.

Navigate reinforcement-learning +2

Paper
Add Code

Efficient Relaxations for Dense CRFs with Sparse Higher Order Potentials

no code implementations • 23 May 2018 • Thomas Joy, Alban Desmaison, Thalaiyasingam Ajanthan, Rudy Bunel, Mathieu Salzmann, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

The presented algorithms can be applied to any labelling problem using a dense CRF with sparse higher-order potentials.

Semantic Segmentation

Paper
Add Code

Meta-learning with differentiable closed-form solvers

5 code implementations • ICLR 2019 • Luca Bertinetto, João F. Henriques, Philip H. S. Torr, Andrea Vedaldi

The main idea is to teach a deep network to use standard machine learning tools, such as ridge regression, as part of its own internal model, enabling it to quickly adapt to novel data.

BIG-bench Machine Learning Few-Shot Learning +1

2,531

Paper
Code

Robustness via Deep Low-Rank Representations

no code implementations • ICLR 2019 • Amartya Sanyal, Varun Kanade, Philip H. S. Torr, Puneet K. Dokania

To achieve low dimensionality of learned representations, we propose an easy-to-use, end-to-end trainable, low-rank regularizer (LR) that can be applied to any intermediate layer representation of a DNN.

Clustering General Classification +2

Paper
Add Code

Learn To Pay Attention

4 code implementations • ICLR 2018 • Saumya Jetley, Nicholas A. Lord, Namhoon Lee, Philip H. S. Torr

We propose an end-to-end-trainable attention module for convolutional neural network (CNN) architectures built for image classification.

Adversarial Attack General Classification +3

161

Paper
Code

WebSeg: Learning Semantic Segmentation from Web Searches

no code implementations • 27 Mar 2018 • Qibin Hou, Ming-Ming Cheng, Jiang-Jiang Liu, Philip H. S. Torr

In this paper, we improve semantic segmentation by automatically learning from Flickr images associated with a particular keyword, without relying on any explicit user annotations, thus substantially alleviating the dependence on accurate annotations when compared to previous weakly supervised methods.

Segmentation Semantic Segmentation

Paper
Add Code

Three Birds One Stone: A General Architecture for Salient Object Segmentation, Edge Detection and Skeleton Extraction

no code implementations • 27 Mar 2018 • Qibin Hou, Jiang-Jiang Liu, Ming-Ming Cheng, Ali Borji, Philip H. S. Torr

Although these tasks are inherently very different, we show that our unified approach performs very well on all of them and works far better than current single-purpose state-of-the-art methods.

Edge Detection Semantic Segmentation

Paper
Add Code

Real-Time Dense Stereo Matching With ELAS on FPGA Accelerated Embedded Devices

1 code implementation • 20 Feb 2018 • Oscar Rahnama, Duncan Frost, Ondrej Miksik, Philip H. S. Torr

For many applications in low-power real-time robotics, stereo cameras are the sensors of choice for depth perception as they are typically cheaper and more versatile than their active counterparts.

Stereo Matching Stereo Matching Hand

Paper
Code

Devon: Deformable Volume Network for Learning Optical Flow

no code implementations • 20 Feb 2018 • Yao Lu, Jack Valmadre, Heng Wang, Juho Kannala, Mehrtash Harandi, Philip H. S. Torr

State-of-the-art neural network models estimate large displacement optical flow in multi-resolution and use warping to propagate the estimation between two resolutions.

Optical Flow Estimation

Paper
Add Code

FlipDial: A Generative Model for Two-Way Visual Dialogue

no code implementations • CVPR 2018 • Daniela Massiceti, N. Siddharth, Puneet K. Dokania, Philip H. S. Torr

We are the first to extend this paradigm to full two-way visual dialogue, where our model is capable of generating both questions and answers in sequence based on a visual input, for which we propose a set of novel evaluation measures and metrics.

Visual Dialog Vocal Bursts Valence Prediction

Paper
Add Code

Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence

2 code implementations • ECCV 2018 • Arslan Chaudhry, Puneet K. Dokania, Thalaiyasingam Ajanthan, Philip H. S. Torr

We observe that, in addition to forgetting, a known issue while preserving knowledge, IL also suffers from a problem we call intransigence, inability of a model to update its knowledge.

Incremental Learning

1,657

Paper
Code

Collaborative Large-Scale Dense 3D Reconstruction with Online Inter-Agent Pose Optimisation

no code implementations • 25 Jan 2018 • Stuart Golodetz, Tommaso Cavallari, Nicholas A. Lord, Victor A. Prisacariu, David W. Murray, Philip H. S. Torr

Reconstructing dense, volumetric models of real-world 3D scenes is important for many tasks, but capturing large scenes can take significant time, and the risk of transient changes to the scene goes up as the capture time increases.

3D Reconstruction

Paper
Add Code

Piecewise Linear Neural Networks verification: A comparative study

no code implementations • ICLR 2018 • Rudy Bunel, Ilker Turkaslan, Philip H. S. Torr, Pushmeet Kohli, M. Pawan Kumar

Motivated by the need of accelerating progress in this very important area, we investigate the trade-offs of a number of different approaches based on Mixed Integer Programming, Satisfiability Modulo Theory, as well as a novel method based on the Branch-and-Bound framework.

Paper
Add Code

On the Robustness of Semantic Segmentation Models to Adversarial Attacks

1 code implementation • CVPR 2018 • Anurag Arnab, Ondrej Miksik, Philip H. S. Torr

Deep Neural Networks (DNNs) have demonstrated exceptional performance on most recognition tasks such as image classification and segmentation.

General Classification Image Classification +3

Paper
Code

Learning to Compare: Relation Network for Few-Shot Learning

13 code implementations • CVPR 2018 • Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H. S. Torr, Timothy M. Hospedales

Once trained, a RN is able to classify images of new classes by computing relation scores between query images and the few examples of each new class without further updating the network.

Ranked #2 on Image Classification on Tiered ImageNet 5-way (5-shot)

Few-Shot Image Classification Few-Shot Learning +3

1,040

Paper
Code

A Unified View of Piecewise Linear Neural Network Verification

2 code implementations • NeurIPS 2018 • Rudy Bunel, Ilker Turkaslan, Philip H. S. Torr, Pushmeet Kohli, M. Pawan Kumar

The success of Deep Learning and its potential use in many safety-critical applications has motivated research on formal verification of Neural Network (NN) models.

Paper
Code

Holistic, Instance-Level Human Parsing

1 code implementation • 11 Sep 2017 • Qizhu Li, Anurag Arnab, Philip H. S. Torr

We address this problem by segmenting the parts of objects at an instance-level, such that each pixel in the image is assigned a part label, as well as the identity of the object it belongs to.

Ranked #2 on Multi-Human Parsing on PASCAL-Part

Human Detection Multi-Human Parsing +2

Paper
Code

InfiniTAM v3: A Framework for Large-Scale 3D Reconstruction with Loop Closure

1 code implementation • 2 Aug 2017 • Victor Adrian Prisacariu, Olaf Kähler, Stuart Golodetz, Michael Sapienza, Tommaso Cavallari, Philip H. S. Torr, David W. Murray

Representing the reconstruction volumetrically as a TSDF leads to most of the simplicity and efficiency that can be achieved with GPU implementations of these systems.

3D Reconstruction Simultaneous Localization and Mapping

909

Paper
Code

Spatio-temporal Human Action Localisation and Instance Segmentation in Temporally Untrimmed Videos

no code implementations • 22 Jul 2017 • Suman Saha, Gurkirt Singh, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

Current state-of-the-art human action recognition is focused on the classification of temporally trimmed videos in which only one action occurs per frame.

Action Recognition Instance Segmentation +2

Paper
Add Code

Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation

1 code implementation • 18 Jul 2017 • Arslan Chaudhry, Puneet K. Dokania, Philip H. S. Torr

We propose an approach to discover class-specific pixels for the weakly-supervised semantic segmentation task.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Code

Learning Disentangled Representations with Semi-Supervised Deep Generative Models

1 code implementation • NeurIPS 2017 • N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

We propose to learn such representations using model architectures that generalise from standard VAEs, employing a general graphical model structure in the encoder and decoder.

Representation Learning

878

Paper
Code

End-to-end representation learning for Correlation Filter based tracking

no code implementations • CVPR 2017 • Jack Valmadre, Luca Bertinetto, João F. Henriques, Andrea Vedaldi, Philip H. S. Torr

The Correlation Filter is an algorithm that trains a linear template to discriminate between images and their translations.

Ranked #3 on Visual Object Tracking on OTB-50

Object Tracking Representation Learning

Paper
Add Code

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

3 code implementations • CVPR 2017 • Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker

DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i. e., given the same context, future may vary), 2) foreseeing the potential future outcomes and make a strategic prediction based on that, and 3) reasoning not only from the past motion history, but also from the scene context as well as the interactions among the agents.

Ranked #1 on Trajectory Prediction on PAID

Future prediction Multi Future Trajectory Prediction +1

Paper
Code

Multi-Agent Diverse Generative Adversarial Networks

1 code implementation • CVPR 2018 • Arnab Ghosh, Viveka Kulharia, Vinay Namboodiri, Philip H. S. Torr, Puneet K. Dokania

Second, to enforce that different generators capture diverse high probability modes, the discriminator of MAD-GAN is designed such that along with finding the real and fake samples, it is also required to identify the generator that generated the given fake sample.

Face Generation Image-to-Image Translation +1

Paper
Code

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

1 code implementation • CVPR 2017 • Anurag Arnab, Philip H. S. Torr

This subnetwork uses the initial category-level segmentation, along with cues from the output of an object detector, within an end-to-end CRF to predict instances.

Ranked #10 on Panoptic Segmentation on Cityscapes test

Instance Segmentation Object +4

Paper
Code

Incremental Tube Construction for Human Action Detection

1 code implementation • 5 Apr 2017 • Harkirat Singh Behl, Michael Sapienza, Gurkirt Singh, Suman Saha, Fabio Cuzzolin, Philip H. S. Torr

In this work, we introduce a real-time and online joint-labelling and association algorithm for action detection that can incrementally construct space-time action tubes on the most challenging action videos in which different action categories occur concurrently.

Action Detection

Paper
Code

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

5 code implementations • ICML 2017 • Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson

Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems.

Multi-agent Reinforcement Learning Q-Learning +3

350

Paper
Code

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

no code implementations • CVPR 2017 • Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr

Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and mapping, virtual/augmented reality and navigation.

Camera Relocalization regression

Paper
Add Code

ROAM: a Rich Object Appearance Model with Application to Rotoscoping

no code implementations • CVPR 2017 • Ondrej Miksik, Juan-Manuel Pérez-Rúa, Philip H. S. Torr, Patrick Pérez

Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines.

Object

Paper
Add Code

Learning to superoptimize programs - Workshop Version

no code implementations • 4 Dec 2016 • Rudy Bunel, Alban Desmaison, M. Pawan Kumar, Philip H. S. Torr, Pushmeet Kohli

Superoptimization requires the estimation of the best program for a given computational task.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Playing Doom with SLAM-Augmented Deep Reinforcement Learning

1 code implementation • 1 Dec 2016 • Shehroze Bhatti, Alban Desmaison, Ondrej Miksik, Nantas Nardelli, N. Siddharth, Philip H. S. Torr

A number of recent approaches to policy learning in 2D game domains have been successful going directly from raw input images to actions.

object-detection Object Detection +3

Paper
Code

Efficient Linear Programming for Dense CRFs

no code implementations • CVPR 2017 • Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar

To this end, we develop a proximal minimization framework, where the dual of each proximal problem is optimized via block coordinate descent.

Semantic Segmentation

Paper
Add Code

Straight to Shapes: Real-time Detection of Encoded Shapes

1 code implementation • CVPR 2017 • Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

To achieve this, we use a denoising convolutional auto-encoder to establish an embedding space, and place the decoder after a fast end-to-end network trained to regress directly to the encoded shape vectors.

Ranked #5 on Semantic Contour Prediction on Sbd val

Denoising object-detection +1

Paper
Code

Inducing Interpretable Representations with Variational Autoencoders

no code implementations • 22 Nov 2016 • N. Siddharth, Brooks Paige, Alban Desmaison, Jan-Willem van de Meent, Frank Wood, Noah D. Goodman, Pushmeet Kohli, Philip H. S. Torr

We develop a framework for incorporating structured graphical models in the \emph{encoders} of variational autoencoders (VAEs) that allows us to induce interpretable representations through approximate variational inference.

General Classification Variational Inference

Paper
Add Code

Learning to superoptimize programs

no code implementations • 6 Nov 2016 • Rudy Bunel, Alban Desmaison, M. Pawan Kumar, Philip H. S. Torr, Pushmeet Kohli

This approach involves repeated sampling of modifications to the program from a proposal distribution, which are accepted or rejected based on whether they preserve correctness, and the improvement they achieve.

Paper
Add Code

Random Forests versus Neural Networks - What's Best for Camera Localization?

no code implementations • 19 Sep 2016 • Daniela Massiceti, Alexander Krull, Eric Brachmann, Carsten Rother, Philip H. S. Torr

This work addresses the task of camera localization in a known 3D scene given a single input RGB image.

Camera Localization regression

Paper
Add Code

Fully-Trainable Deep Matching

1 code implementation • 12 Sep 2016 • James Thewlis, Shuai Zheng, Philip H. S. Torr, Andrea Vedaldi

Deep Matching (DM) is a popular high-quality method for quasi-dense image matching.

Image Segmentation Semantic Segmentation

Paper
Code

Bottom-up Instance Segmentation using Deep Higher-Order CRFs

no code implementations • 8 Sep 2016 • Anurag Arnab, Philip H. S. Torr

Traditional Scene Understanding problems such as Object Detection and Semantic Segmentation have made breakthroughs in recent years due to the adoption of deep learning.

Instance Segmentation Object +5

Paper
Add Code

Efficient Continuous Relaxations for Dense CRF

no code implementations • 22 Aug 2016 • Alban Desmaison, Rudy Bunel, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

In contrast to the continuous relaxation-based energy minimisation algorithms used for sparse CRFs, the mean-field algorithm fails to provide strong theoretical guarantees on the quality of its solutions.

Semantic Segmentation Variational Inference

Paper
Add Code

Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos

no code implementations • 4 Aug 2016 • Suman Saha, Gurkirt Singh, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin

In stage 2, the appearance network detections are boosted by combining them with the motion detection scores, in proportion to their respective spatial overlap.

Action Detection Motion Detection +1

Paper
Add Code

Fully-Convolutional Siamese Networks for Object Tracking

10 code implementations • 30 Jun 2016 • Luca Bertinetto, Jack Valmadre, João F. Henriques, Andrea Vedaldi, Philip H. S. Torr

The problem of arbitrary object tracking has traditionally been tackled by learning a model of the object's appearance exclusively online, using as sole training data the video itself.

Ranked #3 on Visual Object Tracking on OTB-50

Object object-detection +2

615

Paper
Code

Learning feed-forward one-shot learners

no code implementations • NeurIPS 2016 • Luca Bertinetto, João F. Henriques, Jack Valmadre, Philip H. S. Torr, Andrea Vedaldi

In this paper, we propose a method to learn the parameters of a deep model in one shot.

One-Shot Learning Visual Object Tracking

Paper
Add Code

Adaptive Neural Compilation

1 code implementation • NeurIPS 2016 • Rudy Bunel, Alban Desmaison, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

We show that it is possible to compile programs written in a low-level language to a differentiable representation.

Paper
Code

Recurrent Instance Segmentation

no code implementations • 25 Nov 2015 • Bernardino Romera-Paredes, Philip H. S. Torr

Instance segmentation is the problem of detecting and delineating each distinct object of interest appearing in an image.

Instance Segmentation Occlusion Handling +3

Paper
Add Code

Joint Training of Generic CNN-CRF Models with Stochastic Optimization

no code implementations • 16 Nov 2015 • Alexander Kirillov, Dmitrij Schlesinger, Shuai Zheng, Bogdan Savchynskyy, Philip H. S. Torr, Carsten Rother

We propose a new CNN-CRF end-to-end learning framework, which is based on joint stochastic optimization with respect to both Convolutional Neural Network (CNN) and Conditional Random Field (CRF) parameters.

Stochastic Optimization

Paper
Add Code

Sequential Optimization for Efficient High-Quality Object Proposal Generation

no code implementations • 14 Nov 2015 • Ziming Zhang, Yun Liu, Xi Chen, Yanjun Zhu, Ming-Ming Cheng, Venkatesh Saligrama, Philip H. S. Torr

We propose a novel object proposal algorithm, BING++, which inherits the virtue of good computational efficiency of BING but significantly improves its proposal localization quality.

Computational Efficiency Object +3

Paper
Add Code

SemanticPaint: A Framework for the Interactive Segmentation of 3D Scenes

no code implementations • 13 Oct 2015 • Stuart Golodetz, Michael Sapienza, Julien P. C. Valentin, Vibhav Vineet, Ming-Ming Cheng, Anurag Arnab, Victor A. Prisacariu, Olaf Kähler, Carl Yuheng Ren, David W. Murray, Shahram Izadi, Philip H. S. Torr

We present an open-source, real-time implementation of SemanticPaint, a system for geometric reconstruction, object-class segmentation and learning of 3D scenes.

Interactive Segmentation Segmentation

Paper
Add Code

An embarrassingly simple approach to zero-shot learning

1 code implementation • Proceedings of the International Conference on International Conference on Machine Learning 2015 • Bernardino Romera-Paredes, Philip H. S. Torr

Zero-shot learning consists in learning how to recognise new concepts by just having a description of them.

Ranked #8 on Zero-Shot Action Recognition on Olympics

Domain Adaptation Zero-Shot Action Recognition +1

Paper
Code

Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization

no code implementations • CVPR 2015 • Julien Valentin, Matthias Niessner, Jamie Shotton, Andrew Fitzgibbon, Shahram Izadi, Philip H. S. Torr

Recent advances in camera relocalization use predictions from a regression forest to guide the camera pose optimization procedure.

Camera Relocalization regression

Paper
Add Code

Target Identity-Aware Network Flow for Online Multiple Target Tracking

no code implementations • CVPR 2015 • Afshin Dehghan, Yicong Tian, Philip H. S. Torr, Mubarak Shah

In this paper we show that multiple object tracking (MOT) can be formulated in a framework, where the detection and data-association are performed simultaneously.

Multiple Object Tracking object-detection +1

Paper
Add Code

Conditional Random Fields as Recurrent Neural Networks

6 code implementations • ICCV 2015 • Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr

Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding.

Ranked #36 on Semantic Segmentation on PASCAL VOC 2012 test

Image Segmentation Real-Time Semantic Segmentation +1

1,334

Paper
Code

Large-scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications

no code implementations • 27 Nov 2014 • Peng Wang, Chunhua Shen, Anton Van Den Hengel, Philip H. S. Torr

Two standard relaxation methods are widely used for solving general BQPs--spectral methods and semidefinite programming (SDP), each with their own advantages and disadvantages.

Clustering Image Segmentation +2

Paper
Add Code

A Framework for the Volumetric Integration of Depth Images

no code implementations • 3 Oct 2014 • Victor Adrian Prisacariu, Olaf Kähler, Ming Ming Cheng, Carl Yuheng Ren, Julien Valentin, Philip H. S. Torr, Ian D. Reid, David W. Murray

Along with the framework we also provide a set of components for scalable reconstruction: two implementations of camera trackers, based on RGB data and on depth data, two representations of the 3D volumetric data, a dense volume and one based on hashes of subblocks, and an optional module for swapping subblocks in and out of the typically limited GPU memory.

3D Reconstruction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.