no code implementations • ECCV 2020 • Viveka Kulharia, Siddhartha Chandra, Amit Agrawal, Philip Torr, Ambrish Tyagi
We propose a weakly supervised approach to semantic segmentation using bounding box annotations.
no code implementations • 25 Apr 2024 • Kuofeng Gao, Jindong Gu, Yang Bai, Shu-Tao Xia, Philip Torr, Wei Liu, Zhifeng Li
For verbose videos, a frame feature diversity loss is proposed to increase the feature diversity among frames.
no code implementations • 23 Apr 2024 • Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr
In traditional statistical learning, data points are usually assumed to be independently and identically distributed (i.i.d.).
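To make the i.i.d. assumption concrete (a generic illustration, not taken from this paper): under i.i.d. sampling, the empirical mean is an unbiased estimate of the population mean and concentrates as the sample grows.

```python
import random

random.seed(0)

# Draw i.i.d. samples from a fixed distribution (uniform on [0, 1]).
# Under the i.i.d. assumption, the sample mean is an unbiased estimator
# of the population mean (here 0.5) and concentrates as n grows.
samples = [random.random() for _ in range(100_000)]
empirical_mean = sum(samples) / len(samples)

print(round(empirical_mean, 2))  # close to the true mean 0.5
```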
no code implementations • 15 Apr 2024 • Zhongrui Gui, Shuyang Sun, Runjia Li, Jianhao Yuan, Zhaochong An, Karsten Roth, Ameya Prabhu, Philip Torr
Rapid advancements in continual segmentation have yet to bridge the gap of scaling to large continually expanding vocabularies under compute-constrained scenarios.
2 code implementations • 11 Apr 2024 • Runtao Liu, Ashkan Khakzar, Jindong Gu, Qifeng Chen, Philip Torr, Fabio Pizzati
Hence, we propose Latent Guard, a framework designed to improve safety measures in text-to-image generation.
no code implementations • 7 Apr 2024 • Yuanfeng Xu, Yuhao Chen, Zhongzhan Huang, Zijian He, Guangrun Wang, Philip Torr, Liang Lin
In this paper, we present AnimateZoo, a zero-shot diffusion-based video generator to address this challenging cross-species animation issue, aiming to accurately produce animal animations while preserving the background.
no code implementations • 4 Apr 2024 • Shuo Chen, Zhen Han, Bailan He, Zifeng Ding, Wenqian Yu, Philip Torr, Volker Tresp, Jindong Gu
Various jailbreak attacks have been proposed to red-team Large Language Models (LLMs), revealing vulnerabilities in the safeguards of LLMs.
no code implementations • 3 Apr 2024 • Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu
In this work, we study the origin attribution of generated images in a practical setting where only a few images generated by a source model are available and the source model cannot be accessed.
1 code implementation • 25 Mar 2024 • Yuanze Lin, Ronald Clark, Philip Torr
We present DreamPolisher, a novel Gaussian Splatting based method with geometric guidance, tailored to learn cross-view consistency and intricate detail from textual descriptions.
no code implementations • 21 Mar 2024 • Yufan Chen, Jiaming Zhang, Kunyu Peng, Junwei Zheng, Ruiping Liu, Philip Torr, Rainer Stiefelhagen
To address this, we are the first to introduce a robustness benchmark for DLA models, which includes 450K document images from three datasets.
1 code implementation • 20 Mar 2024 • Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem
We explore the impact of training with more diverse datasets, characterized by the number of unique samples, on the performance of self-supervised learning (SSL) under a fixed computational budget.
no code implementations • 19 Mar 2024 • Anjun Hu, Jindong Gu, Francesco Pinto, Konstantinos Kamnitsas, Philip Torr
Foundation models pre-trained on web-scale vision-language data, such as CLIP, are widely used as cornerstones of powerful machine learning systems.
no code implementations • 19 Mar 2024 • Yixuan Wu, Yizhou Wang, Shixiang Tang, Wenhao Wu, Tong He, Wanli Ouyang, Jian Wu, Philip Torr
We present DetToolChain, a novel prompting paradigm, to unleash the zero-shot object detection ability of multimodal large language models (MLLMs), such as GPT-4V and Gemini.
no code implementations • 18 Mar 2024 • Junlin Han, Filippos Kokkinos, Philip Torr
This results in a significant disparity in scale compared to the vast quantities of other types of data.
no code implementations • 17 Mar 2024 • Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart
Reinforcement learning algorithms utilizing policy gradients (PG) to optimize Conditional Value at Risk (CVaR) face significant challenges with sample inefficiency, hindering their practical applications.
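For context on the objective (a hedged, generic sketch with illustrative toy data; not the authors' policy-gradient method): CVaR at level α is the expected return over the worst α-fraction of outcomes, estimated here by plain Monte Carlo.

```python
def cvar(returns, alpha=0.1):
    """Estimate CVaR_alpha: the mean of the worst alpha-fraction of returns."""
    n_tail = max(1, int(len(returns) * alpha))
    worst = sorted(returns)[:n_tail]  # lowest returns = worst outcomes
    return sum(worst) / len(worst)

# Toy returns from some policy rollout (illustrative numbers only).
returns = [10, 8, 7, 5, 3, 1, -2, -5, -9, -12]
print(cvar(returns, alpha=0.2))  # mean of the two worst returns: -10.5
```

Because only the tail samples carry gradient signal, a PG estimator built on CVaR discards most rollouts, which is one intuition for the sample inefficiency the entry mentions.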
1 code implementation • 14 Mar 2024 • Haochen Luo, Jindong Gu, Fengyuan Liu, Philip Torr
Given that VLMs rely on prompts to adapt to different tasks, an intriguing question emerges: Can a single adversarial image mislead all predictions of VLMs when a thousand different prompts are given?
no code implementations • 13 Mar 2024 • Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu
We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by the 3D Gaussian Splatting (3DGS).
1 code implementation • 7 Mar 2024 • Qilang Ye, Zitong Yu, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun Cao
This paper focuses on the challenge of answering questions in scenarios that are composed of rich and complex dynamic audio-visual components.
Audio-Visual Question Answering (AVQA) +5
1 code implementation • 29 Feb 2024 • Ameya Prabhu, Vishaal Udandarao, Philip Torr, Matthias Bethge, Adel Bibi, Samuel Albanie
However, with repeated testing, the risk of overfitting grows as algorithms over-exploit benchmark idiosyncrasies.
no code implementations • 22 Feb 2024 • Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, Jindong Gu
Our research evaluates the adversarial robustness of MLLMs when employing CoT reasoning, finding that CoT marginally improves adversarial robustness against existing attack methods.
1 code implementation • 21 Feb 2024 • Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal
We hope our work spurs research towards developing better methods for corrective unlearning and offers practitioners a new strategy to handle data integrity challenges arising from web-scale training.
1 code implementation • 7 Feb 2024 • Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Kai Shu, Adel Bibi, Ziniu Hu, Philip Torr, Bernard Ghanem, Guohao Li
In addition, we probe into the biases in agent trust and the differences in agent trust towards agents and humans.
1 code implementation • 2 Feb 2024 • Hasan Abed Al Kader Hammoud, Hani Itani, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem
We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic text-image pairs, significantly departing from previous methods relying on real data.
1 code implementation • 20 Jan 2024 • Kuofeng Gao, Yang Bai, Jindong Gu, Shu-Tao Xia, Philip Torr, Zhifeng Li, Wei Liu
Once attackers maliciously induce high energy consumption and latency (energy-latency cost) during inference of VLMs, computational resources will be exhausted.
no code implementations • 23 Dec 2023 • Fazl Barez, Philip Torr
As artificial intelligence (AI) systems become increasingly integrated into various domains, ensuring that they align with human values becomes critical.
no code implementations • 19 Dec 2023 • Jinghao Zhou, Tomas Jakab, Philip Torr, Christian Rupprecht
Recently, 3D generative models have made impressive progress, enabling the generation of almost arbitrary 3D assets from text or image inputs.
no code implementations • 12 Dec 2023 • Shuyang Sun, Runjia Li, Philip Torr, Xiuye Gu, Siyang Li
Mask labels are labor-intensive, which limits the number of categories in segmentation datasets.
no code implementations • 1 Dec 2023 • Botos Csaba, Wenxuan Zhang, Matthias Müller, Ser-Nam Lim, Mohamed Elhoseiny, Philip Torr, Adel Bibi
We introduce a new continual learning framework with explicit modeling of the label delay between data and label streams over time steps.
no code implementations • 30 Nov 2023 • Avery Ma, Amir-Massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu
During the alignment process, the parameters of the source model are fine-tuned to minimize an alignment loss.
no code implementations • 29 Nov 2023 • Shuo Chen, Zhen Han, Bailan He, Mark Buckley, Philip Torr, Volker Tresp, Jindong Gu
Our findings indicate that ICL in VLMs is predominantly driven by the textual information in the demonstrations whereas the visual information in the demonstrations barely affects the ICL performance.
no code implementations • 28 Nov 2023 • Hang Li, Chengzhi Shen, Philip Torr, Volker Tresp, Jindong Gu
A risk with these models is the potential generation of inappropriate content, such as biased or harmful images.
no code implementations • 26 Oct 2023 • Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila Mcilraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann
In this short consensus paper, we outline risks from upcoming, advanced AI systems.
1 code implementation • 26 Oct 2023 • Jindong Gu, Xiaojun Jia, Pau de Jorge, Wenqian Yu, Xinwei Liu, Avery Ma, Yuan Xun, Anjun Hu, Ashkan Khakzar, Zhijiang Li, Xiaochun Cao, Philip Torr
This survey explores the landscape of the adversarial transferability of adversarial examples.
1 code implementation • 16 Oct 2023 • Jianhao Yuan, Jie Zhang, Shuyang Sun, Philip Torr, Bo Zhao
Synthetic training data has gained prominence in numerous learning tasks and scenarios, offering advantages such as dataset augmentation, generalization evaluation, and privacy preservation.
no code implementations • 12 Oct 2023 • Luke Marks, Amir Abdullah, Clement Neo, Rauno Arike, Philip Torr, Fazl Barez
Large language models (LLMs) fine-tuned by reinforcement learning from human feedback (RLHF) are becoming more widely deployed.
1 code implementation • 11 Oct 2023 • Jia-Wang Bian, Wenjing Bian, Victor Adrian Prisacariu, Philip Torr
On the MobileBrick dataset, which contains casually captured unbounded 360-degree videos, our method refines ARKit poses and improves the reconstruction F1 score from 69.18 to 75.67, outperforming the result obtained with the dataset-provided ground-truth poses (75.14).
no code implementations • 10 Oct 2023 • Yang Zhang, Yawei Li, Hannah Brown, Mina Rezaei, Bernd Bischl, Philip Torr, Ashkan Khakzar, Kenji Kawaguchi
Feature attribution explains neural network outputs by identifying relevant input features.
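As a minimal illustration of feature attribution (a generic occlusion-style sketch on an assumed toy model, not this paper's method): each feature's relevance is measured as the output drop when that feature is replaced by a baseline.

```python
def occlusion_attribution(f, x, baseline=0.0):
    """Attribute f(x) to each input feature by replacing the feature with a
    baseline value and measuring the drop in the output (a simple
    perturbation-based attribution)."""
    full = f(x)
    attributions = []
    for i in range(len(x)):
        occluded = list(x)
        occluded[i] = baseline
        attributions.append(full - f(occluded))
    return attributions

# Toy "network": a fixed linear scorer. For a linear model, occlusion
# recovers exactly w_i * x_i per feature.
w = [2.0, -1.0, 0.5]
f = lambda x: sum(wi * xi for wi, xi in zip(w, x))
print(occlusion_attribution(f, [1.0, 3.0, 4.0]))  # [2.0, -3.0, 2.0]
```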
no code implementations • 12 Sep 2023 • Jindong Gu, Fangyun Wei, Philip Torr, Han Hu
In this work, we first taxonomize the stochastic defense strategies against QBBA.
2 code implementations • 3 Aug 2023 • Yibo Yang, Haobo Yuan, Xiangtai Li, Jianlong Wu, Lefei Zhang, Zhouchen Lin, Philip Torr, DaCheng Tao, Bernard Ghanem
Beyond the normal case, long-tail class incremental learning and few-shot class incremental learning are also proposed to consider the data imbalance and data scarcity, respectively, which are common in real-world implementations and further exacerbate the well-known problem of catastrophic forgetting.
1 code implementation • 24 Jul 2023 • Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr
This paper aims to provide a comprehensive survey of cutting-edge research in prompt engineering on three types of vision-language models: multimodal-to-text generation models (e.g. Flamingo), image-text matching models (e.g.
no code implementations • ICCV 2023 • Runjia Li, Shuyang Sun, Mohamed Elhoseiny, Philip Torr
Hence, humour generation and understanding can serve as a new task for evaluating the ability of deep-learning methods to process abstract and subjective information.
1 code implementation • NeurIPS 2023 • Shuyang Sun, Weijun Wang, Qihang Yu, Andrew Howard, Philip Torr, Liang-Chieh Chen
This paper presents a new mechanism to facilitate the training of mask transformers for efficient panoptic segmentation, democratizing its deployment.
no code implementations • 14 Jun 2023 • Wenqian Yu, Jindong Gu, Zhijiang Li, Philip Torr
Adversarial examples (AEs) with small adversarial perturbations can mislead deep neural networks (DNNs) into wrong predictions.
1 code implementation • 27 May 2023 • Liheng Ma, Chen Lin, Derek Lim, Adriana Romero-Soriano, Puneet K. Dokania, Mark Coates, Philip Torr, Ser-Nam Lim
Graph inductive biases are crucial for Graph Transformers, and previous works incorporate them using message-passing modules and/or positional encodings.
Ranked #1 on Node Classification on PATTERN
no code implementations • 17 May 2023 • Francisco Eiras, Adel Bibi, Rudy Bunel, Krishnamurthy Dj Dvijotham, Philip Torr, M. Pawan Kumar
Recent work provides promising evidence that physics-informed neural networks (PINNs) can efficiently solve partial differential equations (PDEs).
1 code implementation • 16 May 2023 • Ameya Prabhu, Zhipeng Cai, Puneet Dokania, Philip Torr, Vladlen Koltun, Ozan Sener
In this paper, we target such applications, investigating the online continual learning problem under relaxed storage constraints and limited computational budgets.
no code implementations • 17 Apr 2023 • Jindong Gu, Ahmad Beirami, Xuezhi Wang, Alex Beutel, Philip Torr, Yao Qin
With the advent of vision-language models (VLMs) that can perform in-context and prompt-based learning, how can we design prompting approaches that robustly generalize to distribution shift and can be used on novel classes outside the support set of the prompts?
1 code implementation • 21 Mar 2023 • Haoheng Lan, Jindong Gu, Philip Torr, Hengshuang Zhao
In this work, we explore backdoor attacks on segmentation models that misclassify all pixels of a victim class by injecting a specific trigger on non-victim pixels during inference, which we dub the Influencer Backdoor Attack (IBA).
1 code implementation • CVPR 2023 • Pau de Jorge, Riccardo Volpi, Philip Torr, Gregory Rogez
We analyze a broad variety of models, spanning from older ResNet-based architectures to novel transformers and assess their reliability based on four metrics: robustness, calibration, misclassification detection and out-of-distribution (OOD) detection.
no code implementations • 7 Feb 2023 • Zitong Yu, Yuming Shen, Jingang Shi, Hengshuang Zhao, Yawen Cui, Jiehua Zhang, Philip Torr, Guoying Zhao
As key modules in PhysFormer, the temporal difference transformers first enhance the quasi-periodic rPPG features with temporal difference guided global attention, and then refine the local spatio-temporal representation against interference.
1 code implementation • 6 Feb 2023 • Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip Torr, DaCheng Tao
In this paper, we address this misalignment dilemma in FSCIL, inspired by the recently discovered phenomenon of neural collapse: the last-layer features of each class collapse into a vertex, and the vertices of all classes align with the classifier prototypes, together forming a simplex equiangular tight frame (ETF).
1 code implementation • ICLR 2023 • Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip Torr, DaCheng Tao
In this paper, we address this misalignment dilemma in FSCIL, inspired by the recently discovered phenomenon of neural collapse: the last-layer features of each class collapse into a vertex, and the vertices of all classes align with the classifier prototypes, together forming a simplex equiangular tight frame (ETF).
Ranked #3 on Few-Shot Class-Incremental Learning on CUB-200-2011
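The simplex ETF geometry cited in these two entries is easy to check numerically: K such unit vectors have pairwise cosine similarity exactly -1/(K-1). A hedged construction (standard linear algebra, not the authors' code):

```python
import math

def simplex_etf(k):
    """Build k unit vectors in R^k forming a simplex ETF: centre the
    standard basis vectors (subtract the mean 1/k from every coordinate),
    then normalise. Pairwise cosine similarity is -1/(k-1)."""
    vecs = []
    for i in range(k):
        v = [-1.0 / k] * k
        v[i] += 1.0
        norm = math.sqrt(sum(c * c for c in v))
        vecs.append([c / norm for c in v])
    return vecs

K = 4
vecs = simplex_etf(K)
cos = sum(a * b for a, b in zip(vecs[0], vecs[1]))
print(round(cos, 6))  # -1/(K-1) = -0.333333
```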
no code implementations • 21 Dec 2022 • Jianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr
Neural image classifiers are known to undergo severe performance degradation when exposed to inputs that exhibit covariate shifts with respect to the training distribution.
no code implementations • 11 Dec 2022 • Xiaogang Xu, Hengshuang Zhao, Philip Torr, Jiaya Jia
In this paper, we use Deep Generative Networks (DGNs) with a novel training mechanism to eliminate the distribution gap.
no code implementations • 29 Nov 2022 • Shuyang Sun, Jie-Neng Chen, Ruifei He, Alan Yuille, Philip Torr, Song Bai
LUMix is simple: it can be implemented in just a few lines of code and applied universally to any deep network, e.g. CNNs and Vision Transformers, with minimal computational cost.
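For orientation, a vanilla MixUp-style sample mixer really is a few lines. The sketch below is a generic, hedged illustration of that baseline (not the authors' exact LUMix, which additionally models label uncertainty):

```python
def mixup(x1, y1, x2, y2, lam):
    """Blend two inputs and their one-hot labels with the same coefficient
    lam (vanilla MixUp). LUMix-style variants go further by modelling
    uncertainty in the label coefficient; this sketch keeps the plain form."""
    x = [lam * a + (1 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y1, y2)]
    return x, y

x, y = mixup([1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0], lam=0.75)
print(x, y)  # [0.75, 0.25] [0.75, 0.25]
```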
1 code implementation • 14 Oct 2022 • Ruifei He, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, Xiaojuan Qi
Recent text-to-image generation models have shown promising results in generating high-fidelity photo-realistic images.
no code implementations • 26 Sep 2022 • Botos Csaba, Adel Bibi, Yanwei Li, Philip Torr, Ser-Nam Lim
Deep learning models for vision tasks are trained on large datasets under the assumption that there exists a universal representation that can be used to make predictions for all samples.
1 code implementation • 25 Jul 2022 • Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip Torr
Since SegPGD can create more effective adversarial examples, the adversarial training with our SegPGD can boost the robustness of segmentation models.
no code implementations • 13 Jul 2022 • Ziyi Shen, Qianye Yang, Yuming Shen, Francesco Giganti, Vasilis Stavrinides, Richard Fan, Caroline Moore, Mirabela Rusu, Geoffrey Sonn, Philip Torr, Dean Barratt, Yipeng Hu
Image registration is useful for quantifying morphological changes in longitudinal MR images from prostate cancer patients.
no code implementations • 18 Apr 2022 • Menghan Wang, Yuchen Guo, Zhenqi Zhao, Guangzheng Hu, Yuming Shen, Mingming Gong, Philip Torr
To alleviate the influence of the annotation bias, we perform a momentum update to ensure a consistent item representation.
no code implementations • 18 Apr 2022 • Feihu Zhang, Vladlen Koltun, Philip Torr, René Ranftl, Stephan R. Richter
Semantic segmentation models struggle to generalize in the presence of domain shift.
no code implementations • 15 Mar 2022 • A. Tuan Nguyen, Ser Nam Lim, Philip Torr
To tackle this problem, a great deal of research has studied the training procedure of a network to improve its robustness.
no code implementations • 8 Mar 2022 • Chuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip Torr, Song Bai
Our network consists of an image encoder and a character-aware text encoder that extract visual and textual features, respectively, as well as a visual-textual decoder that models the interaction among textual and visual features for learning effective scene text representations.
Optical Character Recognition (OCR) +2
2 code implementations • 17 Feb 2022 • Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr
Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning.
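The entry notes that backpropagation (reverse mode) has remained the mainstay; forward-mode automatic differentiation is the classical alternative. A hedged, self-contained sketch using dual numbers (illustrative of forward mode in general, not necessarily this paper's exact algorithm):

```python
class Dual:
    """Dual number a + b*eps with eps^2 = 0: propagating b through
    arithmetic yields the exact derivative alongside the value
    (forward-mode automatic differentiation)."""
    def __init__(self, val, dot=0.0):
        self.val, self.dot = val, dot

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.dot + other.dot)
    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # Product rule: (a + a'eps)(b + b'eps) = ab + (ab' + a'b)eps
        return Dual(self.val * other.val,
                    self.val * other.dot + self.dot * other.val)
    __rmul__ = __mul__

def f(x):
    return x * x * x + 2 * x  # f(x) = x^3 + 2x, so f'(x) = 3x^2 + 2

x = Dual(2.0, 1.0)  # seed the input's derivative with 1
y = f(x)
print(y.val, y.dot)  # 12.0 14.0
```

No backward pass or stored computation graph is needed: one forward sweep gives a directional derivative, at the cost of one sweep per input direction.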
no code implementations • 23 Jan 2022 • Ming-Ming Cheng, Peng-Tao Jiang, Ling-Hao Han, Liang Wang, Philip Torr
The proposed framework can generate a deep hierarchy of strongly associated supporting evidence for the network decision, which provides insight into the decision-making process.
3 code implementations • 17 Jan 2022 • Shashwat Goel, Ameya Prabhu, Amartya Sanyal, Ser-Nam Lim, Philip Torr, Ponnurangam Kumaraguru
Machine Learning models face increased concerns regarding the storage of personal user data and adverse impacts of corrupted data like backdoors or systematic bias.
1 code implementation • CVPR 2022 • Yujun Shi, Kuangqi Zhou, Jian Liang, Zihang Jiang, Jiashi Feng, Philip Torr, Song Bai, Vincent Y. F. Tan
Specifically, we experimentally show that directly encouraging CIL Learner at the initial phase to output similar representations as the model jointly trained on all classes can greatly boost the CIL performance.
no code implementations • NeurIPS 2021 • Keyu Tian, Chen Lin, Ser Nam Lim, Wanli Ouyang, Puneet Dokania, Philip Torr
Automated data augmentation (ADA) techniques have played an important role in boosting the performance of deep models.
no code implementations • NeurIPS 2021 • Feihu Zhang, Philip Torr, Rene Ranftl, Stephan Richter
We present an approach to contrastive representation learning for semantic segmentation.
no code implementations • NeurIPS 2021 • Harkirat Singh Behl, M. Pawan Kumar, Philip Torr, Krishnamurthy Dvijotham
Recent progress in neural network verification has challenged the notion of a convex barrier, that is, an inherent weakness in the convex relaxation of the output of a neural network.
1 code implementation • CVPR 2022 • Zitong Yu, Yuming Shen, Jingang Shi, Hengshuang Zhao, Philip Torr, Guoying Zhao
Remote photoplethysmography (rPPG), which aims at measuring heart activities and physiological signals from facial video without any contact, has great potential in many applications (e.g., remote healthcare and affective computing).
no code implementations • British Machine Vision Conference 2021 • Zhao Yang, Yansong Tang, Luca Bertinetto, Hengshuang Zhao, Philip Torr
In this paper, we investigate the problem of video object segmentation from referring expressions (VOSRE).
Ranked #1 on Referring Expression Segmentation on J-HMDB (Precision@0.9 metric)
Optical Flow Estimation Referring Expression Segmentation +3
no code implementations • 22 Nov 2021 • Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip Torr
The high transferability achieved by our method shows that, in contrast to the observations in previous work, adversarial examples on a segmentation model can be easy to transfer to other segmentation models.
2 code implementations • CVPR 2022 • Jie-Neng Chen, Shuyang Sun, Ju He, Philip Torr, Alan Yuille, Song Bai
The confidence of the label will be larger if the corresponding input image is weighted higher by the attention map.
no code implementations • 29 Sep 2021 • Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip Torr, Puneet K. Dokania
We propose an extremely simple approach to regularize a single deterministic neural network to obtain improved accuracy and reliable uncertainty estimates.
no code implementations • 29 Sep 2021 • Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip Torr, Grégory Rogez, Puneet K. Dokania
In this work, we methodically revisit the role of noise and clipping in single-step adversarial training.
no code implementations • NeurIPS Workshop DLDE 2021 • Naeemullah Khan, Angira Sharma, Philip Torr, Ganesh Sundaramoorthi
ST-DNNs are deep networks formulated through partial differential equations (PDEs) so that they are defined on arbitrarily shaped regions.
1 code implementation • ICCV 2021 • Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, Dahua Lin
As a typical example, the Vision Transformer (ViT) directly applies a pure transformer architecture on image classification, by simply splitting images into tokens with a fixed length, and employing transformers to learn relations between these tokens.
no code implementations • 2 Aug 2021 • Botos Csaba, Xiaojuan Qi, Arslan Chaudhry, Puneet Dokania, Philip Torr
The key ingredients to our approach are -- (a) mapping the source to the target domain on pixel-level; (b) training a teacher network on the mapped source and the unannotated target domain using adversarial feature alignment; and (c) finally training a student network using the pseudo-labels obtained from the teacher.
2 code implementations • 29 Jul 2021 • Lu Qi, Jason Kuen, Yi Wang, Jiuxiang Gu, Hengshuang Zhao, Zhe Lin, Philip Torr, Jiaya Jia
By removing the need of class label prediction, the models trained for such task can focus more on improving segmentation quality.
1 code implementation • 17 Jul 2021 • Samuel Sokota, Christian Schroeder de Witt, Maximilian Igl, Luisa Zintgraf, Philip Torr, Martin Strohmeier, J. Zico Kolter, Shimon Whiteson, Jakob Foerster
We contribute a theoretically grounded approach to MCGs based on maximum entropy reinforcement learning and minimum entropy coupling that we call MEME.
1 code implementation • 16 Jul 2021 • Angira Sharma, Naeemullah Khan, Muhammad Mubashar, Ganesh Sundaramoorthi, Philip Torr
For low-fidelity training data (incorrect class labels), the class-agnostic segmentation loss outperforms the state-of-the-art methods on salient object detection datasets by staggering margins of around 50%.
2 code implementations • 13 Jul 2021 • Shuyang Sun, Xiaoyu Yue, Song Bai, Philip Torr
To model the representations of the two levels, we first encode the information from the whole into part vectors through an attention mechanism, then decode the global information within the part vectors back into the whole representation.
Ranked #313 on Image Classification on ImageNet
3 code implementations • 6 Jun 2021 • ShangHua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip Torr
In this work, we propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to help the research progress.
Ranked #1 on Unsupervised Semantic Segmentation on ImageNet-S-300
no code implementations • 1 Jan 2021 • Xiaogang Xu, Hengshuang Zhao, Philip Torr, Jiaya Jia
Specifically, compared with previous methods, we propose a more efficient pixel-level training constraint that eases the alignment of adversarial samples to clean samples, thereby clearly enhancing robustness on adversarial samples.
no code implementations • ICLR 2021 • Amartya Sanyal, Puneet K. Dokania, Varun Kanade, Philip Torr
We investigate two causes for adversarial vulnerability in deep neural networks: bad data and (poorly) trained models.
no code implementations • 1 Jan 2021 • Naeemullah Khan, Angira Sharma, Philip Torr, Ganesh Sundaramoorthi
We present Shape-Tailored Deep Neural Networks (ST-DNN).
no code implementations • 1 Jan 2021 • Roy Henha Eyono, Fabio Maria Carlucci, Pedro M Esperança, Binxin Ru, Philip Torr
State-of-the-art results in deep learning have been improving steadily, in good part due to the use of larger models.
24 code implementations • ICCV 2021 • Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip Torr, Vladlen Koltun
For example, on the challenging S3DIS dataset for large-scale semantic scene segmentation, the Point Transformer attains an mIoU of 70.4% on Area 5, outperforming the strongest prior model by 3.3 absolute percentage points and crossing the 70% mIoU threshold for the first time.
Ranked #3 on 3D Semantic Segmentation on STPLS3D
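The mIoU metric quoted above averages per-class intersection-over-union. A generic, hedged implementation from a confusion matrix (standard metric code, not the paper's evaluation harness):

```python
def miou(confusion):
    """Mean IoU from a square confusion matrix C, where C[i][j] counts
    points of true class i predicted as class j.
    Per class: IoU_c = TP_c / (TP_c + FP_c + FN_c)."""
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp
        fp = sum(confusion[r][c] for r in range(n)) - tp
        denom = tp + fp + fn
        if denom > 0:  # skip classes absent from both prediction and truth
            ious.append(tp / denom)
    return sum(ious) / len(ious)

# Toy 2-class confusion matrix (illustrative counts only).
conf = [[8, 2],
        [1, 9]]
print(round(miou(conf), 4))  # mean of 8/11 and 9/12
```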
no code implementations • NeurIPS 2020 • Arnab Ghosh, Harkirat Behl, Emilien Dupont, Philip Torr, Vinay Namboodiri
Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive.
no code implementations • 24 Nov 2020 • Shuyang Sun, Liang Chen, Gregory Slabaugh, Philip Torr
Some image restoration tasks like demosaicing require difficult training samples to learn effective models.
1 code implementation • 28 Oct 2020 • Angira Sharma, Naeemullah Khan, Ganesh Sundaramoorthi, Philip Torr
For low-fidelity training data (incorrect class label) class-agnostic segmentation loss outperforms the state-of-the-art methods on salient object detection datasets by staggering margins of around 50%.
no code implementations • 10 Oct 2020 • Thomas Tanay, Aivar Sootla, Matteo Maggioni, Puneet K. Dokania, Philip Torr, Ales Leonardis, Gregory Slabaugh
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-resolution.
5 code implementations • 16 Sep 2020 • Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe
Multi-Object Tracking (MOT) has been notoriously difficult to evaluate.
1 code implementation • ECCV 2020 • Carlo Biffi, Steven McDonagh, Philip Torr, Ales Leonardis, Sarah Parisot
Object detection has witnessed significant progress by relying on large, manually annotated datasets.
2 code implementations • 14 May 2020 • Christian Schroeder de Witt, Bradley Gram-Hansen, Nantas Nardelli, Andrew Gambardella, Rob Zinkov, Puneet Dokania, N. Siddharth, Ana Belen Espinosa-Gonzalez, Ara Darzi, Philip Torr, Atılım Güneş Baydin
The COVID-19 pandemic has highlighted the importance of in-silico epidemiological modelling in predicting the dynamics of infectious diseases to inform health policy and decision makers about suitable prevention and containment strategies.
no code implementations • 19 Feb 2020 • Arslan Chaudhry, Albert Gordo, Puneet K. Dokania, Philip Torr, David Lopez-Paz
In continual learning, the learner faces a stream of data whose distribution changes over time.
1 code implementation • CVPR 2020 • Zhengzhe Liu, Xiaojuan Qi, Philip Torr
In this paper, we conduct an empirical study on fake/real faces, and have two important observations: firstly, the texture of fake faces is substantially different from real ones; secondly, global texture statistics are more robust to image editing and transferable to fake faces from different GANs and datasets.
1 code implementation • ECCV 2020 • Feihu Zhang, Xiaojuan Qi, Ruigang Yang, Victor Prisacariu, Benjamin Wah, Philip Torr
State-of-the-art stereo matching networks have difficulties in generalizing to new unseen environments due to significant domain differences, such as color, illumination, contrast, and texture.
no code implementations • Approximate Inference (AABI) Symposium 2019 • Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Saeid Naderiparizi, Adam Scibior, Andreas Munk, Frank Wood, Mehrdad Ghadiri, Philip Torr, Yee Whye Teh, Atilim Gunes Baydin, Tom Rainforth
We introduce two approaches for conducting efficient Bayesian inference in stochastic simulators containing nested stochastic sub-procedures, i.e., internal procedures for which the density cannot be calculated directly, such as rejection sampling loops.
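A rejection sampling loop, the canonical nested sub-procedure mentioned here, resamples until a predicate holds, which is precisely why its density has no direct closed form. A hedged toy example (illustrative, not one of the paper's simulators):

```python
import random

random.seed(0)

def truncated_gauss(lo, hi):
    """Sample a standard normal restricted to [lo, hi] by rejection:
    keep drawing proposals until one lands inside the interval. The
    open-ended loop is what makes the resulting density hard to
    evaluate directly."""
    while True:
        x = random.gauss(0.0, 1.0)
        if lo <= x <= hi:
            return x

samples = [truncated_gauss(-1.0, 1.0) for _ in range(1000)]
print(min(samples), max(samples))  # both inside [-1, 1]
```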
no code implementations • 25 Sep 2019 • Saumya Jetley, Tommaso Cavallari, Philip Torr, Stuart Golodetz
Deep CNNs have achieved state-of-the-art performance for numerous machine learning and computer vision tasks in recent years, but as they have become increasingly deep, the number of parameters they use has also increased, making them hard to deploy in memory-constrained environments and difficult to interpret.
no code implementations • 25 Sep 2019 • Jishnu Mukhoti, Viveka Kulharia, Amartya Sanyal, Stuart Golodetz, Philip Torr, Puneet Dokania
When combined with temperature scaling, focal loss, whilst preserving accuracy and yielding state-of-the-art calibrated models, also preserves the confidence of the model's correct predictions, which is extremely desirable for downstream tasks.
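For reference, the focal loss in question down-weights well-classified examples by a (1-p)^γ factor. A minimal sketch of the standard scalar form (generic definition, hedged; not the authors' training code):

```python
import math

def focal_loss(p_true, gamma=2.0):
    """Focal loss on the probability assigned to the true class:
    FL(p) = -(1 - p)^gamma * log(p). With gamma = 0 it reduces to
    ordinary cross-entropy; larger gamma down-weights easy examples."""
    return -((1.0 - p_true) ** gamma) * math.log(p_true)

# A confident correct prediction contributes far less than a poor one,
# which focuses training on hard examples.
print(round(focal_loss(0.9), 4), round(focal_loss(0.3), 4))
```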
3 code implementations • 8 Jul 2019 • Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe, Mingfei Ma, Xiaohui Zhao, Philip Torr, Victor Lee, Kyle Cranmer, Prabhat, Frank Wood
Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models.
no code implementations • 20 Jun 2019 • Tommaso Cavallari, Luca Bertinetto, Jishnu Mukhoti, Philip Torr, Stuart Golodetz
Many applications require a camera to be relocalised online, without expensive offline training on the target scene.
32 code implementations • 2 Apr 2019 • Shang-Hua Gao, Ming-Ming Cheng, Kai Zhao, Xin-Yu Zhang, Ming-Hsuan Yang, Philip Torr
We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely-used datasets, e.g., CIFAR-100 and ImageNet.
Ranked #2 on Image Classification on GasHisSDB
3 code implementations • NeurIPS 2019 • Atılım Güneş Baydin, Lukas Heinrich, Wahid Bhimji, Lei Shao, Saeid Naderiparizi, Andreas Munk, Jialin Liu, Bradley Gram-Hansen, Gilles Louppe, Lawrence Meadows, Philip Torr, Victor Lee, Prabhat, Kyle Cranmer, Frank Wood
We present a novel probabilistic programming framework that couples directly to existing large-scale simulators through a cross-platform probabilistic execution protocol, which allows general-purpose inference engines to record and control random number draws within simulators in a language-agnostic way.
no code implementations • 17 Apr 2018 • Rodrigo de Bem, Arnab Ghosh, Thalaiyasingam Ajanthan, Ondrej Miksik, Adnane Boukhayma, N. Siddharth, Philip Torr
However, the latent space learned by such approaches is typically not interpretable, resulting in less flexibility.
no code implementations • ECCV 2018 • Jack Valmadre, Luca Bertinetto, João F. Henriques, Ran Tao, Andrea Vedaldi, Arnold Smeulders, Philip Torr, Efstratios Gavves
We introduce the OxUvA dataset and benchmark for evaluating single-object tracking algorithms.
no code implementations • 24 Jan 2017 • Måns Larsson, Anurag Arnab, Fredrik Kahl, Shuai Zheng, Philip Torr
It is empirically demonstrated that such learned potentials can improve segmentation accuracy and that certain label class interactions are indeed better modelled by a non-Gaussian potential.
no code implementations • 7 Dec 2016 • Qibin Hou, Puneet Kumar Dokania, Daniela Massiceti, Yunchao Wei, Ming-Ming Cheng, Philip Torr
We focus on the following three aspects of EM: (i) initialization; (ii) latent posterior estimation (E-step) and (iii) the parameter update (M-step).
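The three EM stages listed above can be illustrated on a standard problem. A minimal sketch of EM for a two-component 1D Gaussian mixture (a generic textbook example, not the paper's segmentation model):

```python
import numpy as np

def em_gmm(x, n_iter=50):
    """EM for a two-component 1D Gaussian mixture, showing the three
    stages: (i) initialization, (ii) E-step, (iii) M-step."""
    # (i) initialization: means at the data extremes, unit variances
    mu = np.array([x.min(), x.max()], dtype=float)
    var = np.ones(2)
    pi = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # (ii) E-step: posterior responsibility of each component
        lik = pi * np.exp(-0.5 * (x[:, None] - mu) ** 2 / var) \
                 / np.sqrt(2 * np.pi * var)
        r = lik / lik.sum(axis=1, keepdims=True)
        # (iii) M-step: re-estimate parameters from responsibilities
        nk = r.sum(axis=0)
        mu = (r * x[:, None]).sum(axis=0) / nk
        var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / nk
        pi = nk / len(x)
    return mu, var, pi

rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(-3, 1, 500), rng.normal(3, 1, 500)])
mu, var, pi = em_gmm(x)  # means converge near the true cluster centres
```

The choice of initialization in step (i) is exactly the kind of design decision the abstract flags: EM only finds a local optimum, so where the means start matters.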
4 code implementations • ICCV 2017 • Gurkirt Singh, Suman Saha, Michael Sapienza, Philip Torr, Fabio Cuzzolin
To the best of our knowledge, ours is the first real-time (up to 40 fps) system able to perform online spatio-temporal (S/T) action localisation and early action prediction on the untrimmed videos of UCF101-24.
4 code implementations • CVPR 2017 • Qibin Hou, Ming-Ming Cheng, Xiao-Wei Hu, Ali Borji, Zhuowen Tu, Philip Torr
Recent progress on saliency detection is substantial, benefiting mostly from the explosive development of Convolutional Neural Networks (CNNs).
no code implementations • 18 Mar 2016 • Julien Valentin, Angela Dai, Matthias Nießner, Pushmeet Kohli, Philip Torr, Shahram Izadi, Cem Keskin
We demonstrate the efficacy of our approach on the challenging problem of RGB Camera Relocalization.
no code implementations • 10 Jan 2016 • Anurag Arnab, Michael Sapienza, Stuart Golodetz, Julien Valentin, Ondrej Miksik, Shahram Izadi, Philip Torr
It is not always possible to recognise objects and infer material properties for a scene from visual cues alone, since objects can look visually similar whilst being made of very different materials.
3 code implementations • CVPR 2016 • Luca Bertinetto, Jack Valmadre, Stuart Golodetz, Ondrej Miksik, Philip Torr
Correlation Filter-based trackers have recently achieved excellent performance, showing great robustness to challenging situations exhibiting motion blur and illumination changes.
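The correlation-filter idea underlying such trackers fits in a few lines: learn a filter whose response to the template is a centred Gaussian, then read off the target's shift from the response peak in a search window. A generic MOSSE-style NumPy sketch (not the paper's actual tracker):

```python
import numpy as np

def train_filter(template, sigma=2.0, lam=1e-2):
    """Ridge regression in the Fourier domain: find H such that
    correlating H with the template yields a Gaussian peaked at the
    centre. lam regularizes against division by small spectra."""
    h, w = template.shape
    ys, xs = np.mgrid[0:h, 0:w]
    g = np.exp(-((ys - h // 2) ** 2 + (xs - w // 2) ** 2) / (2 * sigma ** 2))
    F = np.fft.fft2(template)
    G = np.fft.fft2(np.fft.ifftshift(g))          # desired response spectrum
    return np.conj(F) * G / (np.conj(F) * F + lam)  # closed-form solution

def respond(H, search):
    """Correlate the filter with a search window via the FFT;
    the peak location gives the target's (circular) shift."""
    return np.real(np.fft.ifft2(H * np.fft.fft2(search)))

rng = np.random.default_rng(0)
patch = rng.standard_normal((64, 64))
H = train_filter(patch)
r = respond(H, np.roll(patch, (5, 7), axis=(0, 1)))  # shift the target
dy, dx = np.unravel_index(np.argmax(r), r.shape)
# the response peak moves with the target, so (dy, dx) tracks the shift
```

Because training and detection are both element-wise in the Fourier domain, the whole update runs in O(n log n) per frame, which is what makes these trackers fast enough for real-time use.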
no code implementations • 3 Dec 2015 • Saumya Jetley, Bernardino Romera-Paredes, Sadeep Jayasumana, Philip Torr
Recent works on zero-shot learning make use of side information such as visual attributes or natural language semantics to define the relations between output visual classes and then use these relationships to draw inference on new unseen classes at test time.
1 code implementation • 25 Nov 2015 • Anurag Arnab, Sadeep Jayasumana, Shuai Zheng, Philip Torr
Recent deep learning approaches have incorporated CRFs into Convolutional Neural Networks (CNNs), with some even training the CRF end-to-end with the rest of the network.
no code implementations • CVPR 2014 • Ming-Ming Cheng, Ziming Zhang, Wen-Yan Lin, Philip Torr
Training a generic objectness measure to produce a small set of candidate object windows has been shown to speed up the classical sliding-window object detection paradigm.
no code implementations • 20 Apr 2014 • Peng Wang, Chunhua Shen, Anton Van Den Hengel, Philip Torr
We propose a Branch-and-Cut (B&C) method for solving general MAP-MRF inference problems.
no code implementations • NeurIPS 2013 • Vibhav Vineet, Carsten Rother, Philip Torr
Many methods have been proposed to recover the intrinsic scene properties such as shape, reflectance and illumination from a single image.
no code implementations • 16 Oct 2013 • Ming-Ming Cheng, Shuai Zheng, Wen-Yan Lin, Jonathan Warrell, Vibhav Vineet, Paul Sturgess, Nigel Crook, Niloy Mitra, Philip Torr
This allows us to formulate the image parsing problem as one of jointly estimating per-pixel object and attribute labels from a set of training images.
no code implementations • NeurIPS 2011 • Ziming Zhang, Lubor Ladicky, Philip Torr, Amir Saffari
It provides a set of anchor points which form a local coordinate system, such that each data point on the manifold can be approximated by a linear combination of its anchor points, and the linear weights become the local coordinate coding.
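The anchor-point approximation described here can be sketched as a small constrained least-squares problem: express a query point as an affine combination of its nearest anchors. A generic locality-constrained coding sketch (function names and the toy data are illustrative):

```python
import numpy as np

def local_coordinates(x, anchors, k=3):
    """Express x as an affine combination of its k nearest anchors:
    solve  min_w ||x - C^T w||^2  s.t.  sum(w) = 1."""
    d = np.linalg.norm(anchors - x, axis=1)
    idx = np.argsort(d)[:k]               # k nearest anchor points
    C = anchors[idx] - x                  # centre anchors on the query
    G = C @ C.T                           # local Gram matrix
    G += 1e-8 * np.trace(G) * np.eye(k)   # regularize for stability
    w = np.linalg.solve(G, np.ones(k))
    w /= w.sum()                          # enforce the sum-to-one constraint
    coords = np.zeros(len(anchors))
    coords[idx] = w                       # sparse local coordinate code
    return coords

anchors = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [2.0, 2.0]])
w = local_coordinates(np.array([0.3, 0.3]), anchors)
# anchors.T @ w reconstructs the query point from its local anchors
```

The resulting weight vector is sparse (non-zero only on the chosen anchors) and sums to one, so it can be used directly as a feature code for a linear classifier, which is how such codings are typically deployed.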
no code implementations • NeurIPS 2008 • Philip Torr, M. P. Kumar
Compared to previous approaches based on the LP relaxation, e.g., interior-point algorithms or tree-reweighted message passing (TRW), our method is faster as it uses only the efficient st-mincut algorithm in its design.