Search Results for author: Nick Barnes

Found 77 papers, 42 papers with code

LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image

no code implementations24 May 2024 Ruikai Cui, Xibin Song, Weixuan Sun, Senbo Wang, Weizhe Liu, Shenzhou Chen, Taizhang Shang, Yang Li, Nick Barnes, Hongdong Li, Pan Ji

Large Reconstruction Models have made significant strides in the realm of automated 3D content generation from single or multiple input images.

3D Reconstruction

Learning Gaussian Representation for Eye Fixation Prediction

no code implementations21 Mar 2024 Peipei Song, Jing Zhang, Piotr Koniusz, Nick Barnes

Existing eye fixation prediction methods perform the mapping from input images to the corresponding dense fixation maps generated from raw fixation points.

Self-Calibrating Vicinal Risk Minimisation for Model Calibration

no code implementations CVPR 2024 Jiawei Liu, Changkun Ye, Ruikai Cui, Nick Barnes

Model calibration measuring the alignment between the prediction accuracy and model confidence is an important metric reflecting model trustworthiness.

Binary Classification

All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation

1 code implementation8 Aug 2023 Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, Nick Barnes

Given a pair of augmented views, our approach regularizes the activation intensities between a pair of augmented views, while also ensuring that the affinity across regions within each view remains consistent.

Object Localization Weakly supervised Semantic Segmentation +1

Model Calibration in Dense Classification with Adaptive Label Perturbation

1 code implementation ICCV 2023 Jiawei Liu, Changkun Ye, Shan Wang, Ruikai Cui, Jing Zhang, Kaihao Zhang, Nick Barnes

To improve model calibration, we propose Adaptive Stochastic Label Perturbation (ASLP) which learns a unique label perturbation level for each training image.

Binary Classification Classification +1

Measuring and Modeling Uncertainty Degree for Monocular Depth Estimation

no code implementations19 Jul 2023 Mochu Xiang, Jing Zhang, Nick Barnes, Yuchao Dai

Effectively measuring and modeling the reliability of a trained model is essential to the real-world deployment of monocular depth estimation (MDE) models.

Monocular Depth Estimation

A Comprehensive Overview of Large Language Models

1 code implementation12 Jul 2023 Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond.

Benchmarking

Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

1 code implementation7 Jul 2023 Yunqiu Lv, Jing Zhang, Nick Barnes, Yuchao Dai

Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation.

Contrastive Learning Image Reconstruction +4

Rethinking Polyp Segmentation from an Out-of-Distribution Perspective

1 code implementation13 Jun 2023 Ge-Peng Ji, Jing Zhang, Dylan Campbell, Huan Xiong, Nick Barnes

Unlike existing fully-supervised approaches, we rethink colorectal polyp segmentation from an out-of-distribution perspective with a simple but effective self-supervised learning approach.

Segmentation Self-Supervised Learning

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

1 code implementation CVPR 2023 Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, Nick Barnes

Based on this observation, we propose a new learning strategy named False Negative Aware Contrastive (FNAC) to mitigate the problem of misleading the training with such false negative samples.

Contrastive Learning

Transmission-Guided Bayesian Generative Model for Smoke Segmentation

1 code implementation2 Mar 2023 Siyuan Yan, Jing Zhang, Nick Barnes

To effectively model the two types of uncertainty, we introduce a Bayesian generative model to simultaneously estimate the posterior distribution of model parameters and its predictions.

Image Dehazing Image Segmentation +2

PointCaM: Cut-and-Mix for Open-Set Point Cloud Learning

1 code implementation5 Dec 2022 Jie Hong, Shi Qiu, Weihao Li, Saeed Anwar, Mehrtash Harandi, Nick Barnes, Lars Petersson

Specifically, we use the Unknown-Point Simulator to simulate out-of-distribution data in the training stage by manipulating the geometric context of partial known data.

Energy-Based Residual Latent Transport for Unsupervised Point Cloud Completion

1 code implementation13 Nov 2022 Ruikai Cui, Shi Qiu, Saeed Anwar, Jing Zhang, Nick Barnes

Unsupervised point cloud completion aims to infer the whole geometry of a partial object observation without requiring partial-complete correspondence.

Decoder Point Cloud Completion

The Devil in Linear Transformer

1 code implementation19 Oct 2022 Zhen Qin, Xiaodong Han, Weixuan Sun, Dongxu Li, Lingpeng Kong, Nick Barnes, Yiran Zhong

In this paper, we examine existing kernel-based linear transformers and identify two key issues that lead to such performance gaps: 1) unbounded gradients in the attention computation adversely impact the convergence of linear transformer models; 2) attention dilution which trivially distributes attention scores over long sequences while neglecting neighbouring structures.

Language Modelling Text Classification

Generalised Co-Salient Object Detection

no code implementations20 Aug 2022 Jiawei Liu, Jing Zhang, Ruikai Cui, Kaihao Zhang, Weihao Li, Nick Barnes

We propose a new setting that relaxes an assumption in the conventional Co-Salient Object Detection (CoSOD) setting by allowing the presence of "noisy images" which do not show the shared co-salient object.

Co-Salient Object Detection Object +3

Vicinity Vision Transformer

1 code implementation21 Jun 2022 Weixuan Sun, Zhen Qin, Hui Deng, Jianyuan Wang, Yi Zhang, Kaihao Zhang, Nick Barnes, Stan Birchfield, Lingpeng Kong, Yiran Zhong

Based on this observation, we present a Vicinity Attention that introduces a locality bias to vision transformers with linear complexity.

Image Classification

Towards Deeper Understanding of Camouflaged Object Detection

1 code implementation23 May 2022 Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first triple-task learning framework to simultaneously localize, segment, and rank camouflaged objects, indicating the conspicuousness level of camouflage.

Object object-detection +1

An Energy-Based Prior for Generative Saliency

1 code implementation19 Apr 2022 Jing Zhang, Jianwen Xie, Nick Barnes, Ping Li

We propose a novel generative saliency prediction framework that adopts an informative energy-based model as a prior distribution.

object-detection RGB-D Salient Object Detection +3

Towards Open-Set Object Detection and Discovery

no code implementations12 Apr 2022 Jiyang Zheng, Weihao Li, Jie Hong, Lars Petersson, Nick Barnes

This new task aims to extend the ability of open-set object detectors to further discover the categories of unknown objects based on their visual appearance without human effort.

Incremental Learning Object +2

Semi-supervised Salient Object Detection with Effective Confidence Estimation

no code implementations28 Dec 2021 Jiawei Liu, Jing Zhang, Nick Barnes

We study semi-supervised salient object detection, with access to a small number of labeled samples and a large number of unlabeled samples.

Object object-detection +3

Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

no code implementations NeurIPS 2021 Jing Zhang, Jianwen Xie, Nick Barnes, Ping Li

In this paper, we take a step further by proposing a novel generative vision transformer with latent variables following an informative energy-based prior for salient object detection.

object-detection RGB-D Salient Object Detection +3

Learning To Segment Dominant Object Motion From Watching Videos

no code implementations28 Nov 2021 Sahir Shrestha, Mohammad Ali Armin, Hongdong Li, Nick Barnes

Existing deep learning based unsupervised video object segmentation methods still rely on ground-truth segmentation masks to train.

Object Optical Flow Estimation +4

PU-Transformer: Point Cloud Upsampling Transformer

2 code implementations24 Nov 2021 Shi Qiu, Saeed Anwar, Nick Barnes

Given the rapid development of 3D scanners, point clouds are becoming popular in AI-driven machines.

point cloud upsampling

Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model

no code implementations22 Nov 2021 Jing Zhang, Yuchao Dai, Mehrtash Harandi, Yiran Zhong, Nick Barnes, Richard Hartley

Uncertainty estimation has been extensively studied in recent literature, which can usually be classified as aleatoric uncertainty and epistemic uncertainty.

Attribute object-detection +1

Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation

1 code implementation27 Oct 2021 Weixuan Sun, Jing Zhang, Nick Barnes

To solve this, most existing approaches follow a multi-training pipeline to refine CAMs for better pseudo-labels, which includes: 1) re-training the classification model to generate CAMs; 2) post-processing CAMs to obtain pseudo labels; and 3) training a semantic segmentation model with the obtained pseudo labels.

Segmentation Weakly supervised Semantic Segmentation +1

Dense Uncertainty Estimation

1 code implementation13 Oct 2021 Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.

Decision Making

RGB-D Saliency Detection via Cascaded Mutual Information Minimization

1 code implementation ICCV 2021 Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao

In this paper, we introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data.

Saliency Detection Thermal Image Segmentation

PnP-3D: A Plug-and-Play for 3D Point Clouds

1 code implementation16 Aug 2021 Shi Qiu, Saeed Anwar, Nick Barnes

With the help of the deep learning paradigm, many point cloud networks have been invented for visual analysis.

object-detection Object Detection +1

Energy-Based Generative Cooperative Saliency Prediction

1 code implementation25 Jun 2021 Jing Zhang, Jianwen Xie, Zilong Zheng, Nick Barnes

In this paper, to model the uncertainty of visual saliency, we study the saliency prediction problem from the perspective of generative models by learning a conditional probability distribution over the saliency map given an input image, and treating the saliency prediction as a sampling process from the learned distribution.

Saliency Prediction

Confidence-Aware Learning for Camouflaged Object Detection

1 code implementation22 Jun 2021 Jiawei Liu, Jing Zhang, Nick Barnes

Then, we concatenate it with the input image and feed it to the confidence estimation network to produce an one channel confidence map. We generate dynamic supervision for the confidence estimation network, representing the agreement of camouflage prediction with the ground truth camouflage map.

Object object-detection +1

Generative Transformer for Accurate and Reliable Salient Object Detection

2 code implementations20 Apr 2021 Yuxin Mao, Jing Zhang, Zhexiong Wan, Yuchao Dai, Aixuan Li, Yunqiu Lv, Xinyu Tian, Deng-Ping Fan, Nick Barnes

For the former, we apply transformer to a deterministic model, and explain that the effective structure modeling and global context modeling abilities lead to its superior performance compared with the CNN based frameworks.

Attribute Camouflaged Object Segmentation +8

Learning structure-aware semantic segmentation with image-level supervision

1 code implementation15 Apr 2021 Jiawei Liu, Jing Zhang, Yicong Hong, Nick Barnes

Within this pipeline, the class activation map (CAM) is obtained and further processed to serve as a pseudo label to train the semantic segmentation model in a fully-supervised manner.

Boundary Detection Common Sense Reasoning +4

Weakly Supervised Video Salient Object Detection

1 code implementation CVPR 2021 Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han

Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain.

Object object-detection +4

Simultaneously Localize, Segment and Rank the Camouflaged Objects

1 code implementation CVPR 2021 Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first ranking based COD network (Rank-Net) to simultaneously localize, segment and rank camouflaged objects.

object-detection Object Detection

Recursive Training for Zero-Shot Semantic Segmentation

no code implementations26 Feb 2021 Ce Wang, Moshiur Farazi, Nick Barnes

We propose a recursive training scheme to supervise the retraining of a semantic segmentation model for a zero-shot setting using a pseudo-feature representation.

Segmentation Semantic Segmentation +1

Robust normalizing flows using Bernstein-type polynomials

no code implementations6 Feb 2021 Sameera Ramasinghe, Kasun Fernando, Salman Khan, Nick Barnes

Modeling real-world distributions can often be challenging due to sample data that are subjected to perturbations, e. g., instrumentation errors, or added random noise.

Vocal Bursts Type Prediction

Uncertainty-Aware Deep Calibrated Salient Object Detection

no code implementations10 Dec 2020 Jing Zhang, Yuchao Dai, Xin Yu, Mehrtash Harandi, Nick Barnes, Richard Hartley

Existing deep neural network based salient object detection (SOD) methods mainly focus on pursuing high network accuracy.

Object object-detection +2

3D Guided Weakly Supervised Semantic Segmentation

no code implementations1 Dec 2020 Weixuan Sun, Jing Zhang, Nick Barnes

In this paper, we propose a weakly supervised 2D semantic segmentation model by incorporating sparse bounding box labels with available 3D information, which is much easier to obtain with advanced sensors.

2D Semantic Segmentation Segmentation +2

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

1 code implementation NeurIPS 2021 Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, Stephen Gould

Conditional GANs (cGAN), in their rudimentary form, suffer from critical drawbacks such as the lack of diversity in generated outputs and distortion between the latent and output manifolds.

Image-to-Image Translation Translation

Conditional Generative Modeling via Learning the Latent Space

no code implementations ICLR 2021 Sameera Ramasinghe, Kanchana Ranasinghe, Salman Khan, Nick Barnes, Stephen Gould

Although deep learning has achieved appealing results on several machine learning tasks, most of the models are deterministic at inference, limiting their application to single-modal settings.

Attention Guided Semantic Relationship Parsing for Visual Question Answering

no code implementations5 Oct 2020 Moshiur Farazi, Salman Khan, Nick Barnes

Humans explain inter-object relationships with semantic labels that demonstrate a high-level understanding required to perform complex Vision-Language tasks such as Visual Question Answering (VQA).

Object Question Answering +1

Uncertainty Inspired RGB-D Saliency Detection

4 code implementations7 Sep 2020 Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Saleh, Sadegh Aliakbarian, Nick Barnes

Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution.

Decoder RGB-D Salient Object Detection +2

Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection

no code implementations ECCV 2020 Jing Zhang, Jianwen Xie, Nick Barnes

The proposed model consists of two sub-models parameterized by neural networks: (1) a saliency predictor that maps input images to clean saliency maps, and (2) a noise generator, which is a latent variable model that produces noises from Gaussian latent vectors.

Decoder Saliency Detection

Attention Based Real Image Restoration

no code implementations26 Apr 2020 Saeed Anwar, Nick Barnes, Lars Petersson

Furthermore, the evaluation in terms of quantitative metrics and visual quality for four restoration tasks i. e. Denoising, Super-resolution, Raindrop Removal, and JPEG Compression on 11 real degraded datasets against more than 30 state-of-the-art algorithms demonstrate the superiority of our R$^2$Net.

Denoising Image Restoration +2

A Systematic Evaluation: Fine-Grained CNN vs. Traditional CNN Classifiers

1 code implementation24 Mar 2020 Saeed Anwar, Nick Barnes, Lars Petersson

In this work, we investigate the performance of the landmark general CNN classifiers, which presented top-notch results on large scale classification datasets, on the fine-grained datasets, and compare it against state-of-the-art fine-grained classifiers.

General Classification

Reducing the Sim-to-Real Gap for Event Cameras

1 code implementation ECCV 2020 Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony

We present strategies for improving training data for event based CNNs that result in 20-40% boost in performance of existing state-of-the-art (SOTA) video reconstruction networks retrained with our method, and up to 15% for optic flow networks.

Event-Based Video Reconstruction Video Reconstruction

Any-Shot Object Detection

no code implementations16 Mar 2020 Shafin Rahman, Salman Khan, Nick Barnes, Fahad Shahbaz Khan

Any-shot detection offers unique challenges compared to conventional novel object detection such as, a high imbalance between unseen, few-shot and seen object classes, susceptibility to forget base-training while learning novel classes and distinguishing novel classes from the background.

Novel Object Detection Object +2

Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models

no code implementations20 Jan 2020 Moshiur R. Farazi, Salman H. Khan, Nick Barnes

However, modelling the visual and semantic features in a high dimensional (joint embedding) space is computationally expensive, and more complex models often result in trivial improvements in the VQA accuracy.

Question Answering Visual Question Answering

Spectral-GANs for High-Resolution 3D Point-cloud Generation

1 code implementation4 Dec 2019 Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

Point-clouds are a popular choice for vision and graphics tasks due to their accurate shape description and direct acquisition from range-scanners.

Generative Adversarial Network Point Cloud Generation +1

Representation Learning on Unit Ball with 3D Roto-Translational Equivariance

no code implementations30 Nov 2019 Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

In this work, we propose a novel `\emph{volumetric convolution}' operation that can effectively model and convolve arbitrary functions in $\mathbb{B}^3$.

3D Object Recognition Representation Learning

Question-Agnostic Attention for Visual Question Answering

no code implementations9 Aug 2019 Moshiur R. Farazi, Salman H. Khan, Nick Barnes

Visual Question Answering (VQA) models employ attention mechanisms to discover image locations that are most relevant for answering a specific question.

Question Answering Visual Question Answering

Densely Residual Laplacian Super-Resolution

1 code implementation28 Jun 2019 Saeed Anwar, Nick Barnes

Super-Resolution convolutional neural networks have recently demonstrated high-quality restoration for single images.

Image Super-Resolution

Unsupervised Primitive Discovery for Improved 3D Generative Modeling

no code implementations CVPR 2019 Salman H. Khan, Yulan Guo, Munawar Hayat, Nick Barnes

Using the primitive parts for shapes as attributes, a parameterized 3D representation is modeled in the first stage.

3D Shape Generation

CED: Color Event Camera Dataset

no code implementations24 Apr 2019 Cedric Scheerlinck, Henri Rebecq, Timo Stoffregen, Nick Barnes, Robert Mahony, Davide Scaramuzza

Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'.

Event-based vision Image Reconstruction

A Deep Journey into Super-resolution: A survey

2 code implementations16 Apr 2019 Saeed Anwar, Salman Khan, Nick Barnes

Deep convolutional networks based super-resolution is a fast-growing field with numerous practical applications.

Image Super-Resolution

Real Image Denoising with Feature Attention

3 code implementations ICCV 2019 Saeed Anwar, Nick Barnes

Deep convolutional neural networks perform better on images containing spatially invariant noise (synthetic noise); however, their performance is limited on real-noisy photographs and requires multiple stage network modeling.

Color Image Denoising Image Denoising

Asynchronous Spatial Image Convolutions for Event Cameras

no code implementations2 Dec 2018 Cedric Scheerlinck, Nick Barnes, Robert Mahony

In this paper, we propose a method to compute the convolution of a linear spatial kernel with the output of an event camera.

Polarity Loss for Zero-shot Object Detection

3 code implementations22 Nov 2018 Shafin Rahman, Salman Khan, Nick Barnes

This setting gives rise to the need for correct alignment between visual and semantic concepts, so that the unseen objects can be identified using only their semantic attributes.

Generalized Zero-Shot Object Detection Metric Learning +4

Continuous-time Intensity Estimation Using Event Cameras

no code implementations1 Nov 2018 Cedric Scheerlinck, Nick Barnes, Robert Mahony

Event cameras provide asynchronous, data-driven measurements of local temporal contrast over a large dynamic range with extremely high temporal resolution.

Adversarial Training of Variational Auto-encoders for High Fidelity Image Generation

1 code implementation27 Apr 2018 Salman H. Khan, Munawar Hayat, Nick Barnes

Our model simultaneously learns to match the data, reconstruction loss and the latent distributions of real and fake images to improve the quality of generated samples.

Image Generation Vocal Bursts Intensity Prediction

Deep Texture and Structure Aware Filtering Network for Image Smoothing

no code implementations ECCV 2018 Kaiyue Lu, ShaoDi You, Nick Barnes

Image smoothing is a fundamental task in computer vision, that aims to retain salient structures and remove insignificant textures.

image smoothing

Learning RGB-D Salient Object Detection using background enclosure, depth contrast, and top-down features

no code implementations10 May 2017 Riku Shigematsu, David Feng, ShaoDi You, Nick Barnes

Here we propose a novel deep CNN architecture for RGB-D salient object detection that exploits high-level, mid-level, and low level features.

Object object-detection +3

Perceptually Consistent Color-to-Gray Image Conversion

no code implementations6 May 2016 Shaodi You, Nick Barnes, Janine Walker

In this paper, we propose a color to grayscale image conversion algorithm (C2G) that aims to preserve the perceptual properties of the color image as much as possible.

Cannot find the paper you are looking for? You can Submit a new open access paper.