Search Results for author: Guosheng Hu

Found 31 papers, 16 papers with code

Reducing Distributional Uncertainty by Mutual Information Maximisation and Transferable Feature Learning

no code implementations • ECCV 2020 • Jian Gao, Yang Hua, Guosheng Hu, Chi Wang, Neil M. Robertson

Distributional uncertainty exists broadly in many real-world applications, one of which in the form of domain discrepancy.

Domain Adaptation

Paper
Add Code

Adaptive Variance Based Label Distribution Learning For Facial Age Estimation

no code implementations • ECCV 2020 • Xin Wen, Biying Li, Haiyun Guo, Zhiwei Liu, Guosheng Hu, Ming Tang, Jinqiao Wang

Some existing methods adopt distribution learning to tackle this issue by exploiting the semantic correlation between age labels.

Ranked #6 on Age Estimation on MORPH album2 (Caucasian)

Age Estimation Meta-Learning +1

Paper
Add Code

Object Pose Estimation via the Aggregation of Diffusion Features

1 code implementation • 27 Mar 2024 • Tianfu Wang, Guosheng Hu, Hongguang Wang

To achieve this, we propose three distinct architectures that can effectively capture and aggregate diffusion features of different granularity, greatly improving the generalizability of object pose estimation.

Pose Estimation Scene Understanding

Paper
Code

Gradient-Guided Modality Decoupling for Missing-Modality Robustness

no code implementations • 26 Feb 2024 • Hao Wang, Shengda Luo, Guosheng Hu, JianGuo Zhang

In aid of this indicator, we present a novel Gradient-guided Modality Decoupling (GMD) method to decouple the dependency on dominating modalities.

Sentiment Analysis

Paper
Add Code

TDViT: Temporal Dilated Video Transformer for Dense Video Tasks

1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

Deep video models, for example, 3D CNNs or video transformers, have achieved promising performance on sparse video tasks, i. e., predicting one result per video.

Instance Segmentation object-detection +3

Paper
Code

Efficient One-stage Video Object Detection by Exploiting Temporal Consistency

1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

Based on the analysis, we present a simple yet efficient framework to address the computational bottlenecks and achieve efficient one-stage VOD by exploiting the temporal consistency in video frames.

object-detection Video Object Detection

Paper
Code

GPT4Battery: An LLM-driven Framework for Adaptive State of Health Estimation of Raw Li-ion Batteries

no code implementations • 30 Jan 2024 • Yuyuan Feng, Guosheng Hu, Zhihong Zhang

State of health (SOH) is a crucial indicator for assessing the degradation level of batteries that cannot be measured directly but requires estimation.

energy management Language Modelling +1

Paper
Add Code

MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

1 code implementation • 18 Jan 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

However, we argue that these memory structures are not efficient or sufficient because of two implied operations: (1) concatenating all features in memory for enhancement, leading to a heavy computational cost; (2) frame-wise memory updating, preventing the memory from capturing more temporal information.

object-detection Video Object Detection

Paper
Code

Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization

1 code implementation • IEEE Transactions on Multimedia 2023 • Tianli Sun, Haonan Chen, Guosheng Hu, Lianghua He, Cairong Zhao

In addition, we demonstrate the utilization of visualization result in three ways: (1) We visualize attention with respect to connectionist temporal classification (CTC) loss to train an ASR model with adversarial attention erasing regularization, which effectively decreases the word error rate (WER) of the model and improves its generalization capability.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

ISTVT: Interpretable Spatial-Temporal Video Transformer for Deepfake Detection

1 code implementation • IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2023 • Cairong Zhao, Chutian Wang, Guosheng Hu, Haonan Chen, Chun Liu, Jinhui Tang

To address these two challenges, in this paper, we propose an Interpretable Spatial-Temporal Video Transformer (ISTVT), which consists of a novel decomposed spatial-temporal self-attention and a self-subtract mechanism to capture spatial artifacts and temporal inconsistency for robust Deepfake detection.

DeepFake Detection Face Swapping

Paper
Code

SGLoc: Scene Geometry Encoding for Outdoor LiDAR Localization

no code implementations • CVPR 2023 • Wen Li, Shangshu Yu, Cheng Wang, Guosheng Hu, Siqi Shen, Chenglu Wen

In this work, we propose a novel LiDAR localization framework, SGLoc, which decouples the pose estimation to point cloud correspondence regression and pose estimation via this correspondence.

Outdoor Localization Pose Estimation +1

Paper
Add Code

Conv-Adapter: Exploring Parameter Efficient Transfer Learning for ConvNets

no code implementations • 15 Aug 2022 • Hao Chen, Ran Tao, Han Zhang, Yidong Wang, Xiang Li, Wei Ye, Jindong Wang, Guosheng Hu, Marios Savvides

Beyond classification, Conv-Adapter can generalize to detection and segmentation tasks with more than 50% reduction of parameters but comparable performance to the traditional full fine-tuning.

Transfer Learning

Paper
Add Code

Boosting Active Learning via Improving Test Performance

1 code implementation • 10 Dec 2021 • Tianyang Wang, Xingjian Li, Pengkun Yang, Guosheng Hu, Xiangrui Zeng, Siyu Huang, Cheng-Zhong Xu, Min Xu

In this work, we explore such an impact by theoretically proving that selecting unlabeled data of higher gradient norm leads to a lower upper-bound of test loss, resulting in better test performance.

Active Learning Electron Tomography +2

125

Paper
Code

DPT: Deformable Patch-based Transformer for Visual Recognition

1 code implementation • 30 Jul 2021 • Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang

To address this problem, we propose a new Deformable Patch (DePatch) module which learns to adaptively split the images into patches with different positions and scales in a data-driven way rather than using predefined fixed patches.

Ranked #17 on Semantic Segmentation on DensePASS

Image Classification object-detection +2

144

Paper
Code

OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection

1 code implementation • CVPR 2021 • TingTing Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling

Encouraged by the success, we propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.

Neural Architecture Search object-detection +1

Paper
Code

Imbalance Robust Softmax for Deep Embeeding Learning

no code implementations • 23 Nov 2020 • Hao Zhu, Yang Yuan, Guosheng Hu, Xiang Wu, Neil Robertson

IR-Softmax can generalise to any softmax and its variants (which are discriminative for open-set problem) by directly setting the weights as their class centers, naturally solving the data imbalance problem.

Face Recognition Person Re-Identification

Paper
Add Code

Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision

1 code implementation • ECCV 2020 • Yuxiang Wei, Ming Liu, Haolin Wang, Ruifeng Zhu, Guosheng Hu, WangMeng Zuo

Despite recent advances in deep learning-based face frontalization methods, photo-realistic and illumination preserving frontal face synthesis is still challenging due to large pose and illumination discrepancy during training.

Face Generation

126

Paper
Code

Salvage Reusable Samples from Noisy Data for Robust Learning

1 code implementation • 6 Aug 2020 • Zeren Sun, Xian-Sheng Hua, Yazhou Yao, Xiu-Shen Wei, Guosheng Hu, Jian Zhang

To this end, we propose a certainty-based reusable sample selection and correction approach, termed as CRSSC, for coping with label noise in training deep FG models with web images.

Memorization

Paper
Code

DADA: Differentiable Automatic Data Augmentation

1 code implementation • ECCV 2020 • Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang

In this paper, we propose Differentiable Automatic Data Augmentation (DADA) which dramatically reduces the cost.

Ranked #15 on Data Augmentation on ImageNet

Data Augmentation

188

Paper
Code

MetaMixUp: Learning Adaptive Interpolation Policy of MixUp with Meta-Learning

no code implementations • 27 Aug 2019 • Zhijun Mai, Guosheng Hu, Dexiong Chen, Fumin Shen, Heng Tao Shen

Since deep networks are capable of memorizing the entire dataset, the corrupted samples generated by vanilla MixUp with a badly chosen interpolation policy will degrade the performance of networks.

Data Augmentation Domain Adaptation +2

Paper
Add Code

Semantic Alignment: Finding Semantically Consistent Ground-truth for Facial Landmark Detection

no code implementations • CVPR 2019 • Zhiwei Liu, Xiangyu Zhu, Guosheng Hu, Haiyun Guo, Ming Tang, Zhen Lei, Neil M. Robertson, Jinqiao Wang

Despite this, we notice that the semantic ambiguity greatly degrades the detection performance.

Ranked #1 on Face Alignment on 300W (NME_inter-pupil (%, Full) metric)

Face Alignment Facial Landmark Detection

Paper
Add Code

Learning Symmetry Consistent Deep CNNs for Face Completion

1 code implementation • 19 Dec 2018 • Xiaoming Li, Ming Liu, Jieru Zhu, WangMeng Zuo, Meng Wang, Guosheng Hu, Lei Zhang

As for missing pixels on both of half-faces, we present a generative reconstruction subnet together with a perceptual symmetry loss to enforce symmetry consistency of recovered structures.

Ranked #1 on Facial Inpainting on VggFace2

Face Recognition Facial Inpainting

Paper
Code

Deep Metric Learning by Online Soft Mining and Class-Aware Attention

3 code implementations • 4 Nov 2018 • Xinshao Wang, Yang Hua, Elyor Kodirov, Guosheng Hu, Neil M. Robertson

Therefore, we propose a novel sample mining method, called Online Soft Mining (OSM), which assigns one continuous score to each sample to make use of all samples in the mini-batch.

Metric Learning Semantic Similarity +2

Paper
Code

Deep Multi-Task Learning to Recognise Subtle Facial Expressions of Mental States

no code implementations • ECCV 2018 • Guosheng Hu, Li Liu, Yang Yuan, Zehao Yu, Yang Hua, Zhihong Zhang, Fumin Shen, Ling Shao, Timothy Hospedales, Neil Robertson, Yongxin Yang

To advance subtle expression recognition, we contribute a Large-scale Subtle Emotions and Mental States in the Wild database (LSEMSW).

Deception Detection Facial Expression Recognition +4

Paper
Add Code

Attribute-Enhanced Face Recognition With Neural Tensor Fusion Networks

no code implementations • ICCV 2017 • Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang

To solve this problem, we establish a theoretical equivalence between tensor optimisation and a two-stream gated neural network.

Attribute Face Recognition

Paper
Add Code

Deep Stock Representation Learning: From Candlestick Charts to Investment Decisions

1 code implementation • 12 Sep 2017 • Guosheng Hu, Yuxin Hu, Kai Yang, Zehao Yu, Flood Sung, Zhihong Zhang, Fei Xie, Jianguo Liu, Neil Robertson, Timothy Hospedales, Qiangwei Miemie

We propose a novel investment decision strategy (IDS) based on deep learning.

Computational Finance

Paper
Code

Dictionary Integration using 3D Morphable Face Models for Pose-invariant Collaborative-representation-based Classification

no code implementations • 1 Nov 2016 • Xiaoning Song, Zhen-Hua Feng, Guosheng Hu, Josef Kittler, William Christmas, Xiao-Jun Wu

The paper presents a dictionary integration algorithm using 3D morphable face models (3DMM) for pose-invariant collaborative-representation-based face classification.

Classification General Classification

Paper
Add Code

Frankenstein: Learning Deep Face Representations using Small Data

no code implementations • 21 Mar 2016 • Guosheng Hu, Xiaojiang Peng, Yongxin Yang, Timothy Hospedales, Jakob Verbeek

To train such networks, very large training sets are needed with millions of labeled images.

Face Recognition Heterogeneous Face Recognition +1

Paper
Add Code

A Multiresolution 3D Morphable Face Model and Fitting Framework

1 code implementation • 1 Feb 2016 • Patrik Huber, Guosheng Hu, Rafael Tena, Pouria Mortazavian, Willem P. Koppen, William Christmas, Matthias Rätsch, Josef Kittler

In this paper, we present the Surrey Face Model, a multi-resolution 3D Morphable Model that we make available to the public for non-commercial purposes.

3D Face Reconstruction Face Generation +4

1,876

Paper
Code

When Face Recognition Meets with Deep Learning: an Evaluation of Convolutional Neural Networks for Face Recognition

no code implementations • 9 Apr 2015 • Guosheng Hu, Yongxin Yang, Dong Yi, Josef Kittler, William Christmas, Stan Z. Li, Timothy Hospedales

In this work, we conduct an extensive evaluation of CNN-based face recognition systems (CNN-FRS) on a common ground to make our work easily reproducible.

Face Recognition Metric Learning +1

Paper
Add Code

Identifying Similar Patients Using Self-Organising Maps: A Case Study on Type-1 Diabetes Self-care Survey Responses

no code implementations • 21 Mar 2015 • Santosh Tirunagari, Norman Poh, Guosheng Hu, David Windridge

Diabetes is considered a lifestyle disease and a well managed self-care plays an important role in the treatment.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.