Search Results for author: Hubert P. H. Shum

Found 55 papers, 31 papers with code

BP-SGCN: Behavioral Pseudo-Label Informed Sparse Graph Convolution Network for Pedestrian and Heterogeneous Trajectory Prediction

1 code implementation20 Feb 2025 Ruochen Li, Stamos Katsigiannis, Tae-Kyun Kim, Hubert P. H. Shum

In this work, we introduce the behavioral pseudo-labels that effectively capture the behavior distributions of pedestrians and heterogeneous agents solely based on their motion features, significantly improving the accuracy of trajectory prediction.

Autonomous Vehicles Prediction +2

Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory Prediction

1 code implementation4 Feb 2025 Ruochen Li, Tanqiu Qiao, Stamos Katsigiannis, Zhanxing Zhu, Hubert P. H. Shum

We propose the Edge-to-Edge-Node-to-Node Graph Convolution (E2E-N2N-GCN), a novel dual-graph network that jointly models explicit N2N social interactions among pedestrians and implicit E2E influence propagation across these interaction patterns.

Pedestrian Trajectory Prediction Trajectory Prediction

A Comprehensive Survey of Action Quality Assessment: Method and Benchmark

no code implementations15 Dec 2024 Kanglei Zhou, Ruizhi Cai, Liyuan Wang, Hubert P. H. Shum, Xiaohui Liang

In this work, we address these gaps by systematically analyzing over 150 AQA-related papers to develop a hierarchical taxonomy, construct a unified benchmark, and provide an in-depth analysis of current trends, challenges, and future directions.

Action Quality Assessment Computational Efficiency +1

Adaptive Graph Learning from Spatial Information for Surgical Workflow Anticipation

no code implementations9 Dec 2024 Francis Xiatian Zhang, Jingjing Deng, Robert Lieck, Hubert P. H. Shum

First, we introduce a new representation of spatial information based on bounding boxes of surgical instruments and targets, including their detection confidence levels.

Graph Learning

Artificial Intelligence for Geometry-Based Feature Extraction, Analysis and Synthesis in Artistic Images: A Survey

no code implementations2 Dec 2024 Mridula Vijendran, Jingjing Deng, Shuang Chen, Edmond S. L. Ho, Hubert P. H. Shum

Artificial Intelligence significantly enhances the visual art industry by analyzing, identifying and generating digitized artistic images.

One-Index Vector Quantization Based Adversarial Attack on Image Classification

no code implementations2 Sep 2024 Haiju Fan, Xiaona Qin, Shuang Chen, Hubert P. H. Shum, Ming Li

In this paper, we propose a novel one-index attack method in the VQ domain to generate adversarial images by a differential evolution algorithm, successfully resulting in image misclassification in victim models.

Adversarial Attack Image Classification +1

MxT: Mamba x Transformer for Image Inpainting

1 code implementation23 Jul 2024 Shuang Chen, Amir Atapour-Abarghouei, Haozheng Zhang, Hubert P. H. Shum

Image inpainting, or image completion, is a crucial task in computer vision that aims to restore missing or damaged regions of images with semantically coherent content.

Image Inpainting Mamba

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

1 code implementation14 Jul 2024 Li Li, Hubert P. H. Shum, Toby P. Breckon

To effectively embed high-dimensional RAPiD features, we propose a double-nested autoencoder structure with a novel class-aware embedding objective to encode high-dimensional features into manageable voxel-wise embeddings.

3D Semantic Segmentation Autonomous Driving +3

From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos

no code implementations1 Jul 2024 Tanqiu Qiao, Ruochen Li, Frederick W. B. Li, Hubert P. H. Shum

Video-based Human-Object Interaction (HOI) recognition explores the intricate dynamics between humans and objects, which are essential for a comprehensive understanding of human behavior and intentions.

Human-Object Interaction Detection

Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling

1 code implementation26 Jun 2024 Abril Corona-Figueroa, Hubert P. H. Shum, Chris G. Willcocks

This paper investigates a 2D to 3D image translation method with a straightforward technique, enabling correlated 2D X-ray to 3D CT-like reconstruction.

3D Reconstruction

Geometric Features Enhanced Human-Object Interaction Detection

1 code implementation26 Jun 2024 Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum

Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes.

Human-Object Interaction Detection Object

DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications

1 code implementation International Conference on 3D Vision (3DV) 2021 Li Li, Khalid N. Ismail, Hubert P. H. Shum, Toby P. Breckon

We present DurLAR, a high-fidelity 128-channel 3D LiDAR dataset with panoramic ambient (near infrared) and reflectivity imagery, as well as a sample benchmark task using depth estimation for autonomous driving applications.

Autonomous Driving Monocular Depth Estimation

Two-Person Interaction Augmentation with Skeleton Priors

no code implementations8 Apr 2024 Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang

Close and continuous interaction with rich contacts is a crucial aspect of human activities (e. g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc.

Activity Recognition Deep Learning +1

HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced Attention

1 code implementation22 Feb 2024 Shuang Chen, Amir Atapour-Abarghouei, Hubert P. H. Shum

In this paper, we propose an end-to-end High-quality INpainting Transformer, abbreviated as HINT, which consists of a novel mask-aware pixel-shuffle downsampling module (MPD) to preserve the visible information extracted from the corrupted image while maintaining the integrity of the information available for high-level inferences made within the model.

Image Inpainting speech-recognition +1

Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review

no code implementations17 Feb 2024 Merryn D. Constable, Hubert P. H. Shum, Stephen Clark

When technical requirements are high, and patient outcomes are critical, opportunities for monitoring and improving surgical skills via objective motion analysis feedback may be particularly beneficial.

Pose Estimation

Correlation-Distance Graph Learning for Treatment Response Prediction from rs-fMRI

1 code implementation17 Nov 2023 Xiatian Zhang, Sisi Zheng, Hubert P. H. Shum, Haozheng Zhang, Nan Song, Mingkang Song, Hongxiao Jia

To overcome that, we propose a graph learning framework that captures comprehensive features by integrating both correlation and distance-based similarity measures under a contrastive loss.

Functional Connectivity Graph Learning

U3DS$^3$: Unsupervised 3D Semantic Scene Segmentation

no code implementations10 Nov 2023 Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum

To achieve this, U3DS$^3$ leverages a generalized unsupervised segmentation method for both object and background across both indoor and outdoor static 3D point clouds with no requirement for model pre-training, by leveraging only the inherent information of the point cloud to achieve full 3D scene segmentation.

Point Cloud Segmentation Representation Learning +2

Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers

1 code implementation ICCV 2023 Abril Corona-Figueroa, Sam Bond-Taylor, Neelanjan Bhowmik, Yona Falinie A. Gaus, Toby P. Breckon, Hubert P. H. Shum, Chris G. Willcocks

Generating 3D images of complex objects conditionally from a few 2D views is a difficult synthesis problem, compounded by issues such as domain gap and geometric misalignment.

Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation

no code implementations24 Aug 2023 Qi Feng, Hubert P. H. Shum, Shigeo Morishima

To explore this influence, we first propose a pilot study that captures real environments with multiple eye heights and asks participants to judge the egocentric distances and immersion.

Semantic Segmentation

Hard No-Box Adversarial Attack on Skeleton-Based Human Action Recognition with Skeleton-Motion-Informed Gradient

1 code implementation ICCV 2023 Zhengzhi Lu, He Wang, Ziyi Chang, Guoan Yang, Hubert P. H. Shum

Specifically, we first learn a motion manifold where we define an adversarial loss to compute a new gradient for the attack, named skeleton-motion-informed (SMI) gradient.

Action Recognition Adversarial Attack +2

On the Design Fundamentals of Diffusion Models: A Survey

no code implementations7 Jun 2023 Ziyi Chang, George Alex Koulieris, Hubert P. H. Shum

Diffusion models are generative models, which gradually add and remove noise to learn the underlying distribution of training data for data generation.

Survey

INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network

no code implementations17 May 2023 Shuang Chen, Amir Atapour-Abarghouei, Edmond S. L. Ho, Hubert P. H. Shum

We present a software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries.

Image Inpainting

Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

1 code implementation CVPR 2023 Li Li, Hubert P. H. Shum, Toby P. Breckon

Whilst the availability of 3D LiDAR point cloud data has significantly grown in recent years, annotation remains expensive and time-consuming, leading to a demand for semi-supervised semantic segmentation methods with application domains such as autonomous driving.

 Ranked #1 on 3D Semantic Segmentation on ScribbleKITTI (mIoU-1% metric)

3D Semantic Segmentation Autonomous Driving +3

Tackling Data Bias in Painting Classification with Style Transfer

1 code implementation6 Jan 2023 Mridula Vijendran, Frederick W. B. Li, Hubert P. H. Shum

We propose a system to handle data bias in small paintings datasets like the Kaokore dataset while simultaneously accounting for domain adaptation in fine-tuning a model trained on real world images.

Classification Data Augmentation +2

Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models

1 code implementation16 Dec 2022 Ziyi Chang, Edmund J. C. Findlay, Haozheng Zhang, Hubert P. H. Shum

To achieve high-quality results, we design a multi-task architecture of diffusion model that strategically generates aspects of human motions for local guidance.

Denoising Motion Synthesis +1

3D Reconstruction of Sculptures from Single Images via Unsupervised Domain Adaptation on Implicit Models

1 code implementation9 Oct 2022 Ziyi Chang, George Alex Koulieris, Hubert P. H. Shum

Acquiring the virtual equivalent of exhibits, such as sculptures, in virtual reality (VR) museums, can be labour-intensive and sometimes infeasible.

3D Reconstruction Unsupervised Domain Adaptation

A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

1 code implementation18 Aug 2022 Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

As a result, we propose a solution that explicitly takes both individual joint features and inter-joint features as input, relieving the system from the need of discovering more complicated features from small data.

Time Series Time Series Analysis +1

Towards Graph Representation Learning Based Surgical Workflow Anticipation

1 code implementation7 Aug 2022 Xiatian Zhang, Noura Al Moubayed, Hubert P. H. Shum

Hence, we propose a graph representation learning framework to comprehensively represent instrument motions in the surgical workflow anticipation problem.

Graph Representation Learning

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

1 code implementation19 Jul 2022 Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum

Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN).

Human-Object Interaction Detection

Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video

no code implementations14 Jul 2022 Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Hubert P. H. Shum

To this end, we propose to classify Parkinson's tremor since it is one of the most predominant symptoms of PD with strong generalizability.

Decision Making Diagnostic

A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection

1 code implementation11 Jul 2022 Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum

Our network exploits the spatial connections between human keypoints and object keypoints to capture their fine-grained structural interactions via graph convolutions.

Human-Object Interaction Detection Object

Multiclass-SGCN: Sparse Graph-based Trajectory Prediction with Agent Class Embedding

1 code implementation30 Jun 2022 Ruochen Li, Stamos Katsigiannis, Hubert P. H. Shum

Trajectory prediction of road users in real-world scenarios is challenging because their movement patterns are stochastic and complex.

Trajectory Prediction

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

1 code implementation16 Feb 2022 Qi Feng, Hubert P. H. Shum, Shigeo Morishima

In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem.

Autonomous Driving Depth Estimation +2

MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware CT-Projections from a Single X-ray

2 code implementations2 Feb 2022 Abril Corona-Figueroa, Jonathan Frawley, Sam Bond-Taylor, Sarath Bethapudi, Hubert P. H. Shum, Chris G. Willcocks

Computed tomography (CT) is an effective medical imaging modality, widely used in the field of clinical medicine for the diagnosis of various pathologies.

Computed Tomography (CT)

Spoofing Detection on Hand Images Using Quality Assessment

no code implementations22 Oct 2021 Asish Bera, Ratnadeep Dey, Debotosh Bhattacharjee, Mita Nasipuri, Hubert P. H. Shum

A presentation attack detection approach is addressed by assessing the visual quality of genuine and fake hand images.

Classification

Semi-Supervised Crowd Counting from Unlabeled Data

no code implementations31 Aug 2021 Haoran Duan, Fan Wan, Rui Sun, Zeyu Wang, Varun Ojha, Yu Guan, Hubert P. H. Shum, Bingzhang Hu, Yang Long

Our method achieved competitive performance in semi-supervised learning approaches on these crowd counting datasets.

Crowd Counting

Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction

1 code implementation10 Aug 2021 Ben A. Rainbow, Qianhui Men, Hubert P. H. Shum

This is because they ignore the impact of the implicit correlations between different types of road users on the trajectory to be predicted - for example, a nearby pedestrian has a different level of influence from a nearby car.

Pedestrian Trajectory Prediction Trajectory Prediction

Interpreting Deep Learning based Cerebral Palsy Prediction with Channel Attention

no code implementations8 Jun 2021 Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

To highlight the capacity of the deep network in modelling input features, we utilize raw joint positions instead of hand-crafted features.

Deep Learning Diagnostic

UAV-ReID: A Benchmark on Unmanned Aerial Vehicle Re-identification in Video Imagery

1 code implementation13 Apr 2021 Daniel Organisciak, Matthew Poyser, Aishah Alsehaim, Shanfeng Hu, Brian K. S. Isaac-Medina, Toby P. Breckon, Hubert P. H. Shum

As unmanned aerial vehicles (UAVs) become more accessible with a growing range of applications, the potential risk of UAV disruption increases.

Vehicle Re-Identification

Illumination-Based Data Augmentation for Robust Background Subtraction

1 code implementation18 Oct 2019 Dimitrios Sakkos, Hubert P. H. Shum, Edmond S. L. Ho

A core challenge in background subtraction (BGS) is handling videos with sudden illumination changes in consecutive frames.

Data Augmentation Foreground Segmentation +2

Spatio-temporal Manifold Learning for Human Motions via Long-horizon Modeling

no code implementations20 Aug 2019 He Wang, Edmond S. L. Ho, Hubert P. H. Shum, Zhanxing Zhu

In this paper, we propose a new deep network to tackle these challenges by creating a natural motion manifold that is versatile for many applications.

Denoising Time Series Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.