Search Results for author: Kim-Hui Yap

Found 33 papers, 12 papers with code

SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal

no code implementations8 May 2025 Wenyang Liu, Jianjun Gao, Kim-Hui Yap

Visible watermark removal is challenging due to its inherent complexities and the noise carried within images.

Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification

no code implementations CVPR 2025 Rui Gong, Kim-Hui Yap, Weide Liu, Xulei Yang, Jun Cheng

Extensive experiments show that our approach outperforms both state-of-the-art matching-based and matching-free methods in vertical flow metric by 10. 7% on the Carla-Flowguided dataset and 21. 3% on the Semi-Truck Highway dataset, offering superior rectification accuracy.

Autonomous Vehicles Optical Flow Estimation +1

ByteNet: Rethinking Multimedia File Fragment Classification through Visual Perspectives

1 code implementation28 Oct 2024 Wenyang Liu, Kejun Wu, Tianyi Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

By looking inside bytes, the bit-level details of file fragments can be accessed, enabling a more accurate classification.

CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models

no code implementations21 Oct 2024 Jianjun Gao, Chen Cai, Ruoyu Wang, Wenyang Liu, Kim-Hui Yap, Kratika Garg, Boon-Siew Han

Human-object interaction (HOI) detection has seen advancements with Vision Language Models (VLMs), but these methods often depend on extensive manual annotations.

Human-Object Interaction Detection

Open World Object Detection: A Survey

no code implementations15 Oct 2024 Yiming Li, Yi Wang, Wenqian Wang, Dan Lin, Bingbing Li, Kim-Hui Yap

Exploring new knowledge is a fundamental human ability that can be mirrored in the development of deep neural networks, especially in the field of object detection.

Incremental Learning Object +4

Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting

1 code implementation1 Oct 2024 Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap

In recent years, the rapid increase in online video content has underscored the limitations of static Video Question Answering (VideoQA) models trained on fixed datasets, as they struggle to adapt to new questions or tasks posed by newly available content.

Continual Learning Language Modeling +4

MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition

no code implementations3 Aug 2024 Ruoyu Wang, Wenqian Wang, Jianjun Gao, Dan Lin, Kim-Hui Yap, Bingbing Li

Driver action recognition, aiming to accurately identify drivers' behaviours, is crucial for enhancing driver-vehicle interactions and ensuring driving safety.

Action Recognition

CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition

no code implementations17 Jun 2024 Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap

Therefore, previous works have suggested independently learning each non-RGB modality by fine-tuning a model pre-trained on RGB videos, but these methods are less effective in extracting informative features when faced with newly-incoming modalities due to large domain gaps.

Action Recognition Continual Learning

CoG-DQA: Chain-of-Guiding Learning with Large Language Models for Diagram Question Answering

no code implementations CVPR 2024 Shaowei Wang, Lingling Zhang, Longji Zhu, Tao Qin, Kim-Hui Yap, Xinyu Zhang, Jun Liu

While Large Language Models (LLMs) show promise in question-answering there is still a gap in how to cooperate and interact with the diagram parsing process.

Question Answering Visual Question Answering

Learning-Based Biharmonic Augmentation for Point Cloud Classification

no code implementations10 Nov 2023 Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap

Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging.

Classification Data Augmentation +2

Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method

1 code implementation NeurIPS 2023 Tianyi Liu, Kejun Wu, Yi Wang, Wenyang Liu, Kim-Hui Yap, Lap-Pui Chau

The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment.

Video Inpainting

OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking

no code implementations19 Sep 2023 Jianjun Gao, Yi Wang, Kim-Hui Yap, Kratika Garg, Boon Siew Han

Particularly, the improvements on IDF1, IDSw, AssA, and AssR demonstrate the effectiveness of our OccluTrack on tracking and association performance.

Motion Estimation

Top-Down Framework for Weakly-supervised Grounded Image Captioning

1 code implementation13 Jun 2023 Chen Cai, Suchen Wang, Kim-Hui Yap, Yi Wang

Weakly-supervised grounded image captioning (WSGIC) aims to generate the caption and ground (localize) predicted object words in the input image without using bounding box supervision.

Image Captioning Multi-Label Classification +3

SSN: Stockwell Scattering Network for SAR Image Change Detection

no code implementations22 Apr 2023 Gong Chen, Yanan Zhao, Yi Wang, Kim-Hui Yap

Recently, synthetic aperture radar (SAR) image change detection has become an interesting yet challenging direction due to the presence of speckle noise.

Change Detection Computational Efficiency

A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings

1 code implementation14 Apr 2023 Wenyang Liu, Yi Wang, Kejun Wu, Kim-Hui Yap, Lap-Pui Chau

File fragment classification (FFC) on small chunks of memory is essential in memory forensics and Internet security.

Data Augmentation

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

1 code implementation CVPR 2023 Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap

We conduct extensive experiments to analyze each of our proposed components and show the efficacy of our framework in generating high-fidelity 3D textured and text-relevant shapes.

Diversity

Learning Transferable Human-Object Interaction Detector With Natural Language Supervision

1 code implementation CVPR 2022 Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan

More specifically, we propose a new HOI visual encoder to detect the interacting humans and objects, and map them to a joint feature space to perform interaction recognition.

Human-Object Interaction Detection

Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds

no code implementations23 Jul 2021 Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung

While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels.

Point Cloud Segmentation Scene Understanding +3

Reconciliation of Statistical and Spatial Sparsity For Robust Image and Image-Set Classification

1 code implementation1 Jun 2021 Hao Cheng, Kim-Hui Yap, Bihan Wen

Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches.

Classification image-classification +1

Empirical Analysis of Overfitting and Mode Drop in GAN Training

no code implementations25 Jun 2020 Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective.

Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds

1 code implementation CVPR 2020 Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie

To the best of our knowledge, this is the first method that uses cloud-level weak labels on raw 3D space to train a point cloud semantic segmentation network.

3D Semantic Segmentation Point Cloud Segmentation +2

AANet: Attribute Attention Network for Person Re-Identifications

no code implementations CVPR 2019 Chiat-Pin Tay, Sharmili Roy, Kim-Hui Yap

This paper proposes Attribute Attention Network (AANet), a new architecture that integrates person attributes and attribute attention maps into a classification framework to solve the person re-identification (re-ID) problem.

Attribute Person Re-Identification +1

Semantic Granularity Metric Learning for Visual Search

no code implementations14 Nov 2019 Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap

In view of this, we propose a new deep semantic granularity metric learning (SGML) that develops a novel idea of leveraging attribute semantic space to capture different granularity of similarity, and then integrate this information into deep metric learning.

Attribute Metric Learning +1

Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

1 code implementation9 Feb 2019 Yasin Yazici, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities.

Interest Point Detection based on Adaptive Ternary Coding

no code implementations31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.

Face Recognition Interest Point Detection +1

DCI: Discriminative and Contrast Invertible Descriptor

no code implementations31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor.

Object Object Recognition +1

Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks

no code implementations28 Apr 2018 Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau

First, we detect the cars in each IR image using a convolutional neural network, which is pre-trained on regular RGB images and fine-tuned on IR images for higher accuracy.

Event Detection vehicle detection

Autoregressive Generative Adversarial Networks

no code implementations ICLR 2018 Yasin Yazici, Kim-Hui Yap, Stefan Winkler

Generative Adversarial Networks (GANs) learn a generative model by playing an adversarial game between a generator and an auxiliary discriminator, which classifies data samples vs. generated ones.

Binary Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.