no code implementations • 8 May 2025 • Wenyang Liu, Jianjun Gao, Kim-Hui Yap
Visible watermark removal is challenging due to its inherent complexities and the noise carried within images.
no code implementations • CVPR 2025 • Rui Gong, Kim-Hui Yap, Weide Liu, Xulei Yang, Jun Cheng
Extensive experiments show that our approach outperforms both state-of-the-art matching-based and matching-free methods in vertical flow metric by 10. 7% on the Carla-Flowguided dataset and 21. 3% on the Semi-Truck Highway dataset, offering superior rectification accuracy.
1 code implementation • 28 Oct 2024 • Wenyang Liu, Kejun Wu, Tianyi Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau
By looking inside bytes, the bit-level details of file fragments can be accessed, enabling a more accurate classification.
no code implementations • 21 Oct 2024 • Jianjun Gao, Chen Cai, Ruoyu Wang, Wenyang Liu, Kim-Hui Yap, Kratika Garg, Boon-Siew Han
Human-object interaction (HOI) detection has seen advancements with Vision Language Models (VLMs), but these methods often depend on extensive manual annotations.
no code implementations • 15 Oct 2024 • Yiming Li, Yi Wang, Wenqian Wang, Dan Lin, Bingbing Li, Kim-Hui Yap
Exploring new knowledge is a fundamental human ability that can be mirrored in the development of deep neural networks, especially in the field of object detection.
1 code implementation • 1 Oct 2024 • Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap
In recent years, the rapid increase in online video content has underscored the limitations of static Video Question Answering (VideoQA) models trained on fixed datasets, as they struggle to adapt to new questions or tasks posed by newly available content.
no code implementations • 3 Aug 2024 • Ruoyu Wang, Wenqian Wang, Jianjun Gao, Dan Lin, Kim-Hui Yap, Bingbing Li
Driver action recognition, aiming to accurately identify drivers' behaviours, is crucial for enhancing driver-vehicle interactions and ensuring driving safety.
no code implementations • 17 Jun 2024 • Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap
Therefore, previous works have suggested independently learning each non-RGB modality by fine-tuning a model pre-trained on RGB videos, but these methods are less effective in extracting informative features when faced with newly-incoming modalities due to large domain gaps.
no code implementations • 21 Apr 2024 • Cai Chen, Runzhong Zhang, Jianjun Gao, Kejun Wu, Kim-Hui Yap, Yi Wang
Temporal sentence grounding involves the retrieval of a video moment with a natural language query.
no code implementations • 26 Jan 2024 • Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim
Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems.
no code implementations • CVPR 2024 • Shaowei Wang, Lingling Zhang, Longji Zhu, Tao Qin, Kim-Hui Yap, Xinyu Zhang, Jun Liu
While Large Language Models (LLMs) show promise in question-answering there is still a gap in how to cooperate and interact with the diagram parsing process.
no code implementations • 10 Nov 2023 • Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap
Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging.
1 code implementation • NeurIPS 2023 • Tianyi Liu, Kejun Wu, Yi Wang, Wenyang Liu, Kim-Hui Yap, Lap-Pui Chau
The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment.
no code implementations • 19 Sep 2023 • Jianjun Gao, Yi Wang, Kim-Hui Yap, Kratika Garg, Boon Siew Han
Particularly, the improvements on IDF1, IDSw, AssA, and AssR demonstrate the effectiveness of our OccluTrack on tracking and association performance.
1 code implementation • 13 Jun 2023 • Chen Cai, Suchen Wang, Kim-Hui Yap, Yi Wang
Weakly-supervised grounded image captioning (WSGIC) aims to generate the caption and ground (localize) predicted object words in the input image without using bounding box supervision.
no code implementations • 22 Apr 2023 • Gong Chen, Yanan Zhao, Yi Wang, Kim-Hui Yap
Recently, synthetic aperture radar (SAR) image change detection has become an interesting yet challenging direction due to the presence of speckle noise.
1 code implementation • 14 Apr 2023 • Wenyang Liu, Yi Wang, Kejun Wu, Kim-Hui Yap, Lap-Pui Chau
File fragment classification (FFC) on small chunks of memory is essential in memory forensics and Internet security.
1 code implementation • CVPR 2023 • Wenyang Liu, Yi Wang, Kim-Hui Yap, Lap-Pui Chau
In this paper, we study a real-world JPEG image restoration problem with bit errors on the encrypted bitstream.
1 code implementation • CVPR 2023 • Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap
We conduct extensive experiments to analyze each of our proposed components and show the efficacy of our framework in generating high-fidelity 3D textured and text-relevant shapes.
1 code implementation • CVPR 2022 • Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan
More specifically, we propose a new HOI visual encoder to detect the interacting humans and objects, and map them to a joint feature space to perform interaction recognition.
no code implementations • 23 Jul 2021 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung
While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels.
1 code implementation • 1 Jun 2021 • Hao Cheng, Kim-Hui Yap, Bihan Wen
Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches.
no code implementations • ICCV 2021 • Suchen Wang, Kim-Hui Yap, Henghui Ding, Jiyan Wu, Junsong Yuan, Yap-Peng Tan
In this work, we study the problem of human-object interaction (HOI) detection with large vocabulary object categories.
no code implementations • 25 Jun 2020 • Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar
We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective.
1 code implementation • CVPR 2020 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie
To the best of our knowledge, this is the first method that uses cloud-level weak labels on raw 3D space to train a point cloud semantic segmentation network.
no code implementations • CVPR 2019 • Chiat-Pin Tay, Sharmili Roy, Kim-Hui Yap
This paper proposes Attribute Attention Network (AANet), a new architecture that integrates person attributes and attribute attention maps into a classification framework to solve the person re-identification (re-ID) problem.
no code implementations • 14 Nov 2019 • Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap
In view of this, we propose a new deep semantic granularity metric learning (SGML) that develops a novel idea of leveraging attribute semantic space to capture different granularity of similarity, and then integrate this information into deep metric learning.
1 code implementation • 9 Feb 2019 • Yasin Yazici, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar
We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang
In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang
In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor.
1 code implementation • ICLR 2019 • Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar
We examine two different techniques for parameter averaging in GAN training.
no code implementations • 28 Apr 2018 • Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau
First, we detect the cars in each IR image using a convolutional neural network, which is pre-trained on regular RGB images and fine-tuned on IR images for higher accuracy.
no code implementations • ICLR 2018 • Yasin Yazici, Kim-Hui Yap, Stefan Winkler
Generative Adversarial Networks (GANs) learn a generative model by playing an adversarial game between a generator and an auxiliary discriminator, which classifies data samples vs. generated ones.