no code implementations • ECCV 2020 • Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot
The proposed network exploits joint-aware features that are crucial for both tasks, with which gesture recognition and 3D hand pose estimation boost each other to learn highly discriminative features and models.
1 code implementation • 23 Aug 2024 • Chenqi Kong, Anwei Luo, Peijun Bao, Haoliang Li, Renjie Wan, Zengwei Zheng, Anderson Rocha, Alex C. Kot
Drawing on recent advancements in vision transformers (ViT) for face forgery detection, we develop a parameter-efficient ViT-based detection model that includes lightweight forgery feature extraction modules and enables the model to extract global and local forgery clues simultaneously.
1 code implementation • 11 Jul 2024 • Laniqng Guo, Chong Wang, YuFei Wang, Siyu Huang, Wenhan Yang, Alex C. Kot, Bihan Wen
In this paper, we are the first to provide a comprehensive survey to cover various aspects ranging from technical details to applications.
no code implementations • 25 Jun 2024 • Ruohan Meng, Chenyu Yi, Yi Yu, Siyuan Yang, Bingquan Shen, Alex C. Kot
To further boost the robustness of unlearnable examples, we design a Semantic Images Generation module that produces hidden semantic images.
1 code implementation • 13 Jun 2024 • Jiahao Nie, Gongjie Zhang, Wenbin An, Yap-Peng Tan, Alex C. Kot, Shijian Lu
Despite the recent advancements in Multi-modal Large Language Models (MLLMs), understanding inter-object relations, i. e., interactions or associations between distinct objects, remains a major challenge for such models.
1 code implementation • 31 May 2024 • YuFei Wang, Zhihao LI, Lanqing Guo, Wenhan Yang, Alex C. Kot, Bihan Wen
Recently, 3D Gaussian Splatting (3DGS) has become a promising framework for novel view synthesis, offering fast rendering speeds and high fidelity.
no code implementations • 20 May 2024 • Xiyu Wang, YuFei Wang, Satoshi Tsutsui, Weisi Lin, Bihan Wen, Alex C. Kot
Additionally, to mitigate the character confusion of generated results, we propose EpicEvo, a method that customizes a diffusion-based visual story generation model with a single story featuring the new characters seamlessly integrating them into established character dynamics.
1 code implementation • 15 May 2024 • Jiahao Nie, Shan Lin, Alex C. Kot
The primary color profile of the same identity is assumed to remain consistent in typical Person Re-identification (Person ReID) tasks.
no code implementations • 11 May 2024 • Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, Alex C. Kot
Automated deception detection is crucial for assisting humans in accurately assessing truthfulness and identifying deceptive behavior.
1 code implementation • 2 May 2024 • Yi Yu, YuFei Wang, Song Xia, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
Based on this network, a two-stage purification approach is naturally developed.
no code implementations • 21 Apr 2024 • Songlin Dong, Yingjie Chen, Yuhang He, Yuhan Jin, Alex C. Kot, Yihong Gong
Online task-free continual learning (OTFCL) is a more challenging variant of continual learning which emphasizes the gradual shift of task boundaries and learns in an online mode.
1 code implementation • 12 Apr 2024 • Chenqi Kong, Anwei Luo, Peijun Bao, Yi Yu, Haoliang Li, Zengwei Zheng, Shiqi Wang, Alex C. Kot
Deepfakes have recently raised significant trust issues and security concerns among the public.
1 code implementation • CVPR 2024 • Jiahao Nie, Yun Xing, Gongjie Zhang, Pei Yan, Aoran Xiao, Yap-Peng Tan, Alex C. Kot, Shijian Lu
Cross-Domain Few-Shot Segmentation (CD-FSS) poses the challenge of segmenting novel categories from a distinct domain using only limited exemplars.
no code implementations • 24 Dec 2023 • Ling Li, Shaohua Li, Winda Marantika, Alex C. Kot, Huijing Zhan
Denoising Diffusion Probabilistic Model (DDPM) has shown great competence in image and audio generation tasks.
1 code implementation • CVPR 2024 • YuFei Wang, Wenhan Yang, Xinyuan Chen, Yaohui Wang, Lanqing Guo, Lap-Pui Chau, Ziwei Liu, Yu Qiao, Alex C. Kot, Bihan Wen
Extensive experiments conducted on synthetic and real-world datasets demonstrate that the proposed method can achieve comparable or even superior performance compared to both previous SOTA methods and the teacher model, in just one sampling step, resulting in a remarkable up to x10 speedup for inference.
no code implementations • 30 Sep 2023 • Chenqi Kong, Anwei Luo, Shiqi Wang, Haoliang Li, Anderson Rocha, Alex C. Kot
Digital image forensics plays a crucial role in image authentication and manipulation localization.
1 code implementation • 20 Sep 2023 • Anwei Luo, Rizhao Cai, Chenqi Kong, Yakun Ju, Xiangui Kang, Jiwu Huang, Alex C. Kot
With the rapid progress of generative models, the current challenge in face forgery detection is how to effectively detect realistic manipulated faces from different unseen domains.
1 code implementation • ICCV 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Different from a vanilla diffusion model that has to perform Gaussian denoising, with the injected physics-based exposure model, our restoration process can directly start from a noisy image instead of pure noise.
Ranked #1 on Image Denoising on Image Denoising on SID x300
no code implementations • 14 Jul 2023 • Siyuan Yang, Jun Liu, Shijian Lu, Er Meng Hwa, Alex C. Kot
The first is multi-scale matching which captures the scale-wise semantic relevance of skeleton data at multiple spatial and temporal scales simultaneously.
no code implementations • 9 Jul 2023 • Shulin Tian, YuFei Wang, Renjie Wan, Wenhan Yang, Alex C. Kot, Bihan Wen
In this work, we propose a novel approach to increase the visibility of images captured under low-light environments by removing the in-camera infrared (IR) cut-off filter, which allows for the capture of more photons and results in improved signal-to-noise ratio due to the inclusion of information from the IR spectrum.
1 code implementation • 21 Jun 2023 • YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C. Kot, Bihan Wen
Besides, we propose a novel design of the context model, which can better predict the order masks of encoding/decoding based on both the sRGB image and the masks of already processed features.
no code implementations • 24 Apr 2023 • Anwei Luo, Chenqi Kong, Jiwu Huang, Yongjian Hu, Xiangui Kang, Alex C. Kot
Face forgery detection is essential in combating malicious digital face attacks.
no code implementations • 18 Apr 2023 • Siyuan Yang, Jun Liu, Shijian Lu, Er Meng Hwa, Yongjian Hu, Alex C. Kot
We investigate self-supervised representation learning and design a novel skeleton cloud colorization technique that is capable of learning spatial and temporal skeleton representations from unlabeled skeleton sequence data.
no code implementations • 18 Mar 2023 • Xiyu Wang, Yuecong Xu, Jianfei Yang, Bihan Wen, Alex C. Kot
The second module compares the outputs of augmented data from the current model to the outputs of weakly augmented data from the source model, forming a novel consistency regularization on the model to alleviate the accumulation of prediction errors.
no code implementations • 3 Mar 2023 • Ziwang Xu, Lanqing Guo, Shuyan Zhang, Alex C. Kot, Bihan Wen
In this work, we propose a novel unsupervised deep learning framework for the digital staining of cell images using knowledge distillation and generative adversarial networks (GANs).
no code implementations • CVPR 2023 • Yi Yu, YuFei Wang, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot
Extensive experiments show that with our trained trigger injection models and simple modification of encoder parameters (of the compression model), the proposed attack can successfully inject several backdoors with corresponding triggers in a single image compression model.
no code implementations • 28 Feb 2023 • Chenyu Yi, Siyuan Yang, YuFei Wang, Haoliang Li, Yap-Peng Tan, Alex C. Kot
To exploit information in video with self-supervised learning, TeCo uses global content from video clips and optimizes models for entropy minimization.
1 code implementation • 12 Feb 2023 • Yawen Cui, Zitong Yu, Rizhao Cai, Xun Wang, Alex C. Kot, Li Liu
The goal of Few-Shot Continual Learning (FSCL) is to incrementally learn novel tasks with limited labeled samples and preserve previous capabilities simultaneously, while current FSCL methods are all for the class-incremental purpose.
1 code implementation • 11 Feb 2023 • YuFei Wang, Renjie Wan, Wenhan Yang, Bihan Wen, Lap-Pui Chau, Alex C. Kot
Removing image artifacts from the scratched lens protector is inherently challenging due to the occasional flare artifacts and the co-occurring interference within mixed artifacts.
1 code implementation • ICCV 2023 • Zhi Li, Pengfei Wei, Xiang Yin, Zejun Ma, Alex C. Kot
In our method, human pose and garment keypoints are extracted from source images and constructed as graphs to predict the garment keypoints at the target pose.
no code implementations • 5 Sep 2022 • Changsheng chen, Lin Zhao, Rizhao Cai, Zitong Yu, Jiwu Huang, Alex C. Kot
We integrate the trained FANet with practical recapturing detection schemes in face anti-spoofing and recaptured document detection tasks.
no code implementations • 10 Aug 2022 • Zitong Yu, Rizhao Cai, Zhi Li, Wenhan Yang, Jingang Shi, Alex C. Kot
In this paper, we establish the first joint face spoofing and forgery detection benchmark using both visual appearance and physiological rPPG cues.
no code implementations • 4 Jul 2022 • Eugene P. W. Ang, Shan Lin, Rahul Ahuja, Nemath Ahmed, Alex C. Kot
Existing evaluation metrics for Person Re-Identification (Person ReID) models focus on system-wide performance.
1 code implementation • 8 May 2022 • Zhi Li, Rizhao Cai, Haoliang Li, Kwok-Yan Lam, Yongjian Hu, Alex C. Kot
Under this framework, a teacher network is trained with source domain samples to provide discriminative feature representations for face PAD.
1 code implementation • CVPR 2022 • Yi Yu, Wenhan Yang, Yap-Peng Tan, Alex C. Kot
Finally, we examine various types of adversarial attacks that are specific to deraining problems and their effects on both human and machine vision tasks, including 1) rain region attacks, adding perturbations only in the rain regions to make the perturbations in the attacked rain images less visible; 2) object-sensitive attacks, adding perturbations only in regions near the given objects.
no code implementations • 21 Oct 2021 • Eugene P. W. Ang, Lin Shan, Alex C. Kot
With DEX and DEXLite, existing methods can gain significant improvements when tested on other unseen datasets, thereby demonstrating the general applicability of our method.
no code implementations • 18 Oct 2021 • Zhi Li, Haoliang Li, Xin Luo, Yongjian Hu, Kwok-Yan Lam, Alex C. Kot
In this paper, we propose a novel framework based on asymmetric modality translation for face presentation attack detection in bi-modality scenarios.
1 code implementation • 26 Sep 2021 • Hao Cheng, YuFei Wang, Haoliang Li, Alex C. Kot, Bihan Wen
In this work, we propose a novel Disentangled Feature Representation framework, dubbed DFR, for few-shot learning applications.
1 code implementation • 13 Sep 2021 • YuFei Wang, Renjie Wan, Wenhan Yang, Haoliang Li, Lap-Pui Chau, Alex C. Kot
To enhance low-light images to normally-exposed ones is highly ill-posed, namely that the mapping relationship between them is one-to-many.
Ranked #3 on Low-Light Image Enhancement on Sony-Total-Dark
1 code implementation • 13 Sep 2021 • YuFei Wang, Haoliang Li, Hao Cheng, Bihan Wen, Lap-Pui Chau, Alex C. Kot
Domain generalization aims to learn an invariant model that can generalize well to the unseen target domain.
no code implementations • ICCV 2021 • Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot
We investigate unsupervised representation learning for skeleton action recognition, and design a novel skeleton cloud colorization technique that is capable of learning skeleton representations from unlabeled skeleton sequence data.
no code implementations • 6 Jul 2021 • YuFei Wang, Haoliang Li, Lap-Pui Chau, Alex C. Kot
Though convolutional neural networks are widely used in different tasks, lack of generalization capability in the absence of sufficient and representative data is one of the challenges that hinder their practical application.
1 code implementation • CVPR 2021 • Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
In this paper, we consider the absorption effect for the problem of single image reflection removal.
no code implementations • CVPR 2021 • Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi
This paper studies the problem of panoramic image reflection removal, aiming at reliving the content ambiguity between reflection and transmission scenes.
no code implementations • 25 Nov 2020 • Shan Lin, Chang-Tsun Li, Alex C. Kot
To make Person Re-ID systems more practical and scalable, several cross-dataset domain adaptation methods have been proposed, which achieve high performance without the labeled data from the target domain.
no code implementations • 11 Nov 2020 • Mauricio Perez, Jun Liu, Alex C. Kot
In this paper, we leverage the skeleton information to learn the interactions between the individuals straight from it.
1 code implementation • NeurIPS 2020 • Haoliang Li, YuFei Wang, Renjie Wan, Shiqi Wang, Tie-Qiang Li, Alex C. Kot
Recently, we have witnessed great progress in the field of medical imaging classification by adopting deep neural networks.
no code implementations • 15 Sep 2020 • Haoliang Li, Yufei Wang, Xiaofei Xie, Yang Liu, Shiqi Wang, Renjie Wan, Lap-Pui Chau, Alex C. Kot
In this paper, we propose a novel black-box backdoor attack technique on face recognition systems, which can be conducted without the knowledge of the targeted DNN model.
no code implementations • 11 Sep 2020 • Yufei Wang, Haoliang Li, Alex C. Kot
One of the main drawbacks of deep Convolutional Neural Networks (DCNN) is that they lack generalization capability.
1 code implementation • 11 Oct 2019 • Mauricio Perez, Jun Liu, Alex C. Kot
Our solution is able to achieve state-of-the-art performance on the traditional interaction recognition datasets SBU and UT, and also on the mutual actions from the large-scale dataset NTU RGB+D.
Ranked #1 on Human Interaction Recognition on UT-Interaction
3 code implementations • 12 May 2019 • Jun Liu, Amir Shahroudy, Mauricio Perez, Gang Wang, Ling-Yu Duan, Alex C. Kot
Research on depth-based human activity analysis achieved outstanding performance and demonstrated the effectiveness of 3D representation for action recognition.
Ranked #5 on One-Shot 3D Action Recognition on NTU RGB+D 120
no code implementations • ICCV 2019 • Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).
no code implementations • 3 Mar 2019 • Renjie Wan, Boxin Shi, Haoliang Li, Ling-Yu Duan, Alex C. Kot
Face images captured through the glass are usually contaminated by reflections.
no code implementations • 8 Feb 2019 • Jun Liu, Amir Shahroudy, Gang Wang, Ling-Yu Duan, Alex C. Kot
Since there are significant temporal scale variations in the observed part of the ongoing action at different time steps, a novel window scale selection method is proposed to make our network focus on the performed part of the ongoing action and try to suppress the possible incoming interference from the previous actions at each step.
Ranked #68 on Skeleton Based Action Recognition on NTU RGB+D 120
no code implementations • 15 Jan 2019 • Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot
Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.
no code implementations • 17 Sep 2018 • Zhuo Chen, Weisi Lin, Shiqi Wang, Ling-Yu Duan, Alex C. Kot
The recent advances of hardware technology have made the intelligent analysis equipped at the front-end with deep learning more prevailing and practical.
no code implementations • 27 Jun 2018 • Youmei Zhang, Chunluan Zhou, Faliang Chang, Alex C. Kot
Occlusions, complex backgrounds, scale variations and non-uniform distributions present great challenges for crowd counting in practical applications.
no code implementations • CVPR 2018 • Jun Liu, Amir Shahroudy, Gang Wang, Ling-Yu Duan, Alex C. Kot
As there are significant temporal scale variations of the observed part of the ongoing action at different progress levels, we propose a novel window scale selection scheme to make our network focus on the performed part of the ongoing action and try to suppress the noise from the previous actions at each time step.
no code implementations • CVPR 2018 • Haoliang Li, Sinno Jialin Pan, Shiqi Wang, Alex C. Kot
In this paper, we tackle the problem of domain generalization: how to learn a generalized feature representation for an âunseenâ target domain by taking the advantage of multiple seen source-domain data.
Ranked #56 on Domain Generalization on PACS
1 code implementation • CVPR 2018 • Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot
Removing the undesired reflections from images taken through the glass is of broad application to various computer vision tasks.
no code implementations • CVPR 2018 • Jianlou Si, Honggang Zhang, Chun-Guang Li, Jason Kuen, Xiangfei Kong, Alex C. Kot, Gang Wang
Typical person re-identification (ReID) methods usually describe each pedestrian with a single feature vector and match them in a task-specific metric space.
no code implementations • 7 Mar 2018 • Tianyi Zhang, Guosheng Lin, Jianfei Cai, Tong Shen, Chunhua Shen, Alex C. Kot
In our work, we focus on the weakly supervised semantic segmentation with image label annotations.
no code implementations • ICCV 2017 • Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot
Removing undesired reflections from a photo taken in front of a glass is of great importance for enhancing the efficiency of visual computing systems.
no code implementations • 18 Jul 2017 • Jun Liu, Gang Wang, Ling-Yu Duan, Kamila Abdiyeva, Alex C. Kot
In this paper, we propose a new class of LSTM network, Global Context-Aware Attention LSTM (GCA-LSTM), for skeleton based action recognition.
Ranked #66 on Skeleton Based Action Recognition on NTU RGB+D 120
no code implementations • CVPR 2017 • Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot
Hence we propose a new class of LSTM network, Global Context-Aware Attention LSTM (GCA-LSTM), for 3D action recognition, which is able to selectively focus on the informative joints in the action sequence with the assistance of global contextual information.
Ranked #7 on One-Shot 3D Action Recognition on NTU RGB+D 120
no code implementations • 26 Jun 2017 • Jun Liu, Amir Shahroudy, Dong Xu, Alex C. Kot, Gang Wang
Skeleton-based human action recognition has attracted a lot of research attention during the past few years.
Ranked #6 on One-Shot 3D Action Recognition on NTU RGB+D 120