Search Results for author: Alex Kot

Found 21 papers, 8 papers with code

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models

2 code implementations5 Dec 2023 Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Xing Luo, Chenyu Yi, Alex Kot

Large Multimodal Models (LMMs) such as GPT-4V and LLaVA have shown remarkable capabilities in visual reasoning with common image styles.

Benchmarking Visual Question Answering +1

Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras

1 code implementation1 May 2020 Olly Styles, Tanaya Guha, Victor Sanchez, Alex Kot

To facilitate research in this new area, we release the Warwick-NTU Multi-camera Forecasting Database (WNMF), a unique dataset of multi-camera pedestrian trajectories from a network of 15 synchronized cameras.

Pedestrian Trajectory Prediction Trajectory Forecasting

Raw Image Reconstruction with Learned Compact Metadata

1 code implementation CVPR 2023 YuFei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex Kot, Bihan Wen

While raw images exhibit advantages over sRGB images (e. g., linearity and fine-grained quantization level), they are not widely used by common users due to the large storage requirements.

Image Compression Image Reconstruction +1

Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions

1 code implementation13 Oct 2021 Chenyu Yi, Siyuan Yang, Haoliang Li, Yap-Peng Tan, Alex Kot

The state-of-the-art deep neural networks are vulnerable to common corruptions (e. g., input data degradations, distortions, and disturbances caused by weather changes, system error, and processing).

Benchmarking Computational Efficiency +1

AI-GAN: Attack-Inspired Generation of Adversarial Examples

1 code implementation6 Feb 2020 Tao Bai, Jun Zhao, Jinlin Zhu, Shoudong Han, Jiefeng Chen, Bo Li, Alex Kot

Deep neural networks (DNNs) are vulnerable to adversarial examples, which are crafted by adding imperceptible perturbations to inputs.

Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning

1 code implementation ICCV 2023 Xiaobao Guo, Nithish Muthuchamy Selvaraj, Zitong Yu, Adams Wai-Kin Kong, Bingquan Shen, Alex Kot

Despite this, deception detection research is hindered by the lack of high-quality deception datasets, as well as the difficulties of learning multimodal features effectively.

Deception Detection Multi-Task Learning

Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation

no code implementations ECCV 2020 Tianyi Zhang, Guosheng Lin, Weide Liu, Jianfei Cai, Alex Kot

Finally, by training the segmentation model with the masks generated by our Splitting vs Merging strategy, we achieve the state-of-the-art weakly-supervised segmentation results on the Pascal VOC 2012 benchmark.

Segmentation Weakly supervised segmentation +2

Feature Distillation With Guided Adversarial Contrastive Learning

no code implementations21 Sep 2020 Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.

Adversarial Robustness Contrastive Learning

Towards Efficiently Evaluating the Robustness of Deep Neural Networks in IoT Systems: A GAN-based Method

no code implementations19 Nov 2021 Tao Bai, Jun Zhao, Jinlin Zhu, Shoudong Han, Jiefeng Chen, Bo Li, Alex Kot

Through extensive experiments, AI-GAN achieves high attack success rates, outperforming existing methods, and reduces generation time significantly.

Enhancing Low-Light Images in Real World via Cross-Image Disentanglement

no code implementations10 Jan 2022 Lanqing Guo, Renjie Wan, Wenhan Yang, Alex Kot, Bihan Wen

Images captured in the low-light condition suffer from low visibility and various imaging artifacts, e. g., real noise.

Disentanglement Low-Light Image Enhancement

Toward domain generalized pruning by scoring out-of-distribution importance

no code implementations25 Oct 2022 Rizhao Cai, Haoliang Li, Alex Kot

Filter pruning has been widely used for compressing convolutional neural networks to reduce computation costs during the deployment stage.

Domain Generalization

Traceable and Authenticable Image Tagging for Fake News Detection

no code implementations20 Nov 2022 Ruohan Meng, Zhili Zhou, Qi Cui, Kwok-Yan Lam, Alex Kot

Extensive experiments, on diverse datasets and unseen manipulations, demonstrate that the proposed tagging approach achieves excellent performance in the aspects of both authenticity verification and source tracing for reliable fake news detection and outperforms the prior works.

Fake News Detection TAG

Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing

no code implementations11 Feb 2023 Zitong Yu, Rizhao Cai, Yawen Cui, Xin Liu, Yongjian Hu, Alex Kot

In this paper, we investigate three key factors (i. e., inputs, pre-training, and finetuning) in ViT for multimodal FAS with RGB, Infrared (IR), and Depth.

Face Anti-Spoofing

Flexible-modal Deception Detection with Audio-Visual Adapter

no code implementations11 Feb 2023 Zhaoxu Li, Zitong Yu, Nithish Muthuchamy Selvaraj, Xiaobao Guo, Bingquan Shen, Adams Wai-Kin Kong, Alex Kot

Detecting deception by human behaviors is vital in many fields such as custom security and multimedia anti-fraud.

Deception Detection

Rehearsal-Free Domain Continual Face Anti-Spoofing: Generalize More and Forget Less

no code implementations ICCV 2023 Rizhao Cai, Yawen Cui, Zhi Li, Zitong Yu, Haoliang Li, Yongjian Hu, Alex Kot

To alleviate the forgetting of previous domains without using previous data, we propose the Proxy Prototype Contrastive Regularization (PPCR) to constrain the continual learning with previous domain knowledge from the proxy prototypes.

Continual Learning Domain Generalization +1

Hyperbolic Face Anti-Spoofing

no code implementations17 Aug 2023 Shuangpeng Han, Rizhao Cai, Yawen Cui, Zitong Yu, Yongjian Hu, Alex Kot

To further improve generalization, we conduct hyperbolic contrastive learning for the bonafide only while relaxing the constraints on diverse spoofing attacks.

Contrastive Learning Face Anti-Spoofing +1

MIMIC: Mask Image Pre-training with Mix Contrastive Fine-tuning for Facial Expression Recognition

no code implementations14 Jan 2024 Fan Zhang, Xiaobao Guo, Xiaojiang Peng, Alex Kot

In addition, when compared with the domain disparity existing between face datasets and FER datasets, the divergence between general datasets and FER datasets is more pronounced.

Contrastive Learning Face Recognition +3

Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations

no code implementations21 Mar 2024 Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot

This is particularly true for medical image segmentation (MIS) datasets, where the processes of collection and fine-grained annotation are time-intensive and laborious.

Image Classification Image Generation +4

Cannot find the paper you are looking for? You can Submit a new open access paper.