Search Results for author: Yongkang Wong

Found 38 papers, 20 papers with code

Finetuning Text-to-Image Diffusion Models for Fairness

1 code implementation11 Nov 2023 Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli

The rapid adoption of text-to-image diffusion models in society underscores an urgent need to address their biases.

Fairness

MCM: Multi-condition Motion Synthesis Framework for Multi-scenario

no code implementations6 Sep 2023 Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng

We also introduce a Transformer-based diffusion model MWNet (DDPM-like) as our main branch that can capture the spatial complexity and inter-joint correlations in motion sequences through a channel-dimension self-attention module.

Motion Synthesis

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023

no code implementations13 Jul 2023 Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli

Our research focuses on the innovative application of a differentiable logic loss in the training to leverage the co-occurrence relations between verb and noun, as well as the pre-trained Large Language Models (LLMs) to generate the logic rules for the adaptation to unseen action labels.

Action Recognition Unsupervised Domain Adaptation

Distance Matters in Human-Object Interaction Detection

1 code implementation5 Jul 2022 Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli

2) Insufficient number of distant interactions in benchmark datasets results in under-fitting on these instances.

Human-Object Interaction Detection Object +1

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

1 code implementation30 Jun 2022 Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli

On the other hand, pertaining to the implicit knowledge, the multi-modal implicit knowledge for knowledge-based VQA still remains largely unexplored.

Question Answering Retrieval +1

Learning to Minimize the Remainder in Supervised Learning

1 code implementation23 Jan 2022 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

To this end, we propose a new learning approach, namely gradient adjustment learning (GAL), to leverage the knowledge learned from the past training iterations to adjust vanilla gradients, such that the remainders are minimized and the approximations are improved.

Image Classification Image Retrieval +3

Learning to Predict Gradients for Semi-Supervised Continual Learning

1 code implementation23 Jan 2022 Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.

Continual Learning

Unsupervised Motion Representation Learning with Capsule Autoencoders

1 code implementation NeurIPS 2021 Ziwei Xu, Xudong Shen, Yongkang Wong, Mohan S Kankanhalli

We propose the Motion Capsule Autoencoder (MCAE), which addresses a key challenge in the unsupervised learning of motion representations: transformation invariance.

Action Recognition Representation Learning +2

Learning to Predict Trustworthiness with Steep Slope Loss

1 code implementation NeurIPS 2021 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

Secondly, due to the data complexity, it is challenging to differentiate the incorrect predictions from the correct ones on real-world large-scale datasets.

Fair Representation: Guaranteeing Approximate Multiple Group Fairness for Unknown Tasks

1 code implementation1 Sep 2021 Xudong Shen, Yongkang Wong, Mohan Kankanhalli

Motivated by scenarios where data is used for diverse prediction tasks, we study whether fair representation can be used to guarantee fairness for unknown tasks and for multiple fairness notions simultaneously.

Fairness valid

Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

1 code implementation10 Aug 2021 Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli

The concept module generates semantically meaningful features for primitive concepts, whereas the visual module extracts visual features for attributes and objects from input images.

Attribute Blocking +2

Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions

no code implementations ICCV 2021 Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan Kankanhalli, Xiangdong Li, Weidong Geng

In this work, we take a step towards training robust models for cross-domain pose estimation task, which brings together ideas from causal representation learning and generative adversarial networks.

3D Pose Estimation counterfactual +3

GradMix: Multi-source Transfer across Domains and Tasks

no code implementations9 Feb 2020 Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

Therefore, it is important to develop algorithms that can leverage off-the-shelf labeled dataset to learn useful knowledge for the target task.

Action Recognition Meta-Learning +1

Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$ Videos

no code implementations9 Feb 2020 Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli

To enable research in this direction, we introduce 360Action, the first omnidirectional video dataset for multi-person action recognition.

Action Localization Action Recognition +1

Direction Concentration Learning: Enhancing Congruency in Machine Learning

1 code implementation17 Dec 2019 Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

We propose a Direction Concentration Learning (DCL) method to improve congruency in the learning process, where enhancing congruency influences the convergence path to be less circuitous.

Ranked #8 on Image Classification on Tiny ImageNet Classification (using extra training data)

BIG-bench Machine Learning Continual Learning +2

Explainable Video Action Reasoning via Prior Knowledge and State Transitions

1 code implementation28 Aug 2019 Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli

Finally, by sequentially examining each state transition in the video graph, our method can detect and explain how those actions are executed with prior knowledge, just like the logical manner of thinking by humans.

Action Analysis Attribute

BEHAVIOR MODULE IN NEURAL NETWORKS

no code implementations ICLR 2019 Andrey Sakryukin, Yongkang Wong, Mohan S. Kankanhalli

This property is particularly useful for user modeling (as for dialog agents) and recommendation tasks, as allows learning personalized representations of different user states.

$\mathcal{G}$-softmax: Improving Intra-class Compactness and Inter-class Separability of Features

1 code implementation8 Apr 2019 Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.

General Classification Multi-Label Classification

Learning to Learn from Noisy Labeled Data

1 code implementation CVPR 2019 Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect.

Ranked #26 on Image Classification on Clothing1M (using extra training data)

Learning with noisy labels Meta-Learning

Unsupervised Learning of View-invariant Action Representations

1 code implementation NeurIPS 2018 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Different from previous works in video representation learning, our unsupervised learning task is to predict 3D motion in multiple target views using video representation from a source view.

Action Recognition Representation Learning +1

Interact as You Intend: Intention-Driven Human-Object Interaction Detection

no code implementations29 Aug 2018 Bingjie Xu, Junnan Li, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

The recent advances in instance-level detection tasks lay strong foundation for genuine comprehension of the visual scenes.

Human-Object Interaction Detection

Video Storytelling: Textual Summaries for Events

no code implementations25 Jul 2018 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Video storytelling introduces new challenges, mainly due to the diversity of the story and the length and complexity of the video.

Sentence

Attention Transfer from Web Images for Video Recognition

no code implementations3 Aug 2017 Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

However, due to the domain shift problem, the performance of Web images trained deep classifiers tend to degrade when directly deployed to videos.

Action Recognition Temporal Action Localization +1

Dual-Glance Model for Deciphering Social Relationships

1 code implementation ICCV 2017 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Since the beginning of early civilizations, social relationships derived from each individual fundamentally form the basis of social structure in our daily life.

object-detection Object Detection +2

Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking

no code implementations21 Jul 2016 Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan Kankanhalli

To enable the study of this problem, there exist a vast number of action datasets, which are recorded under controlled laboratory settings, real-world surveillance environments, or crawled from the Internet.

Action Recognition Benchmarking +1

Label Consistent Quadratic Surrogate Model for Visual Saliency Prediction

no code implementations CVPR 2015 Yan Luo, Yongkang Wong, Qi Zhao

In addition, since new datasets are built and shared in the community from time to time, it would be good not to retrain the entire model when new data are added.

Saliency Prediction

Automatic Classification of Human Epithelial Type 2 Cell Indirect Immunofluorescence Images using Cell Pyramid Matching

no code implementations15 Mar 2014 Arnold Wiliem, Conrad Sanderson, Yongkang Wong, Peter Hobson, Rodney F. Minchin, Brian C. Lovell

This paper describes a novel system for automatic classification of images obtained from Anti-Nuclear Antibody (ANA) pathology tests on Human Epithelial type 2 (HEp-2) cells using the Indirect Immunofluorescence (IIF) protocol.

General Classification Image Classification

Dynamic Amelioration of Resolution Mismatches for Local Feature Based Identity Inference

no code implementations8 Apr 2013 Yongkang Wong, Conrad Sanderson, Sandra Mau, Brian C. Lovell

While existing face recognition systems based on local features are robust to issues such as misalignment, they can exhibit accuracy degradation when comparing images of differing resolutions.

Face Recognition

Classification of Human Epithelial Type 2 Cell Indirect Immunofluoresence Images via Codebook Based Descriptors

no code implementations4 Apr 2013 Arnold Wiliem, Yongkang Wong, Conrad Sanderson, Peter Hobson, Shaokang Chen, Brian C. Lovell

In this paper, we propose a cell classification system comprised of a dual-region codebook-based descriptor, combined with the Nearest Convex Hull Classifier.

General Classification

Patch-based Probabilistic Image Quality Assessment for Face Selection and Improved Video-based Face Recognition

no code implementations3 Apr 2013 Yongkang Wong, Shaokang Chen, Sandra Mau, Conrad Sanderson, Brian C. Lovell

In video based face recognition, face images are typically captured over multiple frames in uncontrolled conditions, where head pose, illumination, shadowing, motion blur and focus change over the sequence.

Face Image Quality Face Image Quality Assessment +3

On Robust Face Recognition via Sparse Encoding: the Good, the Bad, and the Ugly

no code implementations7 Mar 2013 Yongkang Wong, Mehrtash T. Harandi, Conrad Sanderson

Thorough experiments on AR, FERET, exYaleB, BANCA and ChokePoint datasets show that the proposed local SR approach obtains considerably better and more robust performance than several previous state-of-the-art holistic SR methods, in both verification and closed-set identification problems.

Face Recognition Face Verification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.