Search Results for author: Mohan Kankanhalli

Found 84 papers, 47 papers with code

Near-Optimal Active Learning of Multi-Output Gaussian Processes

1 code implementation • 21 Nov 2015 • Yehong Zhang, Trong Nghia Hoang, Kian Hsiang Low, Mohan Kankanhalli

This paper addresses the problem of active learning of a multi-output Gaussian process (MOGP) model representing multiple types of coexisting correlated environmental phenomena.

Active Learning Gaussian Processes

Paper
Code

Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking

no code implementations • 21 Jul 2016 • Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan Kankanhalli

To enable the study of this problem, there exist a vast number of action datasets, which are recorded under controlled laboratory settings, real-world surveillance environments, or crawled from the Internet.

Action Recognition Benchmarking +1

Paper
Add Code

Attention Transfer from Web Images for Video Recognition

no code implementations • 3 Aug 2017 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

However, due to the domain shift problem, the performance of Web images trained deep classifiers tend to degrade when directly deployed to videos.

Action Recognition Temporal Action Localization +1

Paper
Add Code

Looking Beyond a Clever Narrative: Visual Context and Attention are Primary Drivers of Affect in Video Advertisements

no code implementations • 14 Aug 2018 • Abhinav Shukla, Harish Katti, Mohan Kankanhalli, Ramanathan Subramanian

Contrary to the popular notion that ad affect hinges on the narrative and the clever use of linguistic and social cues, we find that actively attended objects and the coarse scene structure better encode affective information as compared to individual scene objects or conspicuous background elements.

Paper
Add Code

Unsupervised Online Video Object Segmentation with Motion Property Understanding

1 code implementation • IEEE Transactions on Image Processing 2019 • Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli

Moreover, our method achieves better performance than the best unsupervised offline algorithm on the DAVIS-2016 benchmark dataset.

Motion Detection Object +4

Paper
Code

MMALFM: Explainable Recommendation by Leveraging Reviews and Images

no code implementations • 12 Nov 2018 • Zhiyong Cheng, Xiaojun Chang, Lei Zhu, Rose C. Kanjirathinkal, Mohan Kankanhalli

Then the aspect importance is integrated into a novel aspect-aware latent factor model (ALFM), which learns user's and item's latent factors based on ratings.

Explainable Recommendation

Paper
Add Code

Learning to Learn from Noisy Labeled Data

1 code implementation • CVPR 2019 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect.

Ranked #26 on Image Classification on Clothing1M (using extra training data)

Learning with noisy labels Meta-Learning

120

Paper
Code

Towards Robust ResNet: A Small Step but A Giant Leap

no code implementations • 28 Feb 2019 • Jingfeng Zhang, Bo Han, Laura Wynter, Kian Hsiang Low, Mohan Kankanhalli

Our analytical studies reveal that the step factor h in the Euler method is able to control the robustness of ResNet in both its training and generalization.

Paper
Add Code

Recognition of Advertisement Emotions with Application to Computational Advertising

no code implementations • 3 Apr 2019 • Abhinav Shukla, Shruti Shriya Gullapuram, Harish Katti, Mohan Kankanhalli, Stefan Winkler, Ramanathan Subramanian

Advertisements (ads) often contain strong affective content to capture viewer attention and convey an effective message to the audience.

EEG Multi-Task Learning

Paper
Add Code

$\mathcal{G}$-softmax: Improving Intra-class Compactness and Inter-class Separability of Features

1 code implementation • 8 Apr 2019 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.

General Classification Multi-Label Classification

Paper
Code

Quantifying and Alleviating the Language Prior Problem in Visual Question Answering

1 code implementation • 13 May 2019 • Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Yibing Liu, Yinglong Wang, Mohan Kankanhalli

Benefiting from the advancement of computer vision, natural language processing and information retrieval techniques, visual question answering (VQA), which aims to answer questions about an image or a video, has received lots of attentions over the past few years.

Information Retrieval Question Answering +2

Paper
Code

Augmenting Physiological Time Series Data: A Case Study for Sleep Apnea Detection

no code implementations • 22 May 2019 • Konstantinos Nikolaidis, Stein Kristiansen, Vera Goebel, Thomas Plagemann, Knut Liestøl, Mohan Kankanhalli

Supervised machine learning applications in the health domain often face the problem of insufficient training datasets.

Generative Adversarial Network Sleep apnea detection +2

Paper
Add Code

User Diverse Preference Modeling by Multimodal Attentive Metric Learning

1 code implementation • 21 Aug 2019 • Fan Liu, Zhiyong Cheng, Changchang Sun, Yinglong Wang, Liqiang Nie, Mohan Kankanhalli

To tackle this problem, in this paper, we propose a novel Multimodal Attentive Metric Learning (MAML) method to model user diverse preferences for various items.

Metric Learning Recommendation Systems

Paper
Code

Fast Video Object Segmentation via Mask Transfer Network

no code implementations • 28 Aug 2019 • Tao Zhuo, Zhiyong Cheng, Mohan Kankanhalli

To overcome this limitation, we propose a novel mask transfer network (MTN), which can greatly boost the processing speed of VOS and also achieve a reasonable accuracy.

Object Semantic Segmentation +2

Paper
Add Code

Explainable Video Action Reasoning via Prior Knowledge and State Transitions

1 code implementation • 28 Aug 2019 • Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli

Finally, by sequentially examining each state transition in the video graph, our method can detect and explain how those actions are executed with prior knowledge, just like the logical manner of thinking by humans.

Action Analysis Attribute

Paper
Code

Solving Raven's Progressive Matrices with Neural Networks

1 code implementation • 5 Feb 2020 • Tao Zhuo, Mohan Kankanhalli

Based on the design of the pseudo target, MCPT converts the unsupervised learning problem to a supervised task.

Paper
Code

GradMix: Multi-source Transfer across Domains and Tasks

no code implementations • 9 Feb 2020 • Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli

Therefore, it is important to develop algorithms that can leverage off-the-shelf labeled dataset to learn useful knowledge for the target task.

Action Recognition Meta-Learning +1

Paper
Add Code

Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$ Videos

no code implementations • 9 Feb 2020 • Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli

To enable research in this direction, we introduce 360Action, the first omnidirectional video dataset for multi-person action recognition.

Action Localization Action Recognition +1

Paper
Add Code

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger

1 code implementation • ICML 2020 • Jingfeng Zhang, Xilie Xu, Bo Han, Gang Niu, Lizhen Cui, Masashi Sugiyama, Mohan Kankanhalli

Adversarial training based on the minimax formulation is necessary for obtaining adversarial robustness of trained models.

Adversarial Robustness

118

Paper
Code

Evaluating Salient Object Detection in Natural Images with Multiple Objects having Multi-level Saliency

1 code implementation • 19 Mar 2020 • Gökhan Yildirim, Debashis Sen, Mohan Kankanhalli, Sabine Süsstrunk

In this paper, we corroborate based on three subjective experiments on a novel image dataset that objects in natural images are inherently perceived to have varying levels of importance.

Object object-detection +3

Paper
Code

Adversarial Learning for Personalized Tag Recommendation

1 code implementation • 1 Apr 2020 • Erik Quintanilla, Yogesh Rawat, Andrey Sakryukin, Mubarak Shah, Mohan Kankanhalli

We demonstrate the effectiveness of the proposed model on two different large-scale and publicly available datasets, YFCC100M and NUS-WIDE.

General Classification Image Classification +2

Paper
Code

Hierarchically Fair Federated Learning

no code implementations • 22 Apr 2020 • Jingfeng Zhang, Cheng Li, Antonio Robles-Kelly, Mohan Kankanhalli

When the federated learning is adopted among competitive agents with siloed datasets, agents are self-interested and participate only if they are fairly rewarded.

Fairness Federated Learning +1

Paper
Add Code

Robust Federated Recommendation System

no code implementations • 15 Jun 2020 • Chen Chen, Jingfeng Zhang, Anthony K. H. Tung, Mohan Kankanhalli, Gang Chen

We argue that the key to Byzantine detection is monitoring of gradients of the model parameters of clients.

Recommendation Systems

Paper
Add Code

Learning Realistic Patterns from Unrealistic Stimuli: Generalization and Data Anonymization

1 code implementation • 21 Sep 2020 • Konstantinos Nikolaidis, Stein Kristiansen, Thomas Plagemann, Vera Goebel, Knut Liestøl, Mohan Kankanhalli, Gunn Marit Traaen, Britt Øverland, Harriet Akre, Lars Aakerøy, Sigurd Steinshamn

We use sleep monitoring data from both an open and a large closed clinical study and evaluate whether (1) end-users can create and successfully use customized classification models for sleep apnea detection, and (2) the identity of participants in the study is protected.

Sleep apnea detection

Paper
Code

My Health Sensor, my Classifier: Adapting a Trained Classifier to Unlabeled End-User Data

no code implementations • 22 Sep 2020 • Konstantinos Nikolaidis, Stein Kristiansen, Thomas Plagemann, Vera Goebel, Knut Liestøl, Mohan Kankanhalli, Gunn Marit Traaen, Britt Øverland, Harriet Akre, Lars Aakerøy, Sigurd Steinshamn

In this work, we present an approach for unsupervised domain adaptation (DA) with the constraint, that the labeled source data are not directly available, and instead only access to a classifier trained on the source data is provided.

Sleep apnea detection Unsupervised Domain Adaptation

Paper
Add Code

Using Under-trained Deep Ensembles to Learn Under Extreme Label Noise

no code implementations • 23 Sep 2020 • Konstantinos Nikolaidis, Thomas Plagemann, Stein Kristiansen, Vera Goebel, Mohan Kankanhalli

A new model is trained with these labels to generalize reliably despite the label noise.

Sleep apnea detection

Paper
Add Code

Geometry-aware Instance-reweighted Adversarial Training

2 code implementations • ICLR 2021 • Jingfeng Zhang, Jianing Zhu, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli

The belief was challenged by recent studies where we can maintain the robustness and improve the accuracy.

Paper
Code

Unsupervised Spatial-spectral Network Learning for Hyperspectral Compressive Snapshot Reconstruction

no code implementations • 18 Dec 2020 • Yubao Sun, Ying Yang, Qingshan Liu, Mohan Kankanhalli

Hyperspectral compressive imaging takes advantage of compressive sensing theory to achieve coded aperture snapshot measurement without temporal scanning, and the entire three-dimensional spatial-spectral data is captured by a two-dimensional projection during a single integration period.

Compressive Sensing

Paper
Add Code

Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions

no code implementations • ICCV 2021 • Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan Kankanhalli, Xiangdong Li, Weidong Geng

In this work, we take a step towards training robust models for cross-domain pose estimation task, which brings together ideas from causal representation learning and generative adversarial networks.

3D Pose Estimation counterfactual +3

Paper
Add Code

Understanding the Interaction of Adversarial Training with Noisy Labels

no code implementations • 6 Feb 2021 • Jianing Zhu, Jingfeng Zhang, Bo Han, Tongliang Liu, Gang Niu, Hongxia Yang, Mohan Kankanhalli, Masashi Sugiyama

A recent adversarial training (AT) study showed that the number of projected gradient descent (PGD) steps to successfully attack a point (i. e., find an adversarial example in its proximity) is an effective measure of the robustness of this point.

Paper
Add Code

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos

1 code implementation • CVPR 2021 • Hehe Fan, Yi Yang, Mohan Kankanhalli

To capture the dynamics in point cloud videos, point tracking is usually employed.

Ranked #4 on 3D Action Recognition on NTU RGB+D

3D Action Recognition Point Tracking +1

154

Paper
Code

Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition

1 code implementation • 10 Aug 2021 • Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli

The concept module generates semantically meaningful features for primitive concepts, whereas the visual module extracts visual features for attributes and objects from input images.

Attribute Blocking +2

Paper
Code

Fair Representation: Guaranteeing Approximate Multiple Group Fairness for Unknown Tasks

1 code implementation • 1 Sep 2021 • Xudong Shen, Yongkang Wong, Mohan Kankanhalli

Motivated by scenarios where data is used for diverse prediction tasks, we study whether fair representation can be used to guarantee fairness for unknown tasks and for multiple fairness notions simultaneously.

Fairness valid

Paper
Code

Unsupervised Abstract Reasoning for Raven's Problem Matrices

1 code implementation • 21 Sep 2021 • Tao Zhuo, Qiang Huang, Mohan Kankanhalli

Raven's Progressive Matrices (RPM) is highly correlated with human intelligence, and it has been widely used to measure the abstract reasoning ability of humans.

Paper
Code

Fast Yet Effective Machine Unlearning

1 code implementation • 17 Nov 2021 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Mohan Kankanhalli

In the impair step, the noise matrix along with a very high learning rate is used to induce sharp unlearning in the model.

Machine Unlearning

Paper
Code

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation With Reliable Voted Pseudo Labels

no code implementations • CVPR 2022 • Hehe Fan, Xiaojun Chang, Wanyue Zhang, Yi Cheng, Ying Sun, Mohan Kankanhalli

In this paper, we propose an unsupervised domain adaptation method for deep point cloud representation learning.

Representation Learning Unsupervised Domain Adaptation

Paper
Add Code

Zero-Shot Machine Unlearning

1 code implementation • 14 Jan 2022 • Vikram S Chundawat, Ayush K Tarun, Murari Mandal, Mohan Kankanhalli

In case of machine learning (ML) applications, this necessitates deletion of data not only from storage archives but also from ML models.

Machine Unlearning Transfer Learning

Paper
Code

Learning to Predict Gradients for Semi-Supervised Continual Learning

1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.

Continual Learning

Paper
Code

Adversarial Attack and Defense for Non-Parametric Two-Sample Tests

1 code implementation • 7 Feb 2022 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

Furthermore, we theoretically find that the adversary can also degrade the lower bound of a TST's test power, which enables us to iteratively minimize the test criterion in order to search for adversarial pairs.

Adversarial Attack Vocal Bursts Valence Prediction

Paper
Code

On Modality Bias Recognition and Reduction

1 code implementation • 25 Feb 2022 • Yangyang Guo, Liqiang Nie, Harry Cheng, Zhiyong Cheng, Mohan Kankanhalli, Alberto del Bimbo

From the results on four datasets regarding the above three tasks, our method yields remarkable performance improvements compared with the baselines, demonstrating its superiority on reducing the modality bias problem.

Action Recognition Multi-modal Classification +3

Paper
Code

Joint Answering and Explanation for Visual Commonsense Reasoning

1 code implementation • 25 Feb 2022 • Zhenyang Li, Yangyang Guo, Kejie Wang, Yinwei Wei, Liqiang Nie, Mohan Kankanhalli

Given that our framework is model-agnostic, we apply it to the existing popular baselines and validate its effectiveness on the benchmark dataset.

Knowledge Distillation Question Answering +2

Paper
Code

Disentangled Multimodal Representation Learning for Recommendation

1 code implementation • 10 Mar 2022 • Fan Liu, Huilin Chen, Zhiyong Cheng, AnAn Liu, Liqiang Nie, Mohan Kankanhalli

However, existing methods ignore the fact that different modalities contribute differently towards a user's preference on various factors of an item.

Recommendation Systems Representation Learning

Paper
Code

Can Bad Teaching Induce Forgetting? Unlearning in Deep Networks using an Incompetent Teacher

1 code implementation • 17 May 2022 • Vikram S Chundawat, Ayush K Tarun, Murari Mandal, Mohan Kankanhalli

It facilitates the provision for removal of certain set or class of data from an already trained ML model without requiring retraining from scratch.

Machine Unlearning

Paper
Code

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

1 code implementation • ICLR 2021 • Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli

Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.

Ranked #3 on 3D Action Recognition on NTU RGB+D

3D Action Recognition Semantic Segmentation

Paper
Code

Effective Abstract Reasoning with Dual-Contrast Network

1 code implementation • ICLR 2021 • Tao Zhuo, Mohan Kankanhalli

As a step towards improving the abstract reasoning capability of machines, we aim to solve Raven's Progressive Matrices (RPM) with neural networks, since solving RPM puzzles is highly correlated with human intelligence.

Paper
Code

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

1 code implementation • 30 Jun 2022 • Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli

On the other hand, pertaining to the implicit knowledge, the multi-modal implicit knowledge for knowledge-based VQA still remains largely unexplored.

Question Answering Retrieval +1

Paper
Code

Distance Matters in Human-Object Interaction Detection

1 code implementation • 5 Jul 2022 • Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli

2) Insufficient number of distant interactions in benchmark datasets results in under-fitting on these instances.

Human-Object Interaction Detection Object +1

Paper
Code

Chairs Can be Stood on: Overcoming Object Bias in Human-Object Interaction Detection

1 code implementation • 6 Jul 2022 • Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli

To quantitatively study the object bias problem, we advocate a new protocol for evaluating model performance.

Human-Object Interaction Detection Object +2

Paper
Code

Privacy-Preserving Synthetic Data Generation for Recommendation Systems

1 code implementation • 27 Sep 2022 • Fan Liu, Zhiyong Cheng, Huilin Chen, Yinwei Wei, Liqiang Nie, Mohan Kankanhalli

At the item level, a synthetic data generation module is proposed to generate a synthetic item corresponding to the selected item based on the user's preferences.

Privacy Preserving Recommendation Systems +1

Paper
Code

Deep Regression Unlearning

1 code implementation • 15 Oct 2022 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Mohan Kankanhalli

In the last few years, there have been notable developments in machine unlearning to remove the information of certain training data efficiently and effectively from ML models.

Inference Attack Machine Unlearning +2

Paper
Code

PointListNet: Deep Learning on 3D Point Lists

no code implementations • CVPR 2023 • Hehe Fan, Linchao Zhu, Yi Yang, Mohan Kankanhalli

Deep neural networks on regular 1D lists (e. g., natural languages) and irregular 3D sets (e. g., point clouds) have made tremendous achievements.

Paper
Add Code

Text to Point Cloud Localization with Relation-Enhanced Transformer

no code implementations • 13 Jan 2023 • Guangzhi Wang, Hehe Fan, Mohan Kankanhalli

To overcome these two challenges, we propose a unified Relation-Enhanced Transformer (RET) to improve representation discriminability for both point cloud and natural language queries.

Natural Language Queries Relation

Paper
Add Code

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

no code implementations • 4 Feb 2023 • Zhenyang Li, Yangyang Guo, Kejie Wang, Fan Liu, Liqiang Nie, Mohan Kankanhalli

Visual Commonsense Reasoning (VCR) remains a significant yet challenging research problem in the realm of visual reasoning.

Visual Commonsense Reasoning

Paper
Add Code

Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection

1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

Adversarial contrastive learning (ACL) does not require expensive data annotations but outputs a robust representation that withstands adversarial attacks and also generalizes to a wide range of downstream tasks.

Contrastive Learning

Paper
Code

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization

1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

To improve transferability, the existing work introduced the standard invariant regularization (SIR) to impose style-independence property to SCL, which can exempt the impact of nuisance style factors in the standard representation.

Contrastive Learning

Paper
Code

DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment

1 code implementation • CVPR 2023 • Heyuan Li, Bo wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan

Thanks to the proposed fusion module, our method is robust not only to occlusion and large pitch and roll view angles, which is the benefit of our image space approach, but also to noise and large yaw angles, which is the benefit of our model space method.

Ranked #1 on 3D Face Reconstruction on AFLW2000-3D (Mean NME metric)

3D Face Reconstruction Face Alignment +1

Paper
Code

What Makes for Good Visual Tokenizers for Large Language Models?

1 code implementation • 20 May 2023 • Guangzhi Wang, Yixiao Ge, Xiaohan Ding, Mohan Kankanhalli, Ying Shan

In our benchmark, which is curated to evaluate MLLMs visual semantic understanding and fine-grained perception capabilities, we discussed different visual tokenizers pre-trained with dominant methods (i. e., DeiT, CLIP, MAE, DINO), and observe that: i) Fully/weakly supervised models capture more semantics than self-supervised models, but the gap is narrowed by scaling up the pre-training dataset.

Image Captioning Object Counting +2

Paper
Code

Continual Learning with Strong Experience Replay

1 code implementation • 23 May 2023 • Tao Zhuo, Zhiyong Cheng, Zan Gao, Hehe Fan, Mohan Kankanhalli

Experience Replay (ER) is a simple and effective rehearsal-based strategy, which optimizes the model with current training data and a subset of old samples stored in a memory buffer.

Continual Learning Image Classification

Paper
Code

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023

no code implementations • 13 Jul 2023 • Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli

Our research focuses on the innovative application of a differentiable logic loss in the training to leverage the co-occurrence relations between verb and noun, as well as the pre-trained Large Language Models (LLMs) to generate the logic rules for the adaptation to unseen action labels.

Action Recognition Unsupervised Domain Adaptation

Paper
Add Code

Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection

no code implementations • 19 Jul 2023 • Guangzhi Wang, Yangyang Guo, Mohan Kankanhalli

Human-Object Interaction Detection is a crucial aspect of human-centric scene understanding, with important applications in various domains.

Human-Object Interaction Detection Object +1

Paper
Add Code

Towards Generalizable Deepfake Detection by Primary Region Regularization

no code implementations • 24 Jul 2023 • Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli

The existing deepfake detection methods have reached a bottleneck in generalizing to unseen forgeries and manipulation approaches.

DeepFake Detection Face Swapping

Paper
Add Code

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering

no code implementations • 25 Jul 2023 • Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan Kankanhalli, Joo-Hwee Lim

The main challenge in video question answering (VideoQA) is to capture and understand the complex spatial and temporal relations between objects based on given questions.

graph construction Question Answering +2

Paper
Add Code

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration

1 code implementation • 27 Jul 2023 • Harry Cheng, Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Mohan Kankanhalli

Training an effective video action recognition model poses significant computational challenges, particularly under limited resource budgets.

Action Recognition Temporal Action Localization

Paper
Code

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation

no code implementations • 31 Jul 2023 • Yue Zhang, Hehe Fan, Yi Yang, Mohan Kankanhalli

The proposed method, named Mixture of Depth and Point cloud video experts (DPMix), achieved the first place in the 4D Action Segmentation Track of the HOI4D Challenge 2023.

Action Segmentation Human-Object Interaction Detection +2

Paper
Add Code

Semantic-Guided Feature Distillation for Multimodal Recommendation

1 code implementation • 6 Aug 2023 • Fan Liu, Huilin Chen, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli

The teacher model first extracts rich modality features from the generic modality feature by considering both the semantic information of items and the complementary information of multiple modalities.

Multimodal Recommendation Representation Learning

Paper
Code

Distill to Delete: Unlearning in Graph Networks with Knowledge Distillation

no code implementations • 28 Sep 2023 • Yash Sinha, Murari Mandal, Mohan Kankanhalli

Our work takes a novel approach to address these challenges in graph unlearning through knowledge distillation, as it distills to delete in GNN (D2DGN).

Knowledge Distillation

Paper
Add Code

ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens

1 code implementation • 28 Sep 2023 • Yangyang Guo, Haoyu Zhang, Yongkang Wong, Liqiang Nie, Mohan Kankanhalli

Learning a versatile language-image model is computationally prohibitive under a limited computing budget.

Cross-Modal Retrieval Image Captioning +1

Paper
Code

AutoLoRa: A Parameter-Free Automated Robust Fine-Tuning Framework

no code implementations • 3 Oct 2023 • Xilie Xu, Jingfeng Zhang, Mohan Kankanhalli

To mitigate this issue, we propose a low-rank (LoRa) branch that disentangles RFT into two distinct components: optimizing natural objectives via the LoRa branch and adversarial objectives via the FE.

Adversarial Robustness Scheduling

Paper
Add Code

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

1 code implementation • 16 Oct 2023 • Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

This allows for direct and efficient utilization of the low-rank model for downstream fine-tuning tasks.

Paper
Code

Prior-Free Continual Learning with Unlabeled Data in the Wild

1 code implementation • 16 Oct 2023 • Tao Zhuo, Zhiyong Cheng, Hehe Fan, Mohan Kankanhalli

Existing CL methods usually reduce forgetting with task priors, \ie using task identity or a subset of previously seen samples for model training.

Continual Learning Image Classification

Paper
Code

UNK-VQA: A Dataset and a Probe into the Abstention Ability of Multi-modal Large Models

1 code implementation • 17 Oct 2023 • Yangyang Guo, Fangkai Jiao, Zhiqi Shen, Liqiang Nie, Mohan Kankanhalli

Teaching Visual Question Answering (VQA) models to refrain from answering unanswerable questions is necessary for building a trustworthy AI system.

Attribute Question Answering +1

Paper
Code

Image-Based Virtual Try-On: A Survey

1 code implementation • 8 Nov 2023 • Dan Song, Xuanpu Zhang, Juan Zhou, Weizhi Nie, Ruofeng Tong, Mohan Kankanhalli, An-An Liu

In this survey, we provide a comprehensive analysis of the state-of-the-art techniques and methodologies in aspects of pipeline architecture, person representation and key modules such as try-on indication, clothing warping and try-on stage.

Image Generation Virtual Try-on

Paper
Code

Finetuning Text-to-Image Diffusion Models for Fairness

1 code implementation • 11 Nov 2023 • Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli

The rapid adoption of text-to-image diffusion models in society underscores an urgent need to address their biases.

Fairness

Paper
Code

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models

no code implementations • 26 Nov 2023 • Yu-Wei Zhan, Fan Liu, Xin Luo, Liqiang Nie, Xin-Shun Xu, Mohan Kankanhalli

To capitalize on these rich Human-Centric Visual Cues, we propose a novel approach named HCVC for HOI detection.

Human-Object Interaction Detection

Paper
Add Code

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation

no code implementations • 22 Dec 2023 • Zhenyang Li, Fan Liu, Yinwei Wei, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli

To obtain robust and independent representations for each factor associated with a specific attribute, we first disentangle the representations of features both within and across different modalities.

Attribute Multimodal Recommendation +1

Paper
Add Code

Understanding Before Recommendation: Semantic Aspect-Aware Review Exploitation via Large Language Models

no code implementations • 26 Dec 2023 • Fan Liu, Yaqi Liu, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli

Recommendation systems harness user-item interactions like clicks and reviews to learn their representations.

Recommendation Systems

Paper
Add Code

Hallucination is Inevitable: An Innate Limitation of Large Language Models

no code implementations • 22 Jan 2024 • Ziwei Xu, Sanjay Jain, Mohan Kankanhalli

Since the formal world is a part of the real world which is much more complicated, hallucinations are also inevitable for real world LLMs.

Hallucination Learning Theory

Paper
Add Code

Diffusion Facial Forgery Detection

1 code implementation • 29 Jan 2024 • Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli

In particular, this dataset leverages 30, 000 carefully collected textual and visual prompts, ensuring the synthesis of images with both high fidelity and semantic consistency.

Image Generation

Paper
Code

EcoVal: An Efficient Data Valuation Framework for Machine Learning

no code implementations • 14 Feb 2024 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Hong Ming Tan, Bowei Chen, Mohan Kankanhalli

In this paper, we introduce an efficient data valuation framework EcoVal, to estimate the value of data for machine learning models in a fast and practical manner.

Data Valuation

Paper
Add Code

How to Understand Named Entities: Using Common Sense for News Captioning

no code implementations • 11 Mar 2024 • Ning Xu, Yanhui Wang, Tingting Zhang, Hongshuo Tian, Mohan Kankanhalli, An-An Liu

Our approach consists of three modules: (a) Filter Module aims to clarify the common sense concerning a named entity from two aspects: what does it mean?

Common Sense Reasoning

Paper
Add Code

S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

no code implementations • 11 Apr 2024 • Guangzhi Wang, Tianyi Chen, Kamran Ghasedi, HsiangTao Wu, Tianyu Ding, Chris Nuesmeyer, Ilya Zharkov, Mohan Kankanhalli, Luming Liang

S3Editor is model-agnostic and compatible with various editing approaches.

Attribute Video Editing

Paper
Add Code

Cluster-based Graph Collaborative Filtering

1 code implementation • 16 Apr 2024 • Fan Liu, Shuai Zhao, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli

This model performs high-order graph convolution on cluster-specific graphs, which are constructed by capturing the multiple interests of users and identifying the common interests among them.

Clustering Collaborative Filtering +3

Paper
Code

Identifying Worry in Twitter: Beyond Emotion Analysis

no code implementations • EMNLP (NLP+CSS) 2020 • Reyha Verma, Christian von der Weth, Jithin Vachery, Mohan Kankanhalli

Identifying the worries of individuals and societies plays a crucial role in providing social support and enhancing policy decision-making.

Decision Making Emotion Recognition

Paper
Add Code

Inferring DQN structure for high-dimensional continuous control

no code implementations • ICML 2020 • Andrey Sakryukin, Chedy Raissi, Mohan Kankanhalli

We propose a novel approach to infer the network structure for DQN models operating with high-dimensional continuous actions.

Continuous Control Vocal Bursts Intensity Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.