no code implementations • EMNLP (NLP+CSS) 2020 • Reyha Verma, Christian von der Weth, Jithin Vachery, Mohan Kankanhalli
Identifying the worries of individuals and societies plays a crucial role in providing social support and enhancing policy decision-making.
no code implementations • ICML 2020 • Andrey Sakryukin, Chedy Raissi, Mohan Kankanhalli
We propose a novel approach to infer the network structure for DQN models operating with high-dimensional continuous actions.
1 code implementation • 25 Nov 2024 • Wey Yeh Choong, Yangyang Guo, Mohan Kankanhalli
Through our benchmark, we aim to inspire further research on 1) holistic understanding of VLLM capabilities, particularly regarding hallucination, and 2) extensive development of advanced VLLMs to alleviate this problem.
no code implementations • 20 Nov 2024 • Ziyang Luo, HaoNing Wu, Dongxu Li, Jing Ma, Mohan Kankanhalli, Junnan Li
To further streamline our evaluation, we introduce VideoAutoBench as an auxiliary benchmark, where human annotators label winners in a subset of VideoAutoArena battles.
no code implementations • 19 Nov 2024 • Haoyu Zhang, Yangyang Guo, Mohan Kankanhalli
Additionally, we advocate a new evaluation protocol that can 1) holistically quantify the model debiasing and V-L alignment ability, and 2) evaluate the generalization of social bias removal models.
1 code implementation • 14 Nov 2024 • Yangyang Guo, Mohan Kankanhalli
In particular, we individually pre-train seven CLIP models on two large-scale image-text pair datasets, and two MoCo models on the ImageNet dataset, resulting in a total of 16 pre-trained models.
no code implementations • 13 Nov 2024 • Yangyang Guo, Fangkai Jiao, Liqiang Nie, Mohan Kankanhalli
The vulnerability of Vision Large Language Models (VLLMs) to jailbreak attacks appears as no surprise.
no code implementations • 22 Oct 2024 • Yash Sinha, Murari Mandal, Mohan Kankanhalli
The key components of machine learning are data samples for training, model for learning patterns, and loss function for optimizing accuracy.
no code implementations • 3 Oct 2024 • Ziwei Xu, Mohan Kankanhalli
A key component of value alignment is the modeling of human preferences as a representation of human values.
1 code implementation • 7 Jun 2024 • Zenghao Chai, Chen Tang, Yongkang Wong, Mohan Kankanhalli
The creation of 4D avatars (i. e., animated 3D avatars) from text description typically uses text-to-image (T2I) diffusion models to synthesize 3D avatars in the canonical space and subsequently applies animation with target motions.
no code implementations • 27 May 2024 • Zhenyang Li, Yangyang Guo, Kejie Wang, Xiaolin Chen, Liqiang Nie, Mohan Kankanhalli
Visual Commonsense Reasoning (VCR) calls for explanatory reasoning behind question answering over visual scenes.
no code implementations • 24 May 2024 • Yash Sinha, Murari Mandal, Mohan Kankanhalli
This is particularly true in case of multi-modal recommender systems (MMRS), which aim to accommodate the growing influence of multi-modal information on user preferences.
1 code implementation • 22 May 2024 • Wei Li, Hehe Fan, Yongkang Wong, Mohan Kankanhalli, Yi Yang
Recent advancements in image understanding have benefited from the extensive use of web image-text pairs.
no code implementations • 21 May 2024 • Yi Cheng, Ziwei Xu, Dongyun Lin, Harry Cheng, Yongkang Wong, Ying Sun, Joo Hwee Lim, Mohan Kankanhalli
To address these challenges, we propose a knowledge-enhanced iterative refinement framework for visual content generation.
1 code implementation • 19 Apr 2024 • Zeyu Ling, Bo Han, Yongkang Wongkan, Han Lin, Mohan Kankanhalli, Weidong Geng
Conditional human motion synthesis (HMS) aims to generate human motion sequences that conform to specific conditions.
Ranked #5 on Motion Synthesis on HumanML3D
1 code implementation • 16 Apr 2024 • Fan Liu, Shuai Zhao, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli
This model performs high-order graph convolution on cluster-specific graphs, which are constructed by capturing the multiple interests of users and identifying the common interests among them.
no code implementations • 11 Apr 2024 • Guangzhi Wang, Tianyi Chen, Kamran Ghasedi, HsiangTao Wu, Tianyu Ding, Chris Nuesmeyer, Ilya Zharkov, Mohan Kankanhalli, Luming Liang
S3Editor is model-agnostic and compatible with various editing approaches.
no code implementations • 11 Mar 2024 • Ning Xu, Yanhui Wang, Tingting Zhang, Hongshuo Tian, Mohan Kankanhalli, An-An Liu
Our approach consists of three modules: (a) Filter Module aims to clarify the common sense concerning a named entity from two aspects: what does it mean?
no code implementations • 14 Feb 2024 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Hong Ming Tan, Bowei Chen, Mohan Kankanhalli
In this paper, we introduce an efficient data valuation framework EcoVal, to estimate the value of data for machine learning models in a fast and practical manner.
1 code implementation • 29 Jan 2024 • Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli
In particular, this dataset leverages 30, 000 carefully collected textual and visual prompts, ensuring the synthesis of images with both high fidelity and semantic consistency.
no code implementations • 22 Jan 2024 • Ziwei Xu, Sanjay Jain, Mohan Kankanhalli
Since the formal world is a part of the real world which is much more complicated, hallucinations are also inevitable for real world LLMs.
no code implementations • CVPR 2024 • Guangzhi Wang, Yangyang Guo, Ziwei Xu, Mohan Kankanhalli
Human-Object Interaction (HOI) Detection constitutes an important aspect of human-centric scene understanding which requires precise object detection and interaction recognition.
no code implementations • 26 Dec 2023 • Fan Liu, Yaqi Liu, Huilin Chen, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli
Recommendation systems harness user-item interactions like clicks and reviews to learn their representations.
no code implementations • 22 Dec 2023 • Zhenyang Li, Fan Liu, Yinwei Wei, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli
To obtain robust and independent representations for each factor associated with a specific attribute, we first disentangle the representations of features both within and across different modalities.
no code implementations • 26 Nov 2023 • Yu-Wei Zhan, Fan Liu, Xin Luo, Liqiang Nie, Xin-Shun Xu, Mohan Kankanhalli
To capitalize on these rich Human-Centric Visual Cues, we propose a novel approach named HCVC for HOI detection.
1 code implementation • 11 Nov 2023 • Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli
The rapid adoption of text-to-image diffusion models in society underscores an urgent need to address their biases.
1 code implementation • 8 Nov 2023 • Dan Song, Xuanpu Zhang, Juan Zhou, Weizhi Nie, Ruofeng Tong, Mohan Kankanhalli, An-An Liu
Image-based virtual try-on aims to synthesize a naturally dressed person image with a clothing image, which revolutionizes online shopping and inspires related topics within image generation, showing both research significance and commercial potential.
1 code implementation • 17 Oct 2023 • Yangyang Guo, Fangkai Jiao, Zhiqi Shen, Liqiang Nie, Mohan Kankanhalli
Teaching Visual Question Answering (VQA) models to refrain from answering unanswerable questions is necessary for building a trustworthy AI system.
1 code implementation • CVPR 2024 • Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli
This allows for direct and efficient utilization of the low-rank model for downstream fine-tuning tasks.
1 code implementation • 16 Oct 2023 • Tao Zhuo, Zhiyong Cheng, Hehe Fan, Mohan Kankanhalli
Existing CL methods usually reduce forgetting with task priors, \ie using task identity or a subset of previously seen samples for model training.
no code implementations • 3 Oct 2023 • Xilie Xu, Jingfeng Zhang, Mohan Kankanhalli
To mitigate this issue, we propose a low-rank (LoRa) branch that disentangles RFT into two distinct components: optimizing natural objectives via the LoRa branch and adversarial objectives via the FE.
no code implementations • 28 Sep 2023 • Yash Sinha, Murari Mandal, Mohan Kankanhalli
Our work takes a novel approach to address these challenges in graph unlearning through knowledge distillation, as it distills to delete in GNN (D2DGN).
1 code implementation • 28 Sep 2023 • Yangyang Guo, Haoyu Zhang, Yongkang Wong, Liqiang Nie, Mohan Kankanhalli
Learning a versatile language-image model is computationally prohibitive under a limited computing budget.
1 code implementation • 6 Aug 2023 • Fan Liu, Huilin Chen, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli
The teacher model first extracts rich modality features from the generic modality feature by considering both the semantic information of items and the complementary information of multiple modalities.
no code implementations • 31 Jul 2023 • Yue Zhang, Hehe Fan, Yi Yang, Mohan Kankanhalli
The proposed method, named Mixture of Depth and Point cloud video experts (DPMix), achieved the first place in the 4D Action Segmentation Track of the HOI4D Challenge 2023.
1 code implementation • 27 Jul 2023 • Harry Cheng, Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Mohan Kankanhalli
Training an effective video action recognition model poses significant computational challenges, particularly under limited resource budgets.
no code implementations • 25 Jul 2023 • Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan Kankanhalli, Joo-Hwee Lim
The main challenge in video question answering (VideoQA) is to capture and understand the complex spatial and temporal relations between objects based on given questions.
no code implementations • 24 Jul 2023 • Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli
The existing deepfake detection methods have reached a bottleneck in generalizing to unseen forgeries and manipulation approaches.
no code implementations • 19 Jul 2023 • Guangzhi Wang, Yangyang Guo, Mohan Kankanhalli
Human-Object Interaction Detection is a crucial aspect of human-centric scene understanding, with important applications in various domains.
no code implementations • 13 Jul 2023 • Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli
Our research focuses on the innovative application of a differentiable logic loss in the training to leverage the co-occurrence relations between verb and noun, as well as the pre-trained Large Language Models (LLMs) to generate the logic rules for the adaptation to unseen action labels.
1 code implementation • 23 May 2023 • Tao Zhuo, Zhiyong Cheng, Zan Gao, Hehe Fan, Mohan Kankanhalli
Experience Replay (ER) is a simple and effective rehearsal-based strategy, which optimizes the model with current training data and a subset of old samples stored in a memory buffer.
1 code implementation • 20 May 2023 • Guangzhi Wang, Yixiao Ge, Xiaohan Ding, Mohan Kankanhalli, Ying Shan
In our benchmark, which is curated to evaluate MLLMs visual semantic understanding and fine-grained perception capabilities, we discussed different visual tokenizers pre-trained with dominant methods (i. e., DeiT, CLIP, MAE, DINO), and observe that: i) Fully/weakly supervised models capture more semantics than self-supervised models, but the gap is narrowed by scaling up the pre-training dataset.
1 code implementation • CVPR 2023 • Heyuan Li, Bo wang, Yu Cheng, Mohan Kankanhalli, Robby T. Tan
Thanks to the proposed fusion module, our method is robust not only to occlusion and large pitch and roll view angles, which is the benefit of our image space approach, but also to noise and large yaw angles, which is the benefit of our model space method.
Ranked #1 on 3D Face Reconstruction on AFLW2000-3D (Mean NME metric)
1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli
To improve transferability, the existing work introduced the standard invariant regularization (SIR) to impose style-independence property to SCL, which can exempt the impact of nuisance style factors in the standard representation.
1 code implementation • NeurIPS 2023 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli
Adversarial contrastive learning (ACL) does not require expensive data annotations but outputs a robust representation that withstands adversarial attacks and also generalizes to a wide range of downstream tasks.
no code implementations • 4 Feb 2023 • Zhenyang Li, Yangyang Guo, Kejie Wang, Fan Liu, Liqiang Nie, Mohan Kankanhalli
Visual Commonsense Reasoning (VCR) remains a significant yet challenging research problem in the realm of visual reasoning.
no code implementations • 13 Jan 2023 • Guangzhi Wang, Hehe Fan, Mohan Kankanhalli
To overcome these two challenges, we propose a unified Relation-Enhanced Transformer (RET) to improve representation discriminability for both point cloud and natural language queries.
no code implementations • CVPR 2023 • Hehe Fan, Linchao Zhu, Yi Yang, Mohan Kankanhalli
Deep neural networks on regular 1D lists (e. g., natural languages) and irregular 3D sets (e. g., point clouds) have made tremendous achievements.
1 code implementation • 15 Oct 2022 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Mohan Kankanhalli
In the last few years, there have been notable developments in machine unlearning to remove the information of certain training data efficiently and effectively from ML models.
1 code implementation • 27 Sep 2022 • Fan Liu, Zhiyong Cheng, Huilin Chen, Yinwei Wei, Liqiang Nie, Mohan Kankanhalli
At the item level, a synthetic data generation module is proposed to generate a synthetic item corresponding to the selected item based on the user's preferences.
1 code implementation • 6 Jul 2022 • Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli
To quantitatively study the object bias problem, we advocate a new protocol for evaluating model performance.
1 code implementation • 5 Jul 2022 • Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli
2) Insufficient number of distant interactions in benchmark datasets results in under-fitting on these instances.
1 code implementation • 30 Jun 2022 • Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli
On the other hand, pertaining to the implicit knowledge, the multi-modal implicit knowledge for knowledge-based VQA still remains largely unexplored.
1 code implementation • ICLR 2021 • Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli
Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.
Ranked #3 on 3D Action Recognition on NTU RGB+D
1 code implementation • ICLR 2021 • Tao Zhuo, Mohan Kankanhalli
As a step towards improving the abstract reasoning capability of machines, we aim to solve Raven's Progressive Matrices (RPM) with neural networks, since solving RPM puzzles is highly correlated with human intelligence.
1 code implementation • 17 May 2022 • Vikram S Chundawat, Ayush K Tarun, Murari Mandal, Mohan Kankanhalli
It facilitates the provision for removal of certain set or class of data from an already trained ML model without requiring retraining from scratch.
1 code implementation • 10 Mar 2022 • Fan Liu, Huilin Chen, Zhiyong Cheng, AnAn Liu, Liqiang Nie, Mohan Kankanhalli
However, existing methods ignore the fact that different modalities contribute differently towards a user's preference on various factors of an item.
1 code implementation • 25 Feb 2022 • Zhenyang Li, Yangyang Guo, Kejie Wang, Yinwei Wei, Liqiang Nie, Mohan Kankanhalli
Given that our framework is model-agnostic, we apply it to the existing popular baselines and validate its effectiveness on the benchmark dataset.
1 code implementation • 25 Feb 2022 • Yangyang Guo, Liqiang Nie, Harry Cheng, Zhiyong Cheng, Mohan Kankanhalli, Alberto del Bimbo
From the results on four datasets regarding the above three tasks, our method yields remarkable performance improvements compared with the baselines, demonstrating its superiority on reducing the modality bias problem.
1 code implementation • 7 Feb 2022 • Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli
Furthermore, we theoretically find that the adversary can also degrade the lower bound of a TST's test power, which enables us to iteratively minimize the test criterion in order to search for adversarial pairs.
1 code implementation • 23 Jan 2022 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao
To explore these issues, we formulate a new semi-supervised continual learning method, which can be generically applied to existing continual learning models.
1 code implementation • 14 Jan 2022 • Vikram S Chundawat, Ayush K Tarun, Murari Mandal, Mohan Kankanhalli
In case of machine learning (ML) applications, this necessitates deletion of data not only from storage archives but also from ML models.
no code implementations • CVPR 2022 • Hehe Fan, Xiaojun Chang, Wanyue Zhang, Yi Cheng, Ying Sun, Mohan Kankanhalli
In this paper, we propose an unsupervised domain adaptation method for deep point cloud representation learning.
1 code implementation • 17 Nov 2021 • Ayush K Tarun, Vikram S Chundawat, Murari Mandal, Mohan Kankanhalli
In the impair step, the noise matrix along with a very high learning rate is used to induce sharp unlearning in the model.
1 code implementation • 21 Sep 2021 • Tao Zhuo, Qiang Huang, Mohan Kankanhalli
Raven's Progressive Matrices (RPM) is highly correlated with human intelligence, and it has been widely used to measure the abstract reasoning ability of humans.
1 code implementation • 1 Sep 2021 • Xudong Shen, Yongkang Wong, Mohan Kankanhalli
Motivated by scenarios where data is used for diverse prediction tasks, we study whether fair representation can be used to guarantee fairness for unknown tasks and for multiple fairness notions simultaneously.
1 code implementation • 10 Aug 2021 • Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli
The concept module generates semantically meaningful features for primitive concepts, whereas the visual module extracts visual features for attributes and objects from input images.
1 code implementation • CVPR 2021 • Hehe Fan, Yi Yang, Mohan Kankanhalli
To capture the dynamics in point cloud videos, point tracking is usually employed.
Ranked #4 on 3D Action Recognition on NTU RGB+D
no code implementations • 6 Feb 2021 • Jianing Zhu, Jingfeng Zhang, Bo Han, Tongliang Liu, Gang Niu, Hongxia Yang, Mohan Kankanhalli, Masashi Sugiyama
A recent adversarial training (AT) study showed that the number of projected gradient descent (PGD) steps to successfully attack a point (i. e., find an adversarial example in its proximity) is an effective measure of the robustness of this point.
no code implementations • ICCV 2021 • Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan Kankanhalli, Xiangdong Li, Weidong Geng
In this work, we take a step towards training robust models for cross-domain pose estimation task, which brings together ideas from causal representation learning and generative adversarial networks.
no code implementations • 18 Dec 2020 • Yubao Sun, Ying Yang, Qingshan Liu, Mohan Kankanhalli
Hyperspectral compressive imaging takes advantage of compressive sensing theory to achieve coded aperture snapshot measurement without temporal scanning, and the entire three-dimensional spatial-spectral data is captured by a two-dimensional projection during a single integration period.
2 code implementations • ICLR 2021 • Jingfeng Zhang, Jianing Zhu, Gang Niu, Bo Han, Masashi Sugiyama, Mohan Kankanhalli
The belief was challenged by recent studies where we can maintain the robustness and improve the accuracy.
no code implementations • 23 Sep 2020 • Konstantinos Nikolaidis, Thomas Plagemann, Stein Kristiansen, Vera Goebel, Mohan Kankanhalli
A new model is trained with these labels to generalize reliably despite the label noise.
no code implementations • 22 Sep 2020 • Konstantinos Nikolaidis, Stein Kristiansen, Thomas Plagemann, Vera Goebel, Knut Liestøl, Mohan Kankanhalli, Gunn Marit Traaen, Britt Øverland, Harriet Akre, Lars Aakerøy, Sigurd Steinshamn
In this work, we present an approach for unsupervised domain adaptation (DA) with the constraint, that the labeled source data are not directly available, and instead only access to a classifier trained on the source data is provided.
1 code implementation • 21 Sep 2020 • Konstantinos Nikolaidis, Stein Kristiansen, Thomas Plagemann, Vera Goebel, Knut Liestøl, Mohan Kankanhalli, Gunn Marit Traaen, Britt Øverland, Harriet Akre, Lars Aakerøy, Sigurd Steinshamn
We use sleep monitoring data from both an open and a large closed clinical study and evaluate whether (1) end-users can create and successfully use customized classification models for sleep apnea detection, and (2) the identity of participants in the study is protected.
no code implementations • 15 Jun 2020 • Chen Chen, Jingfeng Zhang, Anthony K. H. Tung, Mohan Kankanhalli, Gang Chen
We argue that the key to Byzantine detection is monitoring of gradients of the model parameters of clients.
no code implementations • 22 Apr 2020 • Jingfeng Zhang, Cheng Li, Antonio Robles-Kelly, Mohan Kankanhalli
When the federated learning is adopted among competitive agents with siloed datasets, agents are self-interested and participate only if they are fairly rewarded.
1 code implementation • 1 Apr 2020 • Erik Quintanilla, Yogesh Rawat, Andrey Sakryukin, Mubarak Shah, Mohan Kankanhalli
We demonstrate the effectiveness of the proposed model on two different large-scale and publicly available datasets, YFCC100M and NUS-WIDE.
1 code implementation • 19 Mar 2020 • Gökhan Yildirim, Debashis Sen, Mohan Kankanhalli, Sabine Süsstrunk
In this paper, we corroborate based on three subjective experiments on a novel image dataset that objects in natural images are inherently perceived to have varying levels of importance.
1 code implementation • ICML 2020 • Jingfeng Zhang, Xilie Xu, Bo Han, Gang Niu, Lizhen Cui, Masashi Sugiyama, Mohan Kankanhalli
Adversarial training based on the minimax formulation is necessary for obtaining adversarial robustness of trained models.
no code implementations • 9 Feb 2020 • Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli
To enable research in this direction, we introduce 360Action, the first omnidirectional video dataset for multi-person action recognition.
no code implementations • 9 Feb 2020 • Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
Therefore, it is important to develop algorithms that can leverage off-the-shelf labeled dataset to learn useful knowledge for the target task.
1 code implementation • 5 Feb 2020 • Tao Zhuo, Mohan Kankanhalli
Based on the design of the pseudo target, MCPT converts the unsupervised learning problem to a supervised task.
no code implementations • 28 Aug 2019 • Tao Zhuo, Zhiyong Cheng, Mohan Kankanhalli
To overcome this limitation, we propose a novel mask transfer network (MTN), which can greatly boost the processing speed of VOS and also achieve a reasonable accuracy.
1 code implementation • 28 Aug 2019 • Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli
Finally, by sequentially examining each state transition in the video graph, our method can detect and explain how those actions are executed with prior knowledge, just like the logical manner of thinking by humans.
1 code implementation • 21 Aug 2019 • Fan Liu, Zhiyong Cheng, Changchang Sun, Yinglong Wang, Liqiang Nie, Mohan Kankanhalli
To tackle this problem, in this paper, we propose a novel Multimodal Attentive Metric Learning (MAML) method to model user diverse preferences for various items.
no code implementations • 22 May 2019 • Konstantinos Nikolaidis, Stein Kristiansen, Vera Goebel, Thomas Plagemann, Knut Liestøl, Mohan Kankanhalli
Supervised machine learning applications in the health domain often face the problem of insufficient training datasets.
1 code implementation • 13 May 2019 • Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Yibing Liu, Yinglong Wang, Mohan Kankanhalli
Benefiting from the advancement of computer vision, natural language processing and information retrieval techniques, visual question answering (VQA), which aims to answer questions about an image or a video, has received lots of attentions over the past few years.
1 code implementation • 8 Apr 2019 • Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao
In addition, analysis of the intra-class compactness and inter-class separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement.
no code implementations • 3 Apr 2019 • Abhinav Shukla, Shruti Shriya Gullapuram, Harish Katti, Mohan Kankanhalli, Stefan Winkler, Ramanathan Subramanian
Advertisements (ads) often contain strong affective content to capture viewer attention and convey an effective message to the audience.
no code implementations • 28 Feb 2019 • Jingfeng Zhang, Bo Han, Laura Wynter, Kian Hsiang Low, Mohan Kankanhalli
Our analytical studies reveal that the step factor h in the Euler method is able to control the robustness of ResNet in both its training and generalization.
1 code implementation • CVPR 2019 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect.
Ranked #26 on Image Classification on Clothing1M (using extra training data)
no code implementations • 12 Nov 2018 • Zhiyong Cheng, Xiaojun Chang, Lei Zhu, Rose C. Kanjirathinkal, Mohan Kankanhalli
Then the aspect importance is integrated into a novel aspect-aware latent factor model (ALFM), which learns user's and item's latent factors based on ratings.
1 code implementation • IEEE Transactions on Image Processing 2019 • Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli
Moreover, our method achieves better performance than the best unsupervised offline algorithm on the DAVIS-2016 benchmark dataset.
no code implementations • 14 Aug 2018 • Abhinav Shukla, Harish Katti, Mohan Kankanhalli, Ramanathan Subramanian
Contrary to the popular notion that ad affect hinges on the narrative and the clever use of linguistic and social cues, we find that actively attended objects and the coarse scene structure better encode affective information as compared to individual scene objects or conspicuous background elements.
no code implementations • 3 Aug 2017 • Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli
However, due to the domain shift problem, the performance of Web images trained deep classifiers tend to degrade when directly deployed to videos.
no code implementations • 21 Jul 2016 • Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan Kankanhalli
To enable the study of this problem, there exist a vast number of action datasets, which are recorded under controlled laboratory settings, real-world surveillance environments, or crawled from the Internet.
1 code implementation • 21 Nov 2015 • Yehong Zhang, Trong Nghia Hoang, Kian Hsiang Low, Mohan Kankanhalli
This paper addresses the problem of active learning of a multi-output Gaussian process (MOGP) model representing multiple types of coexisting correlated environmental phenomena.