1 code implementation • 5 Jan 2025 • Yixiao Chen, Yue Yao, Ruining Yang, Md Zakir Hossain, Ashu Gupta, Tom Gedeon
This article investigates the critical issue of dataset bias in medical imaging, with a particular emphasis on racial disparities caused by uneven population distribution in dataset collection.
no code implementations • 1 Jan 2025 • Md Rakibul Hasan, Yue Yao, Md Zakir Hossain, Aneesh Krishna, Imre Rudas, Shafin Rahman, Tom Gedeon
In this paper, we evaluate this approach in the emerging field of empathy computing -- automating the prediction of psychological questionnaire outcomes from inputs like text sequences.
no code implementations • 9 Dec 2024 • Weijie Tu, Weijian Deng, Dylan Campbell, Yu Yao, Jiyang Zheng, Tom Gedeon, Tongliang Liu
As large multimodal models (LMMs) are increasingly deployed across diverse applications, the need for adaptable, real-world model ranking has become paramount.
no code implementations • 22 Nov 2024 • Huilin Chen, Lei Wang, Yifan Chen, Tom Gedeon, Piotr Koniusz
Capturing the rich temporal evolution of video frames, while preserving their spatial details, is crucial for improving accuracy.
no code implementations • 3 Oct 2024 • Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Wenjia Niu, Sabrina Caldwell, Tom Gedeon, Yang Liu, Zhenyue Qin
Vision Large Language Models (VLLMs) are transforming the intersection of computer vision and natural language processing.
no code implementations • 2 Oct 2024 • Dexuan Ding, Lei Wang, Liyun Zhu, Tom Gedeon, Piotr Koniusz
In computer vision tasks, features often come from diverse representations, domains, and modalities, such as text, images, and videos.
no code implementations • 2 Oct 2024 • Weijie Tu, Weijian Deng, Tom Gedeon
For instance, the architecture of the visual encoder in CLIP plays a significant role in their robustness against 3D corruption.
no code implementations • 22 Sep 2024 • Arjun Raj, Lei Wang, Tom Gedeon
We refer to our lightweight, plug-and-play solution, built on top of the existing TrackNet, as TrackNetV4.
no code implementations • 11 Sep 2024 • Shreya Ghosh, Zhixi Cai, Abhinav Dhall, Dimitrios Kollias, Roland Goecke, Tom Gedeon
With the rapid advancements in multimodal generative technology, Affective Computing research has provoked discussion about the potential consequences of AI systems equipped with emotional intelligence.
no code implementations • 10 Sep 2024 • Surbhi Madan, Shreya Ghosh, Lownish Rai Sookha, M. A. Ganaie, Ramanathan Subramanian, Abhinav Dhall, Tom Gedeon
Estimating the Most Important Person (MIP) in any social event setup is a challenging problem mainly due to contextual complexity and scarcity of labeled data.
1 code implementation • 9 Jul 2024 • Xuehan Liu, Md Rakibul Hasan, Tom Gedeon, Md Zakir Hossain
We experiment with the well-known publicly available ABIDE (Autism Brain Imaging Data Exchange) I dataset, consisting of resting state fMRI data from 17 different laboratories around the globe.
Ranked #1 on
Autism detection
on ABIDE I CPAC (NYU Subset)
1 code implementation • 3 Jul 2024 • Qixiang Chen, Lei Wang, Piotr Koniusz, Tom Gedeon
We formalize this process as a motion prompt layer and incorporate the regularization term into the loss function to learn better motion prompts.
no code implementations • 14 Jun 2024 • Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon
This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled data from out-of-distribution (OOD) distributions.
no code implementations • 10 May 2024 • Md Jobayer, Md. Mehedi Hasan Shawon, Md Rakibul Hasan, Shreya Ghosh, Tom Gedeon, Md Zakir Hossain
Conclusion: The proposed model offers a deep learning framework for real-time accurate heart murmur detection within limited resources.
no code implementations • 21 Apr 2024 • Qixuan Zhang, Zhifeng Wang, Yang Liu, Zhenyue Qin, Kaihao Zhang, Sabrina Caldwell, Tom Gedeon
In this paper, we present a novel benchmark for Emotion Recognition using facial landmarks extracted from realistic news videos.
1 code implementation • 12 Mar 2024 • Zeyu Zhang, Khandaker Asif Ahmed, Md Rakibul Hasan, Tom Gedeon, Md Zakir Hossain
Diabetes, resulting from inadequate insulin production or utilization, causes extensive harm to the body.
no code implementations • 12 Feb 2024 • Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon
Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes.
no code implementations • NeurIPS 2023 • Weijie Tu, Weijian Deng, Tom Gedeon
Driven by the above, this work comprehensively investigates the safety objectives of CLIP models, specifically focusing on three key properties: resilience to visual factor variations, calibrated uncertainty estimations, and the ability to detect anomalous inputs.
no code implementations • 7 Feb 2024 • Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz
For a support sequence, we match it with view-simulated query sequences, as in the popular Dynamic Time Warping (DTW).
no code implementations • 7 Feb 2024 • Liyun Zhu, Lei Wang, Arjun Raj, Tom Gedeon, Chen Chen
Our review identifies practical issues, including the absence of comprehensive datasets with diverse scenarios.
1 code implementation • 5 Feb 2024 • Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng
Effectively extracting motions from video is a critical and long-standing problem for action recognition.
no code implementations • 4 Dec 2023 • Yunzhong Hou, Xingjian Leng, Tom Gedeon, Liang Zheng
Jointly considering multiple camera views (multi-view) is very effective for pedestrian detection under occlusion.
1 code implementation • 26 Nov 2023 • Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Tom Gedeon, Kalin Stefanov
The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets.
no code implementations • 30 Oct 2023 • Md Rakibul Hasan, Md Zakir Hossain, Shreya Ghosh, Aneesh Krishna, Tom Gedeon
Empathy indicates an individual's ability to understand others.
1 code implementation • 9 Oct 2023 • Lei Wang, Piotr Koniusz, Tom Gedeon, Liang Zheng
In contrastive learning, two views of an original image, generated by different augmentations, are considered a positive pair, and their similarity is required to be high.
1 code implementation • 18 Aug 2023 • Yue Yao, Xinyu Tian, Zheng Tang, Sujit Biswas, Huan Lei, Tom Gedeon, Liang Zheng
Because the digital twins individually mimic user bias, the resulting DT training set better reflects the characteristics of the target scenario and allows us to train more effective product detection and tracking models.
1 code implementation • 10 May 2023 • Md Rakibul Hasan, Shreya Ghosh, Pradyumna Agrawal, Zhixi Cai, Abhinav Dhall, Tom Gedeon
This paper proposes a feedback mechanism to change behavioural patterns using the Pavlok device.
1 code implementation • 3 May 2023 • Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat
The proposed baseline method, Boundary Aware Temporal Forgery Detection (BA-TFD), is a 3D Convolutional Neural Network-based architecture which effectively captures multimodal manipulations.
Ranked #1 on
Temporal Forgery Localization
on ForgeryNet
1 code implementation • CVPR 2023 • Yue Yao, Huan Lei, Tom Gedeon, Liang Zheng
We consider a scenario where we have access to the target domain, but cannot afford on-the-fly training data annotation, and instead would like to construct an alternative training set from a large-scale data pool such that a competitive model can be obtained.
no code implementations • CVPR 2023 • Weijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng
The former measures how suitable a training set is for a target domain, while the latter studies how challenging a test set is for a learned model.
no code implementations • 26 Jul 2022 • Xuyang Shen, Jo Plested, Sabrina Caldwell, Yiran Zhong, Tom Gedeon
Fine-tuning is widely applied in image classification tasks as a transfer learning approach.
no code implementations • 28 May 2022 • Zhenyue Qin, Pan Ji, Dongwoo Kim, Yang Liu, Saeed Anwar, Tom Gedeon
Skeleton sequences are compact and lightweight.
no code implementations • 20 May 2022 • Jo Plested, Tom Gedeon
We show that under this new taxonomy, many of the applications where transfer learning has been shown to be ineffective or even hinder performance are to be expected when taking into account the source and target datasets and the techniques used.
no code implementations • 4 May 2022 • Zhenyue Qin, Yang Liu, Madhawa Perera, Tom Gedeon, Pan Ji, Dongwoo Kim, Saeed Anwar
To this end, we present a review in the form of a taxonomy on existing works of skeleton-based action recognition.
2 code implementations • 28 Feb 2022 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Napthade, Tom Gedeon
This article aims to use graphic engines to simulate a large number of training data that have free annotations and possibly strongly resemble to real-world data.
1 code implementation • 3 Dec 2021 • Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng
To this end, we develop a protocol to automatically synthesize large scale MiE training data that allow us to train improved recognition models for real-world test data.
1 code implementation • 15 Oct 2021 • Zhenyue Qin, Tom Gedeon, Bob McKay
This dynamicity is imposed on top of an already complex fitness landscape.
no code implementations • 8 Sep 2021 • Xuyang Shen, Jo Plested, Tom Gedeon
These findings are likely to improve the accuracy of current stress recognition systems.
no code implementations • 23 Aug 2021 • Xuyang Shen, Jo Plested, Sabrina Caldwell, Tom Gedeon
Varying the proportions of male and female faces in the training data can have a substantial effect on behavior on the test data: we found that the seemingly obvious choice of 50:50 proportions was not the best for this dataset to reduce biased behavior on female faces, which was 71% unbiased as compared to our top unbiased rate of 84%.
1 code implementation • 19 Jul 2021 • Jo Plested, Xuyang Shen, Tom Gedeon
A model is either pre-trained or not pre-trained.
Ranked #2 on
Image Classification
on Caltech-256
(using extra training data)
no code implementations • 19 Jun 2021 • Zhenyue Qin, Dongwoo Kim, Tom Gedeon
We give a new view of neural network classifiers with softmax and cross-entropy as mutual information evaluators.
no code implementations • 19 Jun 2021 • Zhenyue Qin, Dongwoo Kim, Tom Gedeon
We develop an informative class activation map (infoCAM).
1 code implementation • 24 May 2021 • Zhenyue Qin, Saeed Anwar, Dongwoo Kim, Yang Liu, Pan Ji, Tom Gedeon
Such GNNs are incapable of learning relative positions between graph nodes within a graph.
1 code implementation • 11 May 2021 • Yang Liu, Saeed Anwar, Zhenyue Qin, Pan Ji, Sabrina Caldwell, Tom Gedeon
The prevalent convolutional neural network (CNN) based image denoising methods extract features of images to restore the clean ground truth, achieving high denoising accuracy.
1 code implementation • 4 May 2021 • Zhenyue Qin, Yang Liu, Pan Ji, Dongwoo Kim, Lei Wang, Bob McKay, Saeed Anwar, Tom Gedeon
Recent skeleton-based action recognition methods extract features from 3D joint coordinates as spatial-temporal cues, using these representations in a graph neural network for feature fusion to boost recognition performance.
Ranked #27 on
Skeleton Based Action Recognition
on NTU RGB+D 120
1 code implementation • CVPR 2021 • Yang Liu, Zhenyue Qin, Saeed Anwar, Pan Ji, Dongwoo Kim, Sabrina Caldwell, Tom Gedeon
InvDN transforms the noisy input into a low-resolution clean image and a latent representation containing noise.
1 code implementation • 30 Nov 2020 • Yan Yang, Md Zakir Hossain, Tom Gedeon, Shafin Rahman
Instead of constraining the translation process by using a reference image, the users can command the model to retouch the generated images by involving the semantic information in the generation process.
1 code implementation • 18 Oct 2020 • Shiya Liu, Yue Yao, Chaoyue Xing, Tom Gedeon
The personal identity information in original EEGs are transformed into disguised ones with a CycleGANbased EEG disguising model.
no code implementations • 14 Oct 2020 • Rollin Omari, R. I. McKay, Tom Gedeon
To illustrate this, we first investigate the performance of our networks with supervised learning, then with unsupervised learning.
no code implementations • 7 Oct 2020 • Yan Yang, Md Zakir Hossain, Tom Gedeon, Shafin Rahman
Smiles play a vital role in the understanding of social interactions within different communities, and reveal the physical state of mind of people in both real and deceptive ways.
no code implementations • 23 Sep 2020 • Weiwei Hou, Hanna Suominen, Piotr Koniusz, Sabrina Caldwell, Tom Gedeon
Sentence compression is a Natural Language Processing (NLP) task aimed at shortening original sentences and preserving their key information.
1 code implementation • 13 Sep 2020 • Xuyang Shen, Jo Plested, Yue Yao, Tom Gedeon
This inspired our research which explores the performance of two models from pixel transformation in frontal facial synthesis, Pix2Pix and CycleGAN.
1 code implementation • 7 Sep 2020 • Yang Liu, Zhenyue Qin, Saeed Anwar, Sabrina Caldwell, Tom Gedeon
Identifying the information lossless condition for deep neural architectures is important, because tasks such as image restoration require keep the detailed information of the input data as much as possible.
no code implementations • 6 Sep 2020 • Chaoxing Huang, Xuanying Zhu, Tom Gedeon
For example, acted anger can be expressed when stimuli is not genuinely angry with an aim to manipulate the observer.
2 code implementations • ECCV 2020 • Yue Yao, Liang Zheng, Xiaodong Yang, Milind Naphade, Tom Gedeon
Between synthetic and real data, there is a two-level domain gap, i. e., content level and appearance level.
1 code implementation • 25 Nov 2019 • Zhenyue Qin, Dongwoo Kim, Tom Gedeon
We show that optimising the parameters of classification neural networks with softmax cross-entropy is equivalent to maximising the mutual information between inputs and labels under the balanced data assumption.
no code implementations • 31 Dec 2018 • Shreya Ghosh, Abhinav Dhall, Nicu Sebe, Tom Gedeon
We study the factors that influence the perception of group-level cohesion and propose methods for estimating the human-perceived cohesion on the group cohesiveness scale.
no code implementations • 8 Nov 2018 • Jiaxu Zuo, Tom Gedeon, Zhenyue Qin
Eye movement patterns reflect human latent internal cognitive activities.
Human-Computer Interaction
no code implementations • 23 Aug 2018 • Abhinav Dhall, Amanjot Kaur, Roland Goecke, Tom Gedeon
This paper details the sixth Emotion Recognition in the Wild (EmotiW) challenge.
no code implementations • 11 Jul 2018 • Zhenyue Qin, Robert McKay, Tom Gedeon
Wagner's modularity inducing problem domain is a key contribution to the study of the evolution of modularity, including both evolutionary theory and evolutionary computation.
no code implementations • SEMEVAL 2018 • Liyuan Zhou, Qiongkai Xu, Hanna Suominen, Tom Gedeon
This paper describes our approach, called EPUTION, for the open trial of the SemEval- 2018 Task 2, Multilingual Emoji Prediction.