Search Results for author: Tom Gedeon

Found 61 papers, 25 papers with code

Unsupervised Search for Ethnic Minorities' Medical Segmentation Training Set

1 code implementation 5 Jan 2025 Yixiao Chen, Yue Yao, Ruining Yang, Md Zakir Hossain, Ashu Gupta, Tom Gedeon

This article investigates the critical issue of dataset bias in medical imaging, with a particular emphasis on racial disparities caused by uneven population distribution in dataset collection.

Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro

no code implementations 1 Jan 2025 Md Rakibul Hasan, Yue Yao, Md Zakir Hossain, Aneesh Krishna, Imre Rudas, Shafin Rahman, Tom Gedeon

In this paper, we evaluate this approach in the emerging field of empathy computing -- automating the prediction of psychological questionnaire outcomes from inputs like text sequences.

Data Augmentation Language Modeling +3

Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels

no code implementations 9 Dec 2024 Weijie Tu, Weijian Deng, Dylan Campbell, Yu Yao, Jiyang Zheng, Tom Gedeon, Tongliang Liu

As large multimodal models (LMMs) are increasingly deployed across diverse applications, the need for adaptable, real-world model ranking has become paramount.

Question Answering Visual Question Answering

When Spatial meets Temporal in Action Recognition

no code implementations 22 Nov 2024 Huilin Chen, Lei Wang, Yifan Chen, Tom Gedeon, Piotr Koniusz

Capturing the rich temporal evolution of video frames, while preserving their spatial details, is crucial for improving accuracy.

Action Recognition Temporal Action Localization

Visual Prompting in LLMs for Enhancing Emotion Recognition

no code implementations 3 Oct 2024 Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Wenjia Niu, Sabrina Caldwell, Tom Gedeon, Yang Liu, Zhenyue Qin

Vision Large Language Models (VLLMs) are transforming the intersection of computer vision and natural language processing.

Emotion Recognition Visual Prompting

LEGO: Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion

no code implementations 2 Oct 2024 Dexuan Ding, Lei Wang, Liyun Zhu, Tom Gedeon, Piotr Koniusz

In computer vision tasks, features often come from diverse representations, domains, and modalities, such as text, images, and videos.

Anomaly Detection Video Anomaly Detection

Toward a Holistic Evaluation of Robustness in CLIP Models

no code implementations 2 Oct 2024 Weijie Tu, Weijian Deng, Tom Gedeon

For instance, the architecture of the visual encoder in CLIP plays a significant role in their robustness against 3D corruption.

Classification Out-of-Distribution Detection +1

TrackNetV4: Enhancing Fast Sports Object Tracking with Motion Attention Maps

no code implementations 22 Sep 2024 Arjun Raj, Lei Wang, Tom Gedeon

We refer to our lightweight, plug-and-play solution, built on top of the existing TrackNet, as TrackNetV4.

Object Tracking Trajectory Prediction

MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing

no code implementations 11 Sep 2024 Shreya Ghosh, Zhixi Cai, Abhinav Dhall, Dimitrios Kollias, Roland Goecke, Tom Gedeon

With the rapid advancements in multimodal generative technology, Affective Computing research has provoked discussion about the potential consequences of AI systems equipped with emotional intelligence.

Emotional Intelligence

MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding

no code implementations 10 Sep 2024 Surbhi Madan, Shreya Ghosh, Lownish Rai Sookha, M. A. Ganaie, Ramanathan Subramanian, Abhinav Dhall, Tom Gedeon

Estimating the Most Important Person (MIP) in any social event setup is a challenging problem mainly due to contextual complexity and scarcity of labeled data.

Benchmarking Language Modeling +3

MADE-for-ASD: A Multi-Atlas Deep Ensemble Network for Diagnosing Autism Spectrum Disorder

1 code implementation 9 Jul 2024 Xuehan Liu, Md Rakibul Hasan, Tom Gedeon, Md Zakir Hossain

We experiment with the well-known publicly available ABIDE (Autism Brain Imaging Data Exchange) I dataset, consisting of resting state fMRI data from 17 different laboratories around the globe.

Autism detection Diagnostic +1

Motion meets Attention: Video Motion Prompts

1 code implementation 3 Jul 2024 Qixiang Chen, Lei Wang, Piotr Koniusz, Tom Gedeon

We formalize this process as a motion prompt layer and incorporate the regularization term into the loss function to learn better motion prompts.

Action Recognition

What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

no code implementations 14 Jun 2024 Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon

This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled data from out-of-distribution (OOD) distributions.

FunnelNet: An End-to-End Deep Learning Framework to Monitor Digital Heart Murmur in Real-Time

no code implementations 10 May 2024 Md Jobayer, Md. Mehedi Hasan Shawon, Md Rakibul Hasan, Shreya Ghosh, Tom Gedeon, Md Zakir Hossain

Conclusion: The proposed model offers a deep learning framework for real-time accurate heart murmur detection within limited resources.

Diagnostic

Authentic Emotion Mapping: Benchmarking Facial Expressions in Real News

no code implementations 21 Apr 2024 Qixuan Zhang, Zhifeng Wang, Yang Liu, Zhenyue Qin, Kaihao Zhang, Sabrina Caldwell, Tom Gedeon

In this paper, we present a novel benchmark for Emotion Recognition using facial landmarks extracted from realistic news videos.

Benchmarking Emotion Recognition

DiabetesNet: A Deep Learning Approach to Diabetes Diagnosis

1 code implementation 12 Mar 2024 Zeyu Zhang, Khandaker Asif Ahmed, Md Rakibul Hasan, Tom Gedeon, Md Zakir Hossain

Diabetes, resulting from inadequate insulin production or utilization, causes extensive harm to the body.

Deep Learning Diagnostic +1

An Empirical Study Into What Matters for Calibrating Vision-Language Models

no code implementations 12 Feb 2024 Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon

Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes.

Zero-Shot Learning

A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

no code implementations NeurIPS 2023 Weijie Tu, Weijian Deng, Tom Gedeon

Driven by the above, this work comprehensively investigates the safety objectives of CLIP models, specifically focusing on three key properties: resilience to visual factor variations, calibrated uncertainty estimations, and the ability to detect anomalous inputs.

Advancing Video Anomaly Detection: A Concise Review and a New Dataset

no code implementations 7 Feb 2024 Liyun Zhu, Lei Wang, Arjun Raj, Tom Gedeon, Chen Chen

Our review identifies practical issues, including the absence of comprehensive datasets with diverse scenarios.

Anomaly Detection Diversity +2

Taylor Videos for Action Recognition

1 code implementation 5 Feb 2024 Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng

Effectively extracting motions from video is a critical and long-standing problem for action recognition.

Action Recognition Optical Flow Estimation +1

Optimizing Camera Configurations for Multi-View Pedestrian Detection

no code implementations 4 Dec 2023 Yunzhong Hou, Xingjian Leng, Tom Gedeon, Liang Zheng

Jointly considering multiple camera views (multi-view) is very effective for pedestrian detection under occlusion.

Pedestrian Detection

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

1 code implementation 26 Nov 2023 Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Tom Gedeon, Kalin Stefanov

The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets.

2k DeepFake Detection +2

Adaptive Multi-head Contrastive Learning

1 code implementation 9 Oct 2023 Lei Wang, Piotr Koniusz, Tom Gedeon, Liang Zheng

In contrastive learning, two views of an original image, generated by different augmentations, are considered a positive pair, and their similarity is required to be high.

Contrastive Learning

Training with Product Digital Twins for AutoRetail Checkout

1 code implementation 18 Aug 2023 Yue Yao, Xinyu Tian, Zheng Tang, Sujit Biswas, Huan Lei, Tom Gedeon, Liang Zheng

Because the digital twins individually mimic user bias, the resulting DT training set better reflects the characteristics of the target scenario and allows us to train more effective product detection and tracking models.

Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

1 code implementation 3 May 2023 Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat

The proposed baseline method, Boundary Aware Temporal Forgery Detection (BA-TFD), is a 3D Convolutional Neural Network-based architecture which effectively captures multimodal manipulations.

Binary Classification DeepFake Detection +2

Large-scale Training Data Search for Object Re-identification

1 code implementation CVPR 2023 Yue Yao, Huan Lei, Tom Gedeon, Liang Zheng

We consider a scenario where we have access to the target domain, but cannot afford on-the-fly training data annotation, and instead would like to construct an alternative training set from a large-scale data pool such that a competitive model can be obtained.

Object Specificity

A Bag-of-Prototypes Representation for Dataset-Level Applications

no code implementations CVPR 2023 Weijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng

The former measures how suitable a training set is for a target domain, while the latter studies how challenging a test set is for a learned model.

Deep transfer learning for image classification: a survey

no code implementations 20 May 2022 Jo Plested, Tom Gedeon

We show that under this new taxonomy, many of the applications where transfer learning has been shown to be ineffective or even hinder performance are to be expected when taking into account the source and target datasets and the techniques used.

Classification Image Classification +2

Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond

2 code implementations 28 Feb 2022 Yue Yao, Liang Zheng, Xiaodong Yang, Milind Napthade, Tom Gedeon

This article aims to use graphic engines to simulate a large amount of training data that has free annotations and may strongly resemble real-world data.

Attribute Data Augmentation +2

How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset?

1 code implementation 3 Dec 2021 Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng

To this end, we develop a protocol to automatically synthesize large scale MiE training data that allow us to train improved recognition models for real-world test data.

Face Generation Micro-Expression Recognition

Feature Selection on Thermal-stress Dataset

no code implementations 8 Sep 2021 Xuyang Shen, Jo Plested, Tom Gedeon

These findings are likely to improve the accuracy of current stress recognition systems.

feature selection

Exploring Biases and Prejudice of Facial Synthesis via Semantic Latent Space

no code implementations 23 Aug 2021 Xuyang Shen, Jo Plested, Sabrina Caldwell, Tom Gedeon

Varying the proportions of male and female faces in the training data can have a substantial effect on behavior on the test data: we found that the seemingly obvious choice of 50:50 proportions was not the best for this dataset to reduce biased behavior on female faces, which was 71% unbiased as compared to our top unbiased rate of 84%.

Neural Network Classifier as Mutual Information Evaluator

no code implementations 19 Jun 2021 Zhenyue Qin, Dongwoo Kim, Tom Gedeon

We give a new view of neural network classifiers with softmax and cross-entropy as mutual information evaluators.

Form

Disentangling Noise from Images: A Flow-Based Image Denoising Neural Network

1 code implementation 11 May 2021 Yang Liu, Saeed Anwar, Zhenyue Qin, Pan Ji, Sabrina Caldwell, Tom Gedeon

The prevalent convolutional neural network (CNN) based image denoising methods extract features of images to restore the clean ground truth, achieving high denoising accuracy.

Disentanglement Image Denoising

Fusing Higher-order Features in Graph Neural Networks for Skeleton-based Action Recognition

1 code implementation 4 May 2021 Zhenyue Qin, Yang Liu, Pan Ji, Dongwoo Kim, Lei Wang, Bob McKay, Saeed Anwar, Tom Gedeon

Recent skeleton-based action recognition methods extract features from 3D joint coordinates as spatial-temporal cues, using these representations in a graph neural network for feature fusion to boost recognition performance.

Action Recognition Graph Neural Network +1

S2FGAN: Semantically Aware Interactive Sketch-to-Face Translation

1 code implementation 30 Nov 2020 Yan Yang, Md Zakir Hossain, Tom Gedeon, Shafin Rahman

Instead of constraining the translation process by using a reference image, the users can command the model to retouch the generated images by involving the semantic information in the generation process.

Attribute Diversity +2

Disguising Personal Identity Information in EEG Signals

1 code implementation 18 Oct 2020 Shiya Liu, Yue Yao, Chaoyue Xing, Tom Gedeon

The personal identity information in original EEGs are transformed into disguised ones with a CycleGAN-based EEG disguising model.

EEG

Analogical and Relational Reasoning with Spiking Neural Networks

no code implementations 14 Oct 2020 Rollin Omari, R. I. McKay, Tom Gedeon

To illustrate this, we first investigate the performance of our networks with supervised learning, then with unsupervised learning.

Relational Reasoning

RealSmileNet: A Deep End-To-End Network for Spontaneous and Posed Smile Recognition

no code implementations 7 Oct 2020 Yan Yang, Md Zakir Hossain, Tom Gedeon, Shafin Rahman

Smiles play a vital role in the understanding of social interactions within different communities, and reveal the physical state of mind of people in both real and deceptive ways.

Feature Engineering Smile Recognition

A Token-wise CNN-based Method for Sentence Compression

no code implementations 23 Sep 2020 Weiwei Hou, Hanna Suominen, Piotr Koniusz, Sabrina Caldwell, Tom Gedeon

Sentence compression is a Natural Language Processing (NLP) task aimed at shortening original sentences and preserving their key information.

Sentence Sentence Compression

Pairwise-GAN: Pose-based View Synthesis through Pair-Wise Training

1 code implementation 13 Sep 2020 Xuyang Shen, Jo Plested, Yue Yao, Tom Gedeon

This inspired our research which explores the performance of two models from pixel transformation in frontal facial synthesis, Pix2Pix and CycleGAN.

Face Reconstruction Generative Adversarial Network +1

Are Deep Neural Architectures Losing Information? Invertibility Is Indispensable

1 code implementation 7 Sep 2020 Yang Liu, Zhenyue Qin, Saeed Anwar, Sabrina Caldwell, Tom Gedeon

Identifying the information lossless condition for deep neural architectures is important, because tasks such as image restoration require keeping the detailed information of the input data as much as possible.

Image Denoising Image Inpainting +1

A Genetic Feature Selection Based Two-stream Neural Network for Anger Veracity Recognition

no code implementations 6 Sep 2020 Chaoxing Huang, Xuanying Zhu, Tom Gedeon

For example, acted anger can be expressed when stimuli is not genuinely angry with an aim to manipulate the observer.

Emotion Recognition feature selection +2

Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator

1 code implementation 25 Nov 2019 Zhenyue Qin, Dongwoo Kim, Tom Gedeon

We show that optimising the parameters of classification neural networks with softmax cross-entropy is equivalent to maximising the mutual information between inputs and labels under the balanced data assumption.

Classification Fine-Grained Image Classification +2

Predicting Group Cohesiveness in Images

no code implementations 31 Dec 2018 Shreya Ghosh, Abhinav Dhall, Nicu Sebe, Tom Gedeon

We study the factors that influence the perception of group-level cohesion and propose methods for estimating the human-perceived cohesion on the group cohesiveness scale.

Attribute

Your Eyes Say You're Lying: An Eye Movement Pattern Analysis for Face Familiarity and Deceptive Cognition

no code implementations 8 Nov 2018 Jiaxu Zuo, Tom Gedeon, Zhenyue Qin

Eye movement patterns reflect human latent internal cognitive activities.

Human-Computer Interaction

Why don't the modules dominate - Investigating the Structure of a Well-Known Modularity-Inducing Problem Domain

no code implementations 11 Jul 2018 Zhenyue Qin, Robert McKay, Tom Gedeon

Wagner's modularity inducing problem domain is a key contribution to the study of the evolution of modularity, including both evolutionary theory and evolutionary computation.

EPUTION at SemEval-2018 Task 2: Emoji Prediction with User Adaption

no code implementations SEMEVAL 2018 Liyuan Zhou, Qiongkai Xu, Hanna Suominen, Tom Gedeon

This paper describes our approach, called EPUTION, for the open trial of the SemEval-2018 Task 2, Multilingual Emoji Prediction.

General Classification Task 2 +4
