Search Results for author: Zhenyue Qin

Found 23 papers, 8 papers with code

Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey

no code implementations20 May 2025 Seunghyuk Cho, Zhenyue Qin, Yang Liu, Youngbin Choi, Seungbeom Lee, Dongwoo Kim

In particular, we discuss the hallucination issues arising during the encoding phase within encoder-decoder architectures, as well as the problem of data leakage in current PGPS benchmarks.

Decoder Geometry Problem Solving +1

GeoDANO: Geometric VLM with Domain Agnostic Vision Encoder

no code implementations17 Feb 2025 Seunghyuk Cho, Zhenyue Qin, Yang Liu, Youngbin Choi, Seungbeom Lee, Dongwoo Kim

We introduce GeoDANO, a geometric vision-language model (VLM) with a domain-agnostic vision encoder, for solving plane geometry problems.

Domain Adaptation Language Modeling +1

HandCraft: Anatomically Correct Restoration of Malformed Hands in Diffusion Generated Images

no code implementations7 Nov 2024 Zhenyue Qin, Yiqun Zhang, Yang Liu, Dylan Campbell

Generative text-to-image models, such as Stable Diffusion, have demonstrated a remarkable ability to generate diverse, high-quality images.

Anatomy Benchmarking

Visual Prompting in LLMs for Enhancing Emotion Recognition

no code implementations3 Oct 2024 Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Wenjia Niu, Sabrina Caldwell, Tom Gedeon, Yang Liu, Zhenyue Qin

Vision Large Language Models (VLLMs) are transforming the intersection of computer vision and natural language processing.

Emotion Recognition Visual Prompting

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

no code implementations2 Oct 2024 Zhenyue Qin, Yu Yin, Dylan Campbell, Xuansheng Wu, Ke Zou, Yih-Chung Tham, Ninghao Liu, Xiuzhen Zhang, Qingyu Chen

The prevalence of vision-threatening eye diseases is a significant global burden, with many cases remaining undiagnosed or diagnosed too late for effective treatment.

Hallucination

Authentic Emotion Mapping: Benchmarking Facial Expressions in Real News

no code implementations21 Apr 2024 Qixuan Zhang, Zhifeng Wang, Yang Liu, Zhenyue Qin, Kaihao Zhang, Sabrina Caldwell, Tom Gedeon

In this paper, we present a novel benchmark for Emotion Recognition using facial landmarks extracted from realistic news videos.

Benchmarking Emotion Recognition

Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images

no code implementations7 Dec 2023 Yiqun Zhang, Zhenyue Qin, Yang Liu, Dylan Campbell

We introduce a pipeline to address anatomical inaccuracies in Stable Diffusion generated hand images.

Pose Estimation

Anonymization for Skeleton Action Recognition

1 code implementation30 Nov 2021 Saemi Moon, Myeonghyeon Kim, Zhenyue Qin, Yang Liu, Dongwoo Kim

Compared with RGB-video-based action recognition, skeleton-based action recognition is a safer way to protect the privacy of subjects while having competitive recognition performance.

Action Recognition Skeleton Based Action Recognition

Neural Network Classifier as Mutual Information Evaluator

no code implementations19 Jun 2021 Zhenyue Qin, Dongwoo Kim, Tom Gedeon

We give a new view of neural network classifiers with softmax and cross-entropy as mutual information evaluators.

Form

Disentangling Noise from Images: A Flow-Based Image Denoising Neural Network

1 code implementation11 May 2021 Yang Liu, Saeed Anwar, Zhenyue Qin, Pan Ji, Sabrina Caldwell, Tom Gedeon

The prevalent convolutional neural network (CNN) based image denoising methods extract features of images to restore the clean ground truth, achieving high denoising accuracy.

Disentanglement Image Denoising

Fusing Higher-order Features in Graph Neural Networks for Skeleton-based Action Recognition

1 code implementation4 May 2021 Zhenyue Qin, Yang Liu, Pan Ji, Dongwoo Kim, Lei Wang, Bob McKay, Saeed Anwar, Tom Gedeon

Recent skeleton-based action recognition methods extract features from 3D joint coordinates as spatial-temporal cues, using these representations in a graph neural network for feature fusion to boost recognition performance.

Action Recognition Graph Neural Network +1

Are Deep Neural Architectures Losing Information? Invertibility Is Indispensable

1 code implementation7 Sep 2020 Yang Liu, Zhenyue Qin, Saeed Anwar, Sabrina Caldwell, Tom Gedeon

Identifying the information lossless condition for deep neural architectures is important, because tasks such as image restoration require keep the detailed information of the input data as much as possible.

Image Denoising Image Inpainting +1

Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator

1 code implementation25 Nov 2019 Zhenyue Qin, Dongwoo Kim, Tom Gedeon

We show that optimising the parameters of classification neural networks with softmax cross-entropy is equivalent to maximising the mutual information between inputs and labels under the balanced data assumption.

Classification Fine-Grained Image Classification +2

Softmax Is Not an Artificial Trick: An Information-Theoretic View of Softmax in Neural Networks

no code implementations7 Oct 2019 Zhenyue Qin, Dongwoo Kim

Under this view, we can naturally and mathematically derive log-softmax as an inherent component in a neural network for evaluating the conditional mutual information between network output vectors and labels given an input datum.

Your Eyes Say You're Lying: An Eye Movement Pattern Analysis for Face Familiarity and Deceptive Cognition

no code implementations8 Nov 2018 Jiaxu Zuo, Tom Gedeon, Zhenyue Qin

Eye movement patterns reflect human latent internal cognitive activities.

Human-Computer Interaction

Why don't the modules dominate - Investigating the Structure of a Well-Known Modularity-Inducing Problem Domain

no code implementations11 Jul 2018 Zhenyue Qin, Robert McKay, Tom Gedeon

Wagner's modularity inducing problem domain is a key contribution to the study of the evolution of modularity, including both evolutionary theory and evolutionary computation.

Cannot find the paper you are looking for? You can Submit a new open access paper.