Search Results for author: Wenhui Zhu

Found 37 papers, 21 papers with code

Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models

no code implementations20 May 2025 Wenhui Zhu, Xuanzhao Dong, Xin Li, Peijie Qiu, Xiwen Chen, Abolfazl Razi, Aris Sotiras, Yi Su, Yalin Wang

Recently, reinforcement learning (RL)-based tuning has shifted the trajectory of Multimodal Large Language Models (MLLMs), particularly following the introduction of Group Relative Policy Optimization (GRPO).

Medical Visual Question Answering Question Answering +2

FIC-TSC: Learning Time Series Classification with Fisher Information Constraint

no code implementations9 May 2025 Xiwen Chen, Wenhui Zhu, Peijie Qiu, Hao Wang, Huayu Li, Zihan Li, Yalin Wang, Aristeidis Sotiras, Abolfazl Razi

However, there is a large consensus that time series data often suffers from domain shifts between training and test sets, which dramatically degrades the classification performance.

Classification Time Series +2

Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA

1 code implementation30 Apr 2025 Xuanzhao Dong, Wenhui Zhu, Hao Wang, Xiwen Chen, Peijie Qiu, Rui Yin, Yi Su, Yalin Wang

Medical question answering (QA) is a reasoning-intensive task that remains challenging for large language models (LLMs) due to hallucinations and outdated domain knowledge.

Information Retrieval Question Answering +3

How Effective Can Dropout Be in Multiple Instance Learning ?

1 code implementation21 Apr 2025 Wenhui Zhu, Peijie Qiu, Xiwen Chen, Zhangsihao Yang, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang

Due to the gigapixel resolution of WSI, applications of MIL in WSI typically necessitate a two-stage training scheme: first, extract features from the pre-trained backbone and then perform MIL aggregation.

Multiple Instance Learning

Schrödinger Diffusion Driven Signal Recovery in 3T BOLD fMRI Using Unmatched 7T Observations

no code implementations1 Apr 2025 Yujian Xiong, Xuanzhao Dong, Sebastian Waz, Wenhui Zhu, Negar Mallak, Zhong-Lin Lu, Yalin Wang

Ultra-high-field (7 Tesla) BOLD fMRI offers exceptional detail in both spatial and temporal domains, along with robust signal-to-noise characteristics, making it a powerful modality for studying visual information processing in the brain.

EyeBench: A Call for More Rigorous Evaluation of Retinal Image Enhancement

no code implementations20 Feb 2025 Wenhui Zhu, Xuanzhao Dong, Xin Li, Yujian Xiong, Xiwen Chen, Peijie Qiu, Vamsi Krishna Vasa, Zhangsihao Yang, Yi Su, Oana Dumitrascu, Yalin Wang

To this end, we propose a novel comprehensive benchmark, EyeBench, to provide insights that align enhancement models with clinical needs, offering a foundation for future work to improve the clinical relevance and applicability of generative models for fundus image enhancement.

Denoising Image Enhancement +2

Sequence Complementor: Complementing Transformers For Time Series Forecasting with Learnable Sequences

no code implementations6 Jan 2025 Xiwen Chen, Peijie Qiu, Wenhui Zhu, Huayu Li, Hao Wang, Aristeidis Sotiras, Yalin Wang, Abolfazl Razi

Since its introduction, the transformer has shifted the development trajectory away from traditional models (e. g., RNN, MLP) in time series forecasting, which is attributed to its ability to capture global dependencies within temporal tokens.

Time Series Time Series Forecasting

Multimodal Variational Autoencoder: a Barycentric View

no code implementations29 Dec 2024 Peijie Qiu, Wenhui Zhu, Sayantan Kumar, Xiwen Chen, Xiaotong Sun, Jin Yang, Abolfazl Razi, Yalin Wang, Aristeidis Sotiras

Previous attempts at multimodal VAEs approach this mainly through the lens of experts, aggregating unimodal inference distributions with a product of experts (PoE), a mixture of experts (MoE), or a combination of both.

Mixture-of-Experts Representation Learning

SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC

1 code implementation23 Dec 2024 Yue Deng, Yan Yu, Weiyu Ma, ZiRui Wang, Wenhui Zhu, Jian Zhao, Yin Zhang

SMAC-HARD supports customizable opponent strategies, randomization of adversarial policies, and interfaces for MARL self-play, enabling agents to generalize to varying opponent behaviors and improve model stability.

Benchmarking SMAC+ +1

Deploying Foundation Model Powered Agent Services: A Survey

no code implementations18 Dec 2024 Wenchao Xu, Jinyu Chen, Peirong Zheng, Xiaoquan Yi, Tianyi Tian, Wenhui Zhu, Quan Wan, Haozhao Wang, Yunfeng Fan, Qinliang Su, Xuemin Shen

Foundation model (FM) powered agent services are regarded as a promising solution to develop intelligent and personalized applications for advancing toward Artificial General Intelligence (AGI).

model Model Compression +2

EViT-Unet: U-Net Like Efficient Vision Transformer for Medical Image Segmentation on Mobile and Edge Devices

1 code implementation19 Oct 2024 Xin Li, Wenhui Zhu, Xuanzhao Dong, Oana M. Dumitrascu, Yalin Wang

The rise of Vision Transformer (ViT) has effectively compensated for this deficiency of CNNs and promoted the application of ViT-based U-networks in medical image segmentation.

Decoder Image Segmentation +3

STA-Unet: Rethink the semantic redundant for Medical Imaging Segmentation

1 code implementation13 Oct 2024 Vamsi Krishna Vasa, Wenhui Zhu, Xiwen Chen, Peijie Qiu, Xuanzhao Dong, Yalin Wang

In particular, deep neural networks based on a U-shaped architecture (UNet) with skip connections have been adopted for several medical imaging tasks, including organ segmentation.

Medical Image Analysis Medical Image Segmentation +2

Context-Aware Optimal Transport Learning for Retinal Fundus Image Enhancement

no code implementations12 Sep 2024 Vamsi Krishna Vasa, Peijie Qiu, Wenhui Zhu, Yujian Xiong, Oana Dumitrascu, Yalin Wang

Retinal fundus photography offers a non-invasive way to diagnose and monitor a variety of retinal diseases, but is prone to inherent quality glitches arising from systemic imperfections or operator/patient-related factors.

Image Enhancement

AMG: Avatar Motion Guided Video Generation

1 code implementation2 Sep 2024 Zhangsihao Yang, Mengyi Shan, Mohammad Farazi, Wenhui Zhu, Yanxi Chen, Xuanzhao Dong, Yalin Wang

Human video generation task has gained significant attention with the advancement of deep generative models.

Video Generation

RBAD: A Dataset and Benchmark for Retinal Vessels Branching Angle Detection

1 code implementation17 Jul 2024 Hao Wang, Wenhui Zhu, Jiayou Qin, Xin Li, Oana Dumitrascu, Xiwen Chen, Peijie Qiu, Abolfazl Razi

Detecting retinal image analysis, particularly the geometrical features of branching points, plays an essential role in diagnosing eye diseases.

DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification

1 code implementation4 Jul 2024 Wenhui Zhu, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang

Second, we propose two mechanisms to enforce the diversity among the global vectors to be more descriptive of the entire bag: (i) positive instance alignment and (ii) a novel, efficient, and theoretically guaranteed diversification learning paradigm.

Descriptive Diversity +4

SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation

1 code implementation21 Jun 2024 Wenhui Zhu, Xiwen Chen, Peijie Qiu, Mohammad Farazi, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang

Although numerous follow-up studies have also been dedicated to improving the performance of standard UNet, few have conducted in-depth analyses of the underlying interest pattern of UNet in medical image segmentation.

Decoder Image Segmentation +3

TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning

3 code implementations6 May 2024 Xiwen Chen, Peijie Qiu, Wenhui Zhu, Huayu Li, Hao Wang, Aristeidis Sotiras, Yalin Wang, Abolfazl Razi

Deep neural networks, including transformers and convolutional neural networks, have significantly improved multivariate time series classification (MTSC).

Multiple Instance Learning Time Series +1

Imaging Signal Recovery Using Neural Network Priors Under Uncertain Forward Model Parameters

no code implementations5 May 2024 Xiwen Chen, Wenhui Zhu, Peijie Qiu, Abolfazl Razi

We theoretically demonstrate the convergence of the MA framework, which has a similar complexity with reconstruction under the known forward model parameters.

Compressive Sensing

Reconstructing Retinal Visual Images from 3T fMRI Data Enhanced by Unsupervised Learning

no code implementations7 Apr 2024 Yujian Xiong, Wenhui Zhu, Zhong-Lin Lu, Yalin Wang

The reconstruction of human visual inputs from brain activity, particularly through functional Magnetic Resonance Imaging (fMRI), holds promising avenues for unraveling the mechanisms of the human visual system.

Generative Adversarial Network

SC-MIL: Sparsely Coded Multiple Instance Learning for Whole Slide Image Classification

1 code implementation31 Oct 2023 Peijie Qiu, Pan Xiao, Wenhui Zhu, Yalin Wang, Aristeidis Sotiras

Typical MIL methods include a feature embedding part, which embeds the instances into features via a pre-trained feature extractor, and an MIL aggregator that combines instance embeddings into predictions.

Dictionary Learning image-classification +2

PDL: Regularizing Multiple Instance Learning with Progressive Dropout Layers

1 code implementation19 Aug 2023 Wenhui Zhu, Peijie Qiu, Xiwen Chen, Oana M. Dumitrascu, Yalin Wang

Multiple instance learning (MIL) was a weakly supervised learning approach that sought to assign binary class labels to collections of instances known as bags.

Multiple Instance Learning Weakly Supervised Classification +3

nnMobileNet: Rethinking CNN for Retinopathy Research

2 code implementations2 Jun 2023 Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xin Li, Natasha Lepore, Oana M. Dumitrascu, Yalin Wang

Over the past few decades, convolutional neural networks (CNNs) have been at the forefront of the detection and tracking of various retinal diseases (RD).

Diabetic Retinopathy Grading

A Surface-Based Federated Chow Test Model for Integrating APOE Status, Tau Deposition Measure, and Hippocampal Surface Morphometry

no code implementations31 Mar 2023 Jianfeng Wu, Yi Su, Yanxi Chen, Wenhui Zhu, Eric M. Reiman, Richard J. Caselli, Kewei Chen, Paul M. Thompson, Junwen Wang, Yalin Wang

Objective: To build a surface-based model to 1) detect differences between APOE subgroups in patterns of tau deposition and hippocampal atrophy, and 2) use the extracted surface-based features to predict cognitive decline.

TetCNN: Convolutional Neural Networks on Tetrahedral Meshes

no code implementations8 Feb 2023 Mohammad Farazi, Zhangsihao Yang, Wenhui Zhu, Peijie Qiu, Yalin Wang

Our results show the superiority of our LBO-based convolution layer and adapted pooling over the conventionally used unitary cortical thickness, graph Laplacian, and point cloud representation.

OTRE: Where Optimal Transport Guided Unpaired Image-to-Image Translation Meets Regularization by Enhancing

3 code implementations6 Feb 2023 Wenhui Zhu, Peijie Qiu, Oana M. Dumitrascu, Jacob M. Sobczak, Mohammad Farazi, Zhangsihao Yang, Keshav Nandakumar, Yalin Wang

Non-mydriatic retinal color fundus photography (CFP) is widely available due to the advantage of not requiring pupillary dilation, however, is prone to poor quality due to operators, systemic imperfections, or patient-related causes.

Denoising Diabetic Retinopathy Grading +5

Improved Prediction of Beta-Amyloid and Tau Burden Using Hippocampal Surface Multivariate Morphometry Statistics and Sparse Coding

no code implementations28 Oct 2022 Jianfeng Wu, Yi Su, Wenhui Zhu, Negar Jalili Mallak, Natasha Lepore, Eric M. Reiman, Richard J. Caselli, Paul M. Thompson, Kewei Chen, Yalin Wang

Experimental results suggest that amyloid/tau measurements predicted with our PASCP-MP representations are closer to the real values than the measures derived from other approaches, such as hippocampal surface area, volume, and shape morphometry features based on spherical harmonics (SPHARM).

Anisotropic Multi-Scale Graph Convolutional Network for Dense Shape Correspondence

no code implementations17 Oct 2022 Mohammad Farazi, Wenhui Zhu, Zhangsihao Yang, Yalin Wang

This paper studies 3D dense shape correspondence, a key shape analysis application in computer vision and graphics.

3D Dense Shape Correspondence

Cannot find the paper you are looking for? You can Submit a new open access paper.