Search Results for author: Wenyu Zhang

Found 42 papers, 15 papers with code

Mapping Urban Villages in China: Progress and Challenges

no code implementations18 Mar 2025 Rui Cao, Wei Tu, Dongsheng Chen, Wenyu Zhang

We also address the challenges and future directions for further research.

Bridging Domain Gaps between Pretrained Multimodal Models and Recommendations

no code implementations21 Feb 2025 Wenyu Zhang, Jie Luo, Xinming Zhang, Yuan Fang

With the explosive growth of multimodal content online, pre-trained visual-language models have shown great potential for multimodal recommendation.

Multimodal Recommendation

Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome

no code implementations25 Jan 2025 Wei Wang, Zhaoxi Chen, Wenyu Zhang, Zetao Wang, Xiang Zhao, Chenyang Li, Jian Guan, Shankai Yin, Gang Li

Objective: The aim of the study is to develop a novel method for improved diagnosis of obstructive sleep apnea-hypopnea syndrome (OSAHS) in clinical or home settings, with the focus on achieving diagnostic performance comparable to the gold-standard polysomnography (PSG) with significantly reduced monitoring burden.

Diagnostic Sleep Staging +1

SS-CTML: Self-Supervised Cross-Task Mutual Learning for CT Image Reconstruction

no code implementations31 Dec 2024 Gaofeng Chen, Yaoduo Zhang, Li Huang, Pengfei Wang, Wenyu Zhang, Dong Zeng, Jianhua Ma, Ji He

Considering that the ultimate goals of the three tasks are all to reconstruct high-quality CT images, we therefore construct a set of cross-task mutual learning objectives for the three tasks, in which way, the three neural networks can be self-supervised optimized by learning from each other.

Computed Tomography (CT) Image Reconstruction

SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models

1 code implementation17 Dec 2024 Wenyu Zhang, Wei En Ng, Lixin Ma, Yuwen Wang, Jungqi Zhao, Boyang Li, Lu Wang

Current vision-language models may incorporate single-dimensional spatial cues, such as depth, object boundary, and basic spatial directions (e. g. left, right, front, back), yet often lack the multi-dimensional spatial reasoning necessary for human-like understanding and real-world applications.

Logical Reasoning Spatial Reasoning

MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models

no code implementations13 Dec 2024 Yingxu He, Zhuohan Liu, Shuo Sun, Bin Wang, Wenyu Zhang, Xunlong Zou, Nancy F. Chen, Ai Ti Aw

We introduce MERaLiON-AudioLLM (Multimodal Empathetic Reasoning and Learning in One Network), the first speech-text model tailored for Singapore's multilingual and multicultural landscape.

speech-recognition Speech Recognition

Importance Sampling With Stochastic Particle Flow and Diffusion Optimization

no code implementations13 Dec 2024 Wenyu Zhang, Mohammad J. Khojasteh, Nikolay A. Atanasov, Florian Meyer

Recently proposed stochastic PFL introduces a diffusion term in the ordinary differential equation (ODE) that describes particle motion.

Numerical Integration

Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter

no code implementations28 Sep 2024 Wei Wang, Chenyang Li, Zhaoxi Chen, Wenyu Zhang, Zetao Wang, Xi Guo, Jian Guan, Gang Li

Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS) is a sleep-related breathing disorder associated with significant morbidity and mortality worldwide.

Temporal Localization

Deep Learning-based Automated Diagnosis of Obstructive Sleep Apnea and Sleep Stage Classification in Children Using Millimeter-wave Radar and Pulse Oximeter

no code implementations28 Sep 2024 Wei Wang, Ruobing Song, Yunxiao Wu, Li Zheng, Wenyu Zhang, Zhaoxi Chen, Gang Li, Zhifei Xu

Study Objectives: To evaluate the agreement between the millimeter-wave radar-based device and polysomnography (PSG) in diagnosis of obstructive sleep apnea (OSA) and classification of sleep stage in children.

Classification Sleep Staging +1

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders

no code implementations10 Sep 2024 Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw

The rapid advancements in large language models (LLMs) have significantly enhanced natural language processing capabilities, facilitating the development of AudioLLMs that process and understand speech and audio inputs alongside text.

Particle Flows for Source Localization in 3-D Using TDOA Measurements

no code implementations30 Aug 2024 Wenyu Zhang, Mohammad Javad Khojasteh, Florian Meyer

Our numerical results demonstrate that the proposed method can correctly determine the number of sources and provide accurate location estimates.

A Survey and Evaluation of Adversarial Attacks for Object Detection

no code implementations4 Aug 2024 Khoi Nguyen Tiet Nguyen, Wenyu Zhang, Kangkang Lu, YuHuan Wu, Xingjian Zheng, Hui Li Tan, Liangli Zhen

Deep learning models excel in various computer vision tasks but are susceptible to adversarial examples-subtle perturbations in input data that lead to incorrect predictions.

Adversarial Robustness Autonomous Vehicles +5

Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training

1 code implementation5 May 2024 Wenyu Zhang, Li Shen, Chuan-Sheng Foo

For adaptation, we propose the Co-learn algorithm to improve target pseudolabel quality collaboratively through the source model and a pre-trained feature extractor.

Language Modelling Representation Learning +2

Universal Semi-Supervised Domain Adaptation by Mitigating Common-Class Bias

1 code implementation CVPR 2024 Wenyu Zhang, Qingmu Liu, Felix Ong Wei Cong, Mohamed Ragab, Chuan-Sheng Foo

UniSSDA is at the intersection of Universal Domain Adaptation (UniDA) and Semi-Supervised Domain Adaptation (SSDA): the UniDA setting does not allow for fine-grained categorization of target private classes not represented in the source domain, while SSDA focuses on the restricted closed-set setting where source and target label spaces match exactly.

Pseudo Label Semi-supervised Domain Adaptation +1

Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution

1 code implementation16 Sep 2023 Wenyu Zhang, Xin Deng, Baojun Jia, Xingtong Yu, Yifan Chen, Jin Ma, Qing Ding, Xinming Zhang

Additionally, we introduce the MLP-based Sequential Residual Block (MSRB) for robust feature extraction from text images, and a Local Contour Awareness loss ($\mathcal{L}_{lca}$) to enhance the model's perception of details.

Graph Attention Image Super-Resolution

Pixel-wise Graph Attention Networks for Person Re-identification

1 code implementation18 Jul 2023 Wenyu Zhang, Qing Ding, Jian Hu, Yi Ma, Mingzhe Lu

Based on these two modules, we consulted the ResNet and design a pixel-wise graph attention network (PGANet).

Graph Attention Graph Generation +1

DeepMA: End-to-end Deep Multiple Access for Wireless Image Transmission in Semantic Communication

no code implementations21 Mar 2023 Wenyu Zhang, Kaiyuan Bai, Sherali Zeadally, Haijun Zhang, Hua Shao, Hui Ma, Victor C. M. Leung

Semantic communication is a new paradigm that exploits deep learning models to enable end-to-end communications processes, and recent studies have shown that it can achieve better noise resiliency compared with traditional communication schemes in a low signal-to-noise (SNR) regime.

Decoder Privacy Preserving +1

Active Planning for Cooperative Localization: A Fisher Information Approach

no code implementations30 Dec 2022 Wenyu Zhang, Bryan Teague, Florian Meyer

A numerical case study demonstrates the intelligent behavior of a single controlled anchor in a 3-D scenario and the resulting significantly improved localization accuracy.

Multisensor Multiobject Tracking with Improved Sampling Efficiency

no code implementations30 Dec 2022 Wenyu Zhang, Florian Meyer

We perform a numerical evaluation in a passive acoustic monitoring scenario where multiple sources are tracked in 3-D from 1-D time-difference-of-arrival (TDOA) measurements provided by pairs of hydrophones.

Object object-detection +3

Rethinking the Role of Pre-Trained Networks in Source-Free Domain Adaptation

1 code implementation ICCV 2023 Wenyu Zhang, Li Shen, Chuan-Sheng Foo

We propose to distil useful target domain information through a co-learning strategy to improve target pseudolabel quality for finetuning the source model.

Representation Learning Source-Free Domain Adaptation +1

Data Fusion for Radio Frequency SLAM with Robust Sampling

no code implementations20 Jun 2022 Erik Leitinger, Bryan Teague, Wenyu Zhang, Mingchao Liang, Florian Meyer

A promising approach to address this problem is to exchange radio signals between mobile agents and static physical anchors (PAs) that bounce off flat surfaces in the indoor environment.

Indoor Localization Simultaneous Localization and Mapping

Domain Generalization via Selective Consistency Regularization for Time Series Classification

no code implementations16 Jun 2022 Wenyu Zhang, Mohamed Ragab, Chuan-Sheng Foo

Domain generalization methods aim to learn models robust to domain shift with data from a limited number of source domains and without access to target domain samples during training.

Classification Domain Generalization +4

Few-Shot Adaptation of Pre-Trained Networks for Domain Shift

1 code implementation30 May 2022 Wenyu Zhang, Li Shen, Wanyue Zhang, Chuan-Sheng Foo

Recent test-time adaptation methods update batch normalization layers of pre-trained source models deployed in new target environments with streaming data to mitigate such performance degradation.

domain classification Semantic Segmentation +1

Selective Cross-Domain Consistency Regularization for Time Series Domain Generalization

no code implementations29 Sep 2021 Wenyu Zhang, Chuan-Sheng Foo, Mohamed Ragab

Domain generalization aims to learn models robust to domain shift, with limited source domains at training and without any access to target domain samples except at test time.

Domain Generalization Representation Learning +3

Source-Free Few-Shot Domain Adaptation

no code implementations29 Sep 2021 Wenyu Zhang, Li Shen, Chuan-Sheng Foo, Wanyue Zhang

Test-time adaptation of pre-trained source models with streaming unlabelled target data is an attractive setting that protects the privacy of source data, but it has mini-batch size and class-distribution requirements on the streaming data which might not be desirable in practice.

domain classification Test-time Adaptation

An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series

1 code implementation23 Sep 2021 Astha Garg, Wenyu Zhang, Jules Samaran, Savitha Ramasamy, Chuan-Sheng Foo

Several techniques for multivariate time series anomaly detection have been proposed recently, but a systematic comparison on a common set of datasets and metrics is lacking.

Anomaly Detection Time Series +1

Graph-Based Multiobject Tracking with Embedded Particle Flow

no code implementations16 Mar 2021 Wenyu Zhang, Florian Meyer

Seamless situational awareness provided by modern radar systems relies on effective methods for multiobject tracking (MOT).

Object object-detection +1

Robust Domain-Free Domain Generalization with Class-aware Alignment

no code implementations17 Feb 2021 Wenyu Zhang, Mohamed Ragab, Ramon Sagarna

In this paper, we propose Domain-Free Domain Generalization (DFDG), a model-agnostic method to achieve better generalization performance on the unseen test domain without the need for source domain labels.

Domain Generalization Image Classification +2

POLA: Online Time Series Prediction by Adaptive Learning Rates

no code implementations17 Feb 2021 Wenyu Zhang

We propose POLA (Predicting Online by Learning rate Adaptation) to automatically regulate the learning rate of recurrent neural network models to adapt to changing time series patterns across time.

Prediction Time Series +1

Conformance Checking for a Medical Training Process Using Petri net Simulation and Sequence Alignment

1 code implementation21 Oct 2020 An Nguyen, Wenyu Zhang, Leo Schwinn, Bjoern Eskofier

Process Mining has recently gained popularity in healthcare due to its potential to provide a transparent, objective and data-based view on processes.

HALO: Learning to Prune Neural Networks with Shrinkage

1 code implementation24 Aug 2020 Skyler Seto, Martin T. Wells, Wenyu Zhang

Deep neural networks achieve state-of-the-art performance in a variety of tasks by extracting a rich set of features from unstructured data, however this performance is closely tied to model size.

Network Pruning

Modeling a Nonlinear Biophysical Trend Followed by Long-Memory Equilibrium with Unknown Change Point

1 code implementation18 Jul 2020 Wenyu Zhang, Maryclare Griffin, David S. Matteson

In this paper, we assume that measurements during the trend period are independent deviations from a smooth nonlinear function of time, and that measurements during the equilibrium period are characterized by a simple long memory model.

Applications Quantitative Methods

CAZSL: Zero-Shot Regression for Pushing Models by Generalizing Through Context

no code implementations26 Mar 2020 Wenyu Zhang, Skyler Seto, Devesh K. Jha

The purpose of these agents is to quickly adapt and/or generalize their notion of physics of interaction in the real world based on certain features about the interacting objects that provide different contexts to the predictive models.

Meta-Learning regression +1

Multi-label Prediction in Time Series Data using Deep Neural Networks

no code implementations27 Jan 2020 Wenyu Zhang, Devesh K. Jha, Emil Laftchiev, Daniel Nikovski

In the most general setting of these types of problems, one or more samples of data across multiple time series can be assigned several concurrent fault labels from a finite, known set and the task is to predict the possibility of fault occurrence over a desired time horizon.

Event Detection General Classification +4

ABACUS: Unsupervised Multivariate Change Detection via Bayesian Source Separation

no code implementations15 Oct 2018 Wenyu Zhang, Daniel Gilbert, David Matteson

Change detection involves segmenting sequential data such that observations in the same segment share some desired properties.

Change Detection Dimensionality Reduction

Cannot find the paper you are looking for? You can Submit a new open access paper.