no code implementations • 18 Mar 2025 • Rui Cao, Wei Tu, Dongsheng Chen, Wenyu Zhang
We also address the challenges and future directions for further research.
no code implementations • 21 Feb 2025 • Wenyu Zhang, Jie Luo, Xinming Zhang, Yuan Fang
With the explosive growth of multimodal content online, pre-trained visual-language models have shown great potential for multimodal recommendation.
no code implementations • 25 Jan 2025 • Wei Wang, Zhaoxi Chen, Wenyu Zhang, Zetao Wang, Xiang Zhao, Chenyang Li, Jian Guan, Shankai Yin, Gang Li
Objective: The aim of the study is to develop a novel method for improved diagnosis of obstructive sleep apnea-hypopnea syndrome (OSAHS) in clinical or home settings, with the focus on achieving diagnostic performance comparable to the gold-standard polysomnography (PSG) with significantly reduced monitoring burden.
1 code implementation • 2 Jan 2025 • Bin Wang, Xunlong Zou, Shuo Sun, Wenyu Zhang, Yingxu He, Zhuohan Liu, Chengwei Wei, Nancy F. Chen, AiTi Aw
Singlish, a Creole language rooted in English, is a key focus in linguistic research within multilingual and multicultural contexts.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 31 Dec 2024 • Gaofeng Chen, Yaoduo Zhang, Li Huang, Pengfei Wang, Wenyu Zhang, Dong Zeng, Jianhua Ma, Ji He
Considering that the ultimate goals of the three tasks are all to reconstruct high-quality CT images, we therefore construct a set of cross-task mutual learning objectives for the three tasks, in which way, the three neural networks can be self-supervised optimized by learning from each other.
1 code implementation • 17 Dec 2024 • Wenyu Zhang, Wei En Ng, Lixin Ma, Yuwen Wang, Jungqi Zhao, Boyang Li, Lu Wang
Current vision-language models may incorporate single-dimensional spatial cues, such as depth, object boundary, and basic spatial directions (e. g. left, right, front, back), yet often lack the multi-dimensional spatial reasoning necessary for human-like understanding and real-world applications.
no code implementations • 13 Dec 2024 • Yingxu He, Zhuohan Liu, Shuo Sun, Bin Wang, Wenyu Zhang, Xunlong Zou, Nancy F. Chen, Ai Ti Aw
We introduce MERaLiON-AudioLLM (Multimodal Empathetic Reasoning and Learning in One Network), the first speech-text model tailored for Singapore's multilingual and multicultural landscape.
no code implementations • 13 Dec 2024 • Wenyu Zhang, Mohammad J. Khojasteh, Nikolay A. Atanasov, Florian Meyer
Recently proposed stochastic PFL introduces a diffusion term in the ordinary differential equation (ODE) that describes particle motion.
no code implementations • 28 Sep 2024 • Wei Wang, Chenyang Li, Zhaoxi Chen, Wenyu Zhang, Zetao Wang, Xi Guo, Jian Guan, Gang Li
Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS) is a sleep-related breathing disorder associated with significant morbidity and mortality worldwide.
no code implementations • 28 Sep 2024 • Wei Wang, Ruobing Song, Yunxiao Wu, Li Zheng, Wenyu Zhang, Zhaoxi Chen, Gang Li, Zhifei Xu
Study Objectives: To evaluate the agreement between the millimeter-wave radar-based device and polysomnography (PSG) in diagnosis of obstructive sleep apnea (OSA) and classification of sleep stage in children.
no code implementations • 10 Sep 2024 • Wenyu Zhang, Shuo Sun, Bin Wang, Xunlong Zou, Zhuohan Liu, Yingxu He, Geyu Lin, Nancy F. Chen, Ai Ti Aw
The rapid advancements in large language models (LLMs) have significantly enhanced natural language processing capabilities, facilitating the development of AudioLLMs that process and understand speech and audio inputs alongside text.
no code implementations • 30 Aug 2024 • Wenyu Zhang, Mohammad Javad Khojasteh, Florian Meyer
Our numerical results demonstrate that the proposed method can correctly determine the number of sources and provide accurate location estimates.
no code implementations • 4 Aug 2024 • Khoi Nguyen Tiet Nguyen, Wenyu Zhang, Kangkang Lu, YuHuan Wu, Xingjian Zheng, Hui Li Tan, Liangli Zhen
Deep learning models excel in various computer vision tasks but are susceptible to adversarial examples-subtle perturbations in input data that lead to incorrect predictions.
no code implementations • 16 Jul 2024 • Guoliang You, Xiaomeng Chu, Yifan Duan, Wenyu Zhang, Xingchen Li, Sha Zhang, Yao Li, Jianmin Ji, Yanyong Zhang
In this work, we endeavor to integrate the perception of these elements into the planning task.
1 code implementation • 23 Jun 2024 • Bin Wang, Xunlong Zou, Geyu Lin, Shuo Sun, Zhuohan Liu, Wenyu Zhang, Zhengyuan Liu, AiTi Aw, Nancy F. Chen
We introduce AudioBench, a universal benchmark designed to evaluate Audio Large Language Models (AudioLLMs).
Ranked #1 on
Audio Scene Understanding
on Clotho-AQA
2 code implementations • 14 Jun 2024 • Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, JENNIFER SANTOSO, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Railey Montalan, Ryan Ignatius, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Ngee Chia Tai, Ayu Purwarianti, Sebastian Ruder, William Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng-Xin Yong, Samuel Cahyawijaya
Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1, 300 indigenous languages and a population of 671 million people.
no code implementations • 4 Jun 2024 • Mohamed Ragab, Peiliang Gong, Emadeldeen Eldele, Wenyu Zhang, Min Wu, Chuan-Sheng Foo, Daoqiang Zhang, XiaoLi Li, Zhenghua Chen
MAPU addresses the critical challenge of temporal consistency by introducing a novel temporal imputation task.
1 code implementation • 5 May 2024 • Wenyu Zhang, Li Shen, Chuan-Sheng Foo
For adaptation, we propose the Co-learn algorithm to improve target pseudolabel quality collaboratively through the source model and a pre-trained feature extractor.
1 code implementation • CVPR 2024 • Wenyu Zhang, Qingmu Liu, Felix Ong Wei Cong, Mohamed Ragab, Chuan-Sheng Foo
UniSSDA is at the intersection of Universal Domain Adaptation (UniDA) and Semi-Supervised Domain Adaptation (SSDA): the UniDA setting does not allow for fine-grained categorization of target private classes not represented in the source domain, while SSDA focuses on the restricted closed-set setting where source and target label spaces match exactly.
1 code implementation • 16 Sep 2023 • Wenyu Zhang, Xin Deng, Baojun Jia, Xingtong Yu, Yifan Chen, Jin Ma, Qing Ding, Xinming Zhang
Additionally, we introduce the MLP-based Sequential Residual Block (MSRB) for robust feature extraction from text images, and a Local Contour Awareness loss ($\mathcal{L}_{lca}$) to enhance the model's perception of details.
1 code implementation • 18 Jul 2023 • Wenyu Zhang, Qing Ding, Jian Hu, Yi Ma, Mingzhe Lu
Based on these two modules, we consulted the ResNet and design a pixel-wise graph attention network (PGANet).
no code implementations • 21 Mar 2023 • Wenyu Zhang, Kaiyuan Bai, Sherali Zeadally, Haijun Zhang, Hua Shao, Hui Ma, Victor C. M. Leung
Semantic communication is a new paradigm that exploits deep learning models to enable end-to-end communications processes, and recent studies have shown that it can achieve better noise resiliency compared with traditional communication schemes in a low signal-to-noise (SNR) regime.
no code implementations • 30 Dec 2022 • Wenyu Zhang, Bryan Teague, Florian Meyer
A numerical case study demonstrates the intelligent behavior of a single controlled anchor in a 3-D scenario and the resulting significantly improved localization accuracy.
no code implementations • 30 Dec 2022 • Wenyu Zhang, Florian Meyer
We perform a numerical evaluation in a passive acoustic monitoring scenario where multiple sources are tracked in 3-D from 1-D time-difference-of-arrival (TDOA) measurements provided by pairs of hydrophones.
1 code implementation • ICCV 2023 • Wenyu Zhang, Li Shen, Chuan-Sheng Foo
We propose to distil useful target domain information through a co-learning strategy to improve target pseudolabel quality for finetuning the source model.
no code implementations • 20 Jun 2022 • Erik Leitinger, Bryan Teague, Wenyu Zhang, Mingchao Liang, Florian Meyer
A promising approach to address this problem is to exchange radio signals between mobile agents and static physical anchors (PAs) that bounce off flat surfaces in the indoor environment.
no code implementations • 16 Jun 2022 • Wenyu Zhang, Mohamed Ragab, Chuan-Sheng Foo
Domain generalization methods aim to learn models robust to domain shift with data from a limited number of source domains and without access to target domain samples during training.
1 code implementation • 30 May 2022 • Wenyu Zhang, Li Shen, Wanyue Zhang, Chuan-Sheng Foo
Recent test-time adaptation methods update batch normalization layers of pre-trained source models deployed in new target environments with streaming data to mitigate such performance degradation.
1 code implementation • CVPR 2023 • Xin Deng, Wenyu Zhang, Qing Ding, Xinming Zhang
In point cloud analysis, point-based methods have rapidly developed in recent years.
Ranked #1 on
3D Semantic Segmentation
on OpenTrench3D
no code implementations • 29 Sep 2021 • Wenyu Zhang, Chuan-Sheng Foo, Mohamed Ragab
Domain generalization aims to learn models robust to domain shift, with limited source domains at training and without any access to target domain samples except at test time.
no code implementations • 29 Sep 2021 • Wenyu Zhang, Li Shen, Chuan-Sheng Foo, Wanyue Zhang
Test-time adaptation of pre-trained source models with streaming unlabelled target data is an attractive setting that protects the privacy of source data, but it has mini-batch size and class-distribution requirements on the streaming data which might not be desirable in practice.
1 code implementation • 23 Sep 2021 • Astha Garg, Wenyu Zhang, Jules Samaran, Savitha Ramasamy, Chuan-Sheng Foo
Several techniques for multivariate time series anomaly detection have been proposed recently, but a systematic comparison on a common set of datasets and metrics is lacking.
no code implementations • 16 Mar 2021 • Wenyu Zhang, Florian Meyer
Seamless situational awareness provided by modern radar systems relies on effective methods for multiobject tracking (MOT).
no code implementations • 17 Feb 2021 • Wenyu Zhang, Mohamed Ragab, Ramon Sagarna
In this paper, we propose Domain-Free Domain Generalization (DFDG), a model-agnostic method to achieve better generalization performance on the unseen test domain without the need for source domain labels.
no code implementations • 17 Feb 2021 • Wenyu Zhang
We propose POLA (Predicting Online by Learning rate Adaptation) to automatically regulate the learning rate of recurrent neural network models to adapt to changing time series patterns across time.
1 code implementation • 21 Oct 2020 • An Nguyen, Wenyu Zhang, Leo Schwinn, Bjoern Eskofier
Process Mining has recently gained popularity in healthcare due to its potential to provide a transparent, objective and data-based view on processes.
1 code implementation • 24 Aug 2020 • Skyler Seto, Martin T. Wells, Wenyu Zhang
Deep neural networks achieve state-of-the-art performance in a variety of tasks by extracting a rich set of features from unstructured data, however this performance is closely tied to model size.
1 code implementation • 18 Jul 2020 • Wenyu Zhang, Maryclare Griffin, David S. Matteson
In this paper, we assume that measurements during the trend period are independent deviations from a smooth nonlinear function of time, and that measurements during the equilibrium period are characterized by a simple long memory model.
Applications Quantitative Methods
no code implementations • 26 Mar 2020 • Wenyu Zhang, Skyler Seto, Devesh K. Jha
The purpose of these agents is to quickly adapt and/or generalize their notion of physics of interaction in the real world based on certain features about the interacting objects that provide different contexts to the predictive models.
no code implementations • 27 Jan 2020 • Wenyu Zhang, Devesh K. Jha, Emil Laftchiev, Daniel Nikovski
In the most general setting of these types of problems, one or more samples of data across multiple time series can be assigned several concurrent fault labels from a finite, known set and the task is to predict the possibility of fault occurrence over a desired time horizon.
no code implementations • 15 Oct 2018 • Wenyu Zhang, Daniel Gilbert, David Matteson
Change detection involves segmenting sequential data such that observations in the same segment share some desired properties.
no code implementations • 21 Dec 2015 • Skyler Seto, Wenyu Zhang, Yichen Zhou
Accurate and computationally efficient means for classifying human activities have been the subject of extensive research efforts.