Search Results for author: Shuai Wang

Found 278 papers, 81 papers with code

Automatic recognition of abdominal lymph nodes from clinical text

1 code implementation EMNLP (ClinicalNLP) 2020 Yifan Peng, SungWon Lee, Daniel C. Elton, Thomas Shen, Yu-Xing Tang, Qingyu Chen, Shuai Wang, Yingying Zhu, Ronald Summers, Zhiyong Lu

We then introduce an end-to-end approach based on the combination of rules and transformer-based methods to detect these abdominal lymph node mentions and classify their types from the MRI radiology reports.

Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition

1 code implementation12 May 2025 Zheng Yao, Shuai Wang, Guido Zuccon

Recent research has questioned the role of fine-tuning vs. that of pre-training within dense retrievers, specifically arguing that retrieval knowledge is primarily gained during pre-training, meaning knowledge not acquired during pre-training cannot be sub-sequentially acquired via fine-tuning.

Contrastive Learning Decoder +2

Reassessing Large Language Model Boolean Query Generation for Systematic Reviews

no code implementations12 May 2025 Shuai Wang, Harrisen Scells, Bevan Koopman, Guido Zuccon

Our results show that query effectiveness varies significantly across models and prompt designs, with guided query formulation benefiting from well-chosen seed studies.

Language Modeling Language Modelling +2

BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts

no code implementations24 Apr 2025 Qingyue Wang, Qi Pang, Xixun Lin, Shuai Wang, Daoyuan Wu

Accordingly, our attack, namely BadMoE, exploits the unique architecture of MoE models by 1) identifying dormant experts unrelated to the target task, 2) constructing a routing-aware loss to optimize the activation triggers of these experts, and 3) promoting dormant experts to dominating roles via poisoned training data.

Backdoor Attack Mixture-of-Experts +1

Seeing The Words: Evaluating AI-generated Biblical Art

no code implementations23 Apr 2025 Hidde Makimei, Shuai Wang, Willem van Peursen

This triggers the discussion of whether AI can generate accurate images using text from the Bible with respect to the corresponding biblical contexts and backgrounds.

Green Robotic Mixed Reality with Gaussian Splatting

no code implementations18 Apr 2025 Chenxuan Liu, He Li, Zongze Li, Shuai Wang, Wei Xu, Kejiang Ye, Derrick Wing Kwan Ng, Chengzhong Xu

Realizing green communication in robotic mixed reality (RoboMR) systems presents a challenge, due to the necessity of uploading high-resolution images at high frequencies through wireless channels.

Mixed Reality SSIM

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

1 code implementation16 Apr 2025 Tianhui Song, Weixin Feng, Shuai Wang, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

The success of text-to-image (T2I) generation models has spurred a proliferation of numerous model checkpoints fine-tuned from the same base model on various specialized datasets.

Image Generation model

GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction

no code implementations14 Apr 2025 Guanyuan Pan, Tiansheng Zhou, Bingtao Ma, Yaqi Wang, Jianxiang Zhao, Shuai Wang

Circuit link prediction identifying missing component connections from incomplete netlists is crucial in automating analog circuit design.

Language Modeling Language Modelling +3

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

no code implementations10 Apr 2025 ByteDance Seed, :, Jiaze Chen, Tiantian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu, Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang, Ruofei Zhu, Zhecheng An, Zhihao Bai, Yu Bao, Xingyan Bin, Jiangjie Chen, Feng Chen, Hongmin Chen, Riwei Chen, Liangqiang Chen, Zixin Chen, Jinsong Chen, Siyan Chen, Kaiyuan Chen, Zhi Chen, Jin Chen, Jiecao Chen, Jinxin Chi, Weinan Dai, Ning Dai, Jiahui Dai, Shihan Dou, Yantao Du, Zhengyin Du, Jianhui Duan, Chen Dun, Ting-Han Fan, Jiazhan Feng, Junda Feng, Ziyuan Feng, Yuwei Fu, Wenqi Fu, Hanjie Fu, Hao Ge, Hongyi Guo, Mingji Han, Li Han, Wenhao Hao, Xintong Hao, Qianyu He, Jerry He, Feng He, Wen Heng, Zehua Hong, Qi Hou, Liang Hu, Shengding Hu, Nan Hu, Kai Hua, Qi Huang, Ziyue Huang, Hongzhi Huang, Zihao Huang, Ting Huang, Wenhao Huang, Wei Jia, Bin Jia, Xiaoying Jia, Yuhua Jiang, Haobin Jiang, Ziheng Jiang, Kaihua Jiang, Chengquan Jiang, Jianpeng Jiao, Xiaoran Jin, Xing Jin, Xunhao Lai, Xiang Li, Liyi Li, Hongkai Li, Zheng Li, Shengxian Wan, Ya Wang, Yunshui Li, Chenggang Li, Niuniu Li, Siyu Li, Xi Li, Xiao Li, Aoyan Li, Yuntao Li, Nianning Liang, Xinnian Liang, Haibin Lin, Weijian Lin, Ye Lin, Zhicheng Liu, Guanlin Liu, Chenxiao Liu, Yan Liu, Gaohong Liu, Juncai Liu, Chundian Liu, Deyi Liu, Kaibo Liu, Siyao Liu, Qi Liu, Yongfei Liu, Kang Liu, Gan Liu, Boyi Liu, Rui Long, Weiqiang Lou, Chenwei Lou, Xiang Luo, Yao Luo, Caiping Lv, Heyang Lv, Bole Ma, Qianli Ma, Hongzhi Ma, Yiyuan Ma, Jin Ma, Wenchang Ma, Tingting Ma, Chen Mao, Qiyang Min, Zhe Nan, Guanghan Ning, Jinxiang Ou, Haojie Pan, Renming Pang, Yanghua Peng, Tao Peng, Lihua Qian, Mu Qiao, Meng Qu, Cheng Ren, Hongbin Ren, Yong Shan, Wei Shen, Ke Shen, Kai Shen, Guangming Sheng, Jinlong Shi, Wenlei Shi, Guang Shi, Shuai Shuai Cao, Yuxin Song, Zuquan Song, Jing Su, Yifan Sun, Tao Sun, Zewei Sun, Borui Wan, Xiaohui Wang, Xi Wang, Shuguang Wang, Jun Wang, Qinlong Wang, Chenyuan Wang, Shuai Wang, Zihan Wang, Changbao Wang, Jiaqiang Wang, Shihang Wang, Xuwu Wang, Zaiyuan Wang, Yuxuan Wang, Wenqi Wang, Taiqing Wang, Chengzhi Wei, Houmin Wei, Ziyun Wei, Shufa Wei, Zheng Wu, Yonghui Wu, Yangjun Wu, Bohong Wu, Shuang Wu, Jingqiao Wu, Ning Wu, Shuangzhi Wu, Jianmin Wu, Chenguang Xi, Fan Xia, Yuqiao Xian, Liang Xiang, Boren Xiang, Bowen Xiao, Zhen Xiao, Xia Xiao, Yongsheng Xiao, Chao Xin, Shulin Xin, Yuwen Xiong, Jingjing Xu, Ziwen Xu, Chenyin Xu, Jiayi Xu, Yifan Xu, Wei Xu, Yufei Xu, Shikun Xu, Shipeng Yan, Shen Yan, Qingping Yang, Xi Yang, Tianhao Yang, Yuehang Yang, Yuan Yang, Ximing Yang, Zeyu Yang, Guang Yang, Yifan Yang, Xuesong Yao, Bairen Yi, Fan Yin, Jianian Yin, Ziqiang Ying, Xiangyu Yu, Hongli Yu, Song Yu, Menghan Yu, Huan Yu, Siyu Yuan, Jun Yuan, Yutao Zeng, Tianyang Zhan, Zheng Zhang, Yun Zhang, Mofan Zhang, Wang Zhang, Ru Zhang, Zhi Zhang, Tianqi Zhang, Xinyi Zhang, Zhexi Zhang, Sijun Zhang, Wenqiang Zhang, Xiangxiang Zhang, Yongtao Zhang, Yuyu Zhang, Ge Zhang, He Zhang, Yue Zhang, Renjie Zheng, Ningxin Zheng, Zhuolin Zheng, Yaowei Zheng, Chen Zheng, Xiaoyun Zhi, Wanjun Zhong, Cheng Zhong, Zheng Zhong, Baoquan Zhong, Xun Zhou, Na Zhou, Huan Zhou, Hang Zhu, Defa Zhu, Wenjia Zhu, Lei Zuo

We introduce Seed1. 5-Thinking, capable of reasoning through thinking before responding, resulting in improved performance on a wide range of benchmarks.

Mixture-of-Experts reinforcement-learning +1

DDT: Decoupled Diffusion Transformer

1 code implementation8 Apr 2025 Shuai Wang, Zhi Tian, Weilin Huang, LiMin Wang

For ImageNet $256\times256$, Our DDT-XL/2 achieves a new state-of-the-art performance of {1. 31 FID}~(nearly $4\times$ faster training convergence compared to previous diffusion transformers).

Denoising Image Generation

Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation

no code implementations3 Apr 2025 Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li

The pretrained frontend employs a transformer decoder network with a causal convolutional encoder as the backbone and is pretrained in a self-supervised manner with two innovative pretext tasks: autoregressive hybrid prediction and contextual knowledge distillation.

Decoder Knowledge Distillation +1

$C^2$AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction

no code implementations1 Apr 2025 Wenxuan Wu, Xueyuan Chen, Shuai Wang, Jiadong Wang, Lingwei Meng, Xixin Wu, Helen Meng, Haizhou Li

Audio-Visual Target Speaker Extraction (AV-TSE) aims to mimic the human ability to enhance auditory perception using visual cues.

Target Speaker Extraction

Advancing THz Radio Map Construction and Obstacle Sensing: An Integrated Generative Framework in ISAC

no code implementations29 Mar 2025 Tianyu Hu, Shuai Wang, Yunhang Xie, Lingxiang Li, Zhi Chen, Boyu Ning, Wassim Hamidouche, Lina Bariah, Samson Lasaulce, Merouane Debbah

Integrated sensing and communication (ISAC) in the terahertz (THz) band enables obstacle detection, which in turn facilitates efficient beam management to mitigate THz signal blockage.

Generative Adversarial Network Integrated sensing and communication +1

GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics

no code implementations27 Mar 2025 Arsham Gholamzadeh Khoee, Shuai Wang, Yinan Yu, Robert Feldt, Dhasarathy Parthasarathy

Ensuring the reliability and effectiveness of software release decisions is critical, particularly in safety-critical domains like automotive systems.

Benchmarking Natural Language Queries

STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models

no code implementations23 Mar 2025 Xunguang Wang, Wenxuan Wang, Zhenlan Ji, Zongjie Li, Pingchuan Ma, Daoyuan Wu, Shuai Wang

Large Language Models (LLMs) have become increasingly vulnerable to jailbreak attacks that circumvent their safety mechanisms.

CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning

1 code implementation5 Mar 2025 Yuqi Zhou, Shuai Wang, Sunhao Dai, Qinglin Jia, Zhaocheng Du, Zhenhua Dong, Jun Xu

The subtask level, linking high-level goals with low-level executable actions, is crucial for task completion but faces two challenges: ineffective subtasks that lower-level agent cannot execute and inefficient subtasks that fail to contribute to the completion of the higher-level task.

OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale

1 code implementation4 Mar 2025 Haoyang Li, Shang Wu, Xiaokang Zhang, Xinmei Huang, Jing Zhang, Fuxin Jiang, Shuai Wang, Tieying Zhang, Jianjun Chen, Rui Shi, Hong Chen, Cuiping Li

Text-to-SQL, the task of translating natural language questions into SQL queries, plays a crucial role in enabling non-experts to interact with databases.

Text-To-SQL

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

no code implementations3 Mar 2025 Ziqian Ning, Huakang Chen, Yuepeng Jiang, Chunbo Hao, Guobin Ma, Shuai Wang, Jixun Yao, Lei Xie

To address these challenges, we propose DiffRhythm, the first latent diffusion-based song generation model capable of synthesizing complete songs with both vocal and accompaniment for durations of up to 4m45s in only ten seconds, maintaining high musicality and intelligibility.

Music Generation

Memory-Free and Parallel Computation for Quantized Spiking Neural Networks

no code implementations25 Feb 2025 Dehao Zhang, Shuai Wang, Yichen Xiao, Wenjie Wei, Yimeng Shan, Malu Zhang, Yang Yang

In this study, we first identify a new underlying cause for this decline: the loss of historical information due to the quantized membrane potential.

Computational Efficiency Quantization

GuidedBench: Equipping Jailbreak Evaluation with Guidelines

no code implementations24 Feb 2025 Ruixuan Huang, Xunguang Wang, Zongjie Li, Daoyuan Wu, Shuai Wang

Some jailbreak methods that claim to achieve over 90% attack success rate (ASR) on other benchmarks only reach a maximum of 30. 2% on our benchmark, providing a higher ceiling for more advanced jailbreak research; furthermore, using our scoring system reduces the variance of disagreements between different evaluator LLMs by up to 76. 33%.

Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism

1 code implementation20 Feb 2025 Yu Liang, Wenjie Wei, Ammar Belatreche, Honglin Cao, Zijian Zhou, Shuai Wang, Malu Zhang, Yang Yang

Binary Spiking Neural Networks (BSNNs) inherit the eventdriven paradigm of SNNs, while also adopting the reduced storage burden of binarization techniques.

Binarization

Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics

no code implementations17 Feb 2025 Shuqi Yang, Mingrui Jing, Shuai Wang, Jiaxin Kou, Manfei Shi, Weijie Xing, Yan Hu, Zheng Zhu

This study reviewed the use of Large Language Models (LLMs) in healthcare, focusing on their training corpora, customization techniques, and evaluation metrics.

Fairness Prompt Engineering

Can't See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs

no code implementations16 Feb 2025 Wenxuan Wang, Xiaoyuan Liu, Kuiyi Gao, Jen-tse Huang, Youliang Yuan, Pinjia He, Shuai Wang, Zhaopeng Tu

Multimodal Large Language Models (MLLMs) have expanded the capabilities of traditional language models by enabling interaction through both text and images.

Benchmarking

LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing

no code implementations14 Feb 2025 Kuan Li, Liwen Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Shuai Wang, Minhao Cheng

However, the advancements in context window size for LLMs offer an alternative approach, raising the question of whether RAG remains necessary for effectively handling external knowledge.

Benchmarking RAG

Estimating Probabilities of Causation with Machine Learning Models

no code implementations13 Feb 2025 Shuai Wang, Ang Li

To estimate these probabilities for subpopulations with insufficient data, we propose using machine learning models that draw insights from subpopulations with sufficient data.

Towards THz-based Obstacle Sensing: A Generative Radio Environment Awareness Framework

no code implementations11 Feb 2025 Tianyu Hu, Yunhang Xie, Shuai Wang, Boyu Ning, Lingxiang Li, Zhi Chen

To solve such a problem, we propose a THz-based generative radio environment awareness framework, in which obstacle information is obtained directly from the aware radio environment.

Generative Adversarial Network

Automating a Complete Software Test Process Using LLMs: An Automotive Case Study

no code implementations6 Feb 2025 Shuai Wang, Yinan Yu, Robert Feldt, Dhasarathy Parthasarathy

Vehicle API testing verifies whether the interactions between a vehicle's internal systems and external applications meet expectations, ensuring that users can access and control various vehicle functions and data.

valid

Label Anything: An Interpretable, High-Fidelity and Prompt-Free Annotator

no code implementations5 Feb 2025 Wei-Bin Kou, Guangxu Zhu, Rongguang Ye, Shuai Wang, Ming Tang, Yik-Chung Wu

To mitigate this cost of manual labeling, we propose a Label Anything Model (denoted as LAM), serving as an interpretable, high-fidelity, and prompt-free data annotator.

Autonomous Driving

High-Precision Fabric Defect Detection via Adaptive Shape Convolutions and Large Kernel Spatial Modeling

no code implementations24 Jan 2025 Shuai Wang, Yang Xu, Hui Zheng, Baotian Li

Detecting fabric defects in the textile industry remains a challenging task due to the diverse and complex nature of defect patterns.

Defect Detection

ACEBench: Who Wins the Match Point in Tool Learning?

no code implementations22 Jan 2025 Chen Chen, Xinlong Hao, Weiwen Liu, Xu Huang, Xingshan Zeng, Shuai Yu, Dexun Li, Shuai Wang, Weinan Gan, Yuefeng Huang, Wulong Liu, Xinzhi Wang, Defu Lian, Baoqun Yin, Yasheng Wang, Wu Liu

Normal evaluates function calls in basic scenarios; Special evaluates function calls in scenarios with vague or incomplete instructions; Agent introduces multi-agent interactions to simulate function calling evaluation in real-world multi-turn interactions.

Decision Making

Communication-Efficient Federated Learning by Quantized Variance Reduction for Heterogeneous Wireless Edge Networks

no code implementations20 Jan 2025 Shuai Wang, Yanqing Xu, Chaoqun You, Mingjie Shao, Tony Q. S. Quek

In this paper, we propose a novel communication-efficient FL algorithm, named FedQVR, which relies on a sophisticated variance-reduced scheme to achieve heterogeneity-robustness in the presence of quantized transmission and heterogeneous local updates among active edge devices.

Federated Learning Quantization

How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs

no code implementations18 Jan 2025 Jialun Cao, Yuk-Kit Chan, Zixuan Ling, Wenxuan Wang, Shuqing Li, Mingwei Liu, Ruixi Qiao, Yuting Han, Chaozheng Wang, Boxi Yu, Pinjia He, Shuai Wang, Zibin Zheng, Michael R. Lyu, Shing-Chi Cheung

We propose How2Bench, which is comprised of a 55-criteria checklist as a set of guidelines to govern the development of code-related benchmarks comprehensively.

Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs

no code implementations14 Jan 2025 Shuai Wang, Liang Ding, Yibing Zhan, Yong Luo, Zheng He, Dapeng Tao

Automated code generation using large language models (LLMs) has gained attention due to its efficiency and adaptability.

Code Generation HumanEval

ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification

1 code implementation10 Jan 2025 Yi Ma, Shuai Wang, Tianchi Liu, Haizhou Li

Furthermore, we investigate phonetic traits from within-speaker and between-speaker variation perspectives to determine which trait is most effective for speaker verification, marking an important step towards explainable speaker verification.

Speaker Verification

SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor

no code implementations18 Dec 2024 Chenyu Yang, Shuai Wang, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Yaoxun Xu, Yizhi Zhou, Haina Zhu, Haizhou Li

The emergence of novel generative modeling paradigms, particularly audio language models, has significantly advanced the field of song generation.

Language Modeling Language Modelling

Surrealistic-like Image Generation with Vision-Language Models

1 code implementation18 Dec 2024 Elif Ayten, Shuai Wang, Hjalmar Snoep

Recent advances in generative AI make it convenient to create different types of content, including text, images, and code.

Image Generation

RecSys Arena: Pair-wise Recommender System Evaluation with Large Language Models

1 code implementation15 Dec 2024 Zhuo Wu, Qinglin Jia, Chuhan Wu, Zhaocheng Du, Shuai Wang, Zan Wang, Zhenhua Dong

More specifically, for each sample we use LLM to generate a user profile description based on user behavior history or off-the-shelf profile features, which is used to guide LLM to play the role of this user and evaluate the relative preference for two recommendation results generated by different models.

Chatbot Recommendation Systems

MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues

no code implementations11 Dec 2024 Junjie Li, Ke Zhang, Shuai Wang, Kong Aik Lee, Man-Wai Mak, Haizhou Li

Audio-visual Target Speaker Extraction (AV-TSE) aims to isolate the speech of a specific target speaker from an audio mixture using time-synchronized visual cues.

Target Speaker Extraction

Fab-ME: A Vision State-Space and Attention-Enhanced Framework for Fabric Defect Detection

no code implementations4 Dec 2024 Shuai Wang, Huiyan Kong, Baotian Li, Fa Zheng

Effective defect detection is critical for ensuring the quality, functionality, and economic value of textile products.

Defect Detection

Adaptive Interactive Segmentation for Multimodal Medical Imaging via Selection Engine

1 code implementation29 Nov 2024 Zhi Li, Kai Zhao, Yaqi Wang, Shuai Wang

To mitigate memory bottlenecks and optimize prompt frame selection during the inference of 2D image sequences, we developed an automated system, the Adaptive Frame Selection Engine (AFSE).

Interactive Segmentation Medical Image Analysis +1

2D Matryoshka Training for Information Retrieval

1 code implementation26 Nov 2024 Shuai Wang, Shengyao Zhuang, Bevan Koopman, Guido Zuccon

In this reproducibility study, we implement and evaluate both versions of 2D Matryoshka Training on STS tasks and extend our analysis to retrieval tasks.

Information Retrieval Retrieval +2

mmSpyVR: Exploiting mmWave Radar for Penetrating Obstacles to Uncover Privacy Vulnerability of Virtual Reality

1 code implementation15 Nov 2024 Luoyu MEI, Ruofeng Liu, Zhimeng Yin, Qingchuan Zhao, Wenchao Jiang, Shuai Wang, Kangjie Lu, Tian He

The mmSpyVR demonstrates the capability to extract critical VR privacy from the mmWave signals that have penetrated through obstacles.

Transfer Learning

GUI Agents with Foundation Models: A Comprehensive Survey

no code implementations7 Nov 2024 Shuai Wang, Weiwen Liu, Jingxuan Chen, Yuqi Zhou, Weinan Gan, Xingshan Zeng, Yuhan Che, Shuai Yu, Xinlong Hao, Kun Shao, Bin Wang, Chuhan Wu, Yasheng Wang, Ruiming Tang, Jianye Hao

Recent advances in foundation models, particularly Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs), have facilitated the development of intelligent agents capable of performing complex tasks.

Survey

Speech Separation with Pretrained Frontend to Minimize Domain Mismatch

1 code implementation5 Nov 2024 Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li

As a result, there exists a domain gap between real and synthetic data when deploying speech separation models in real-world applications.

Speech Separation

Beyond Utility: Evaluating LLM as Recommender

1 code implementation1 Nov 2024 Chumeng Jiang, Jiayin Wang, Weizhi Ma, Charles L. A. Clarke, Shuai Wang, Chuhan Wu, Min Zhang

We intend our evaluation framework and observations to benefit future research on the use of LLMs as recommenders.

Position Re-Ranking

The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings

no code implementations31 Oct 2024 Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun

The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge aims to benchmark and advance zero-shot spontaneous style voice cloning, particularly focusing on generating spontaneous behaviors in conversational speech.

Voice Cloning

FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution

no code implementations30 Oct 2024 Shuai Wang, Zexian Li, Tianhui Song, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang

Arbitrary-resolution image generation still remains a challenging task in AIGC, as it requires handling varying resolutions and aspect ratios while maintaining high visual quality.

Image Generation

Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification

no code implementations22 Oct 2024 Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian

In this paper, we propose Prototype and Instance Contrastive Learning (PICL), a novel method for unsupervised domain adaptation in speaker verification through dual-level contrastive learning.

Contrastive Learning Speaker Verification +1

Multi-Level Speaker Representation for Target Speaker Extraction

1 code implementation21 Oct 2024 Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li

In this work, we propose a multi-level speaker representation approach, from raw features to neural embeddings, to serve as the speaker reference cue.

Target Speaker Extraction

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

1 code implementation19 Oct 2024 Jingxuan Chen, Derek Yuen, Bin Xie, Yuhao Yang, Gongwei Chen, Zhihao Wu, Li Yixing, Xurui Zhou, Weiwen Liu, Shuai Wang, Kaiwen Zhou, Rui Shao, Liqiang Nie, Yasheng Wang, Jianye Hao, Jun Wang, Kun Shao

SPA-Bench offers three key contributions: (1) A diverse set of tasks covering system and third-party apps in both English and Chinese, focusing on features commonly used in daily routines; (2) A plug-and-play framework enabling real-time agent interaction with Android devices, integrating over ten agents with the flexibility to add more; (3) A novel evaluation pipeline that automatically assesses agent performance across multiple dimensions, encompassing seven metrics related to task completion and resource consumption.

AI Agent Benchmarking +2

Starbucks: Improved Training for 2D Matryoshka Embeddings

1 code implementation17 Oct 2024 Shengyao Zhuang, Shuai Wang, Bevan Koopman, Guido Zuccon

Effective approaches that can scale embedding model depth (i. e. layers) and embedding size allow for the creation of models that are highly scalable across different computational resources and task requirements.

Language Modelling text similarity

StepTool: A Step-grained Reinforcement Learning Framework for Tool Learning in LLMs

1 code implementation10 Oct 2024 Yuanqing Yu, Zhefan Wang, Weizhi Ma, Zhicheng Guo, Jingtao Zhan, Shuai Wang, Chuhan Wu, Zhiqiang Guo, Min Zhang

Despite having powerful reasoning and inference capabilities, Large Language Models (LLMs) still need external tools to acquire real-time information retrieval or domain-specific expertise to solve complex tasks, which is referred to as tool learning.

Information Retrieval Policy Gradient Methods

Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step

no code implementations4 Oct 2024 Wenxuan Wang, Kuiyi Gao, Zihan Jia, Youliang Yuan, Jen-tse Huang, Qiuzhi Liu, Shuai Wang, Wenxiang Jiao, Zhaopeng Tu

To assess the safety of existing models, we introduce a novel jailbreaking method called Chain-of-Jailbreak (CoJ) attack, which compromises image generation models through a step-by-step editing process.

Image Generation

Metasurface-generated large and arbitrary analog convolution kernels for accelerated machine vision

no code implementations27 Sep 2024 Ruiqi Liang, Shuai Wang, Yiying Dong, Liu Li, Ying Kuang, Bohan Zhang, Yuanmu Yang

Recently, to address the challenges in processing speed and power consumption of conventional digital convolution operations, many optical components have been suggested to replace the digital convolution layer in the neural network, accelerating various machine vision tasks.

Medical Diagnosis

M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions

no code implementations24 Sep 2024 Shuai Wang, Pengcheng Zhu, Haizhou Li

Fixed-dimensional speaker embeddings have become the dominant approach in speaker modeling, typically spanning hundreds to thousands of dimensions.

Speaker Verification

WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction

1 code implementation24 Sep 2024 Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li

Target speaker extraction (TSE) focuses on isolating the speech of a specific target speaker from overlapped multi-talker speech, which is a typical setup in the cocktail party problem.

Management speech-recognition +1

E1 TTS: Simple and Fast Non-Autoregressive TTS

no code implementations14 Sep 2024 Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li

This paper introduces Easy One-Step Text-to-Speech (E1 TTS), an efficient non-autoregressive zero-shot text-to-speech system based on denoising diffusion pretraining and distribution matching distillation.

Denoising text-to-speech +1

DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

no code implementations12 Sep 2024 Kangyang Luo, Shuai Wang, Yexuan Fu, Renrong Shao, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu

In dual-model distillation, the trained dual generators work together to provide the training data for updates of the global model.

Federated Learning Image Classification

Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator

no code implementations11 Sep 2024 Kangyang Luo, Shuai Wang, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu

Federated Learning (FL) is gaining popularity as a distributed learning framework that only shares model parameters or gradient updates and keeps private data locally.

Diversity Federated Learning +3

Joint Input and Output Coordination for Class-Incremental Learning

no code implementations9 Sep 2024 Shuai Wang, Yibing Zhan, Yong Luo, Han Hu, Wei Yu, Yonggang Wen, DaCheng Tao

This mechanism assigns different weights to different categories of data according to the gradient of the output score, and uses knowledge distillation (KD) to reduce the mutual interference between the outputs of old and new tasks.

class-incremental learning Class Incremental Learning +2

$\mathbb{USCD}$: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding

no code implementations9 Sep 2024 Shuai Wang, Liang Ding, Li Shen, Yong Luo, Zheng He, Wei Yu, DaCheng Tao

Then, we selectively eliminate output noise induced by lame prompts based on the uncertainty of the prediction distribution from the standard prompt.

Code Generation HumanEval +1

vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders

no code implementations3 Sep 2024 Yiwei Guo, Zhihan Li, Junjie Li, Chenpeng Du, Hankun Wang, Shuai Wang, Xie Chen, Kai Yu

To amend the loss of speaker timbre in the content tokens, vec2wav 2. 0 utilizes the WavLM features to provide strong timbre-dependent information.

Speech Synthesis Voice Conversion

ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression of Temporal and Spatial Redundancies in Point Cloud Transformers

1 code implementation2 Sep 2024 Luoyu MEI, Yun Cheng, Ruofeng Liu, Zhimeng Yin, Wenchao Jiang, Shuai Wang, Wei Gong

Notably, ESP-PCT achieves a remarkable accuracy of 93. 2% while reducing the computational requirements (FLOPs) by 76. 9% and memory usage by 78. 2% compared to the existing Point Transformer model simultaneously.

ToolACE: Winning the Points of LLM Function Calling

no code implementations2 Sep 2024 Weiwen Liu, Xu Huang, Xingshan Zeng, Xinlong Hao, Shuai Yu, Dexun Li, Shuai Wang, Weinan Gan, Zhengying Liu, Yuanqing Yu, Zezhong Wang, Yuxian Wang, Wu Ning, Yutai Hou, Bin Wang, Chuhan Wu, Xinzhi Wang, Yong liu, Yasheng Wang, Duyu Tang, Dandan Tu, Lifeng Shang, Xin Jiang, Ruiming Tang, Defu Lian, Qun Liu, Enhong Chen

Function calling significantly extends the application boundary of large language models, where high-quality and diverse training data is critical for unlocking this capability.

Tackling Data Heterogeneity in Federated Learning via Loss Decomposition

1 code implementation22 Aug 2024 Shuang Zeng, Pengxin Guo, Shuai Wang, Jianbo Wang, Yuyin Zhou, Liangqiong Qu

To mitigate the impact of data heterogeneity on FL performance, we start with analyzing how FL training influence FL performance by decomposing the global loss into three terms: local loss, distribution shift loss and aggregation loss.

Federated Learning Privacy Preserving +1

API-guided Dataset Synthesis to Finetune Large Code Models

no code implementations15 Aug 2024 Zongjie Li, Daoyuan Wu, Shuai Wang, Zhendong Su

Inspired by APIs as high-level abstractions of code that encapsulate rich semantic information in a concise structure, we propose DataScope, an API-guided dataset synthesis framework designed to enhance the SFT process for LCMs in both general and domain-specific scenarios.

Multiple Contexts and Frequencies Aggregation Network forDeepfake Detection

no code implementations3 Aug 2024 Zifeng Li, Wenzhong Tang, Shijun Gao, Shuai Wang, Yanxiang Wang

Deepfake detection faces increasing challenges since the fast growth of generative models in developing massive and diverse Deepfake technologies.

DeepFake Detection Face Swapping

Context Embeddings for Efficient Answer Generation in RAG

no code implementations12 Jul 2024 David Rau, Shuai Wang, Hervé Déjean, Stéphane Clinchant

We address this challenge by presenting COCOM, an effective context compression method, reducing long contexts to only a handful of Context Embeddings speeding up the generation time by a large margin.

Answer Generation RAG +1

Ternary Spike-based Neuromorphic Signal Processing System

no code implementations7 Jul 2024 Shuai Wang, Dehao Zhang, Ammar Belatreche, Yichen Xiao, Hongyu Qing, Wenjie We, Malu Zhang, Yang Yang

QT-SNN, compatible with ternary spike trains from the TAE method, quantifies both membrane potentials and synaptic weights to reduce memory requirements while maintaining performance.

Quantization

On the Effectiveness of Acoustic BPE in Decoder-Only TTS

no code implementations4 Jul 2024 Bohan Li, Feiyu Shen, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu

Discretizing speech into tokens and generating them by a decoder-only model have been a promising direction for text-to-speech (TTS) and spoken language modeling (SLM).

Decoder Diversity +4

BERGEN: A Benchmarking Library for Retrieval-Augmented Generation

1 code implementation1 Jul 2024 David Rau, Hervé Déjean, Nadezhda Chirkova, Thibault Formal, Shuai Wang, Vassilina Nikoulina, Stéphane Clinchant

In response to the recent popularity of generative LLMs, many RAG approaches have been proposed, which involve an intricate number of different configurations such as evaluation datasets, collections, metrics, retrievers, and LLMs.

Benchmarking RAG +1

Symbolic Learning Enables Self-Evolving Agents

1 code implementation26 Jun 2024 Wangchunshu Zhou, Yixin Ou, Shengwei Ding, Long Li, Jialong Wu, Tiannan Wang, Jiamin Chen, Shuai Wang, Xiaohua Xu, Ningyu Zhang, Huajun Chen, Yuchen Eleanor Jiang

In this work, we introduce agent symbolic learning, a systematic framework that enables language agents to optimize themselves on their own in a data-centric way using symbolic optimizers.

SurgeMOD: Translating image-space tissue motions into vision-based surgical forces

1 code implementation25 Jun 2024 Mikel De Iturrate Reyzabal, Dionysios Malas, Shuai Wang, Sebastien Ourselin, Hongbin Liu

Using internal movements generated by natural processes like breathing or the cardiac cycle, we infer the image-space basis of the motion on the frequency domain.

An Investigation of Prompt Variations for Zero-shot LLM-based Rankers

1 code implementation20 Jun 2024 Shuoqi Sun, Shengyao Zhuang, Shuai Wang, Guido Zuccon

We provide a systematic understanding of the impact of specific components and wordings used in prompts on the effectiveness of rankers based on zero-shot Large Language Models (LLMs).

Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting

no code implementations19 Jun 2024 Shuai Wang, Dehao Zhang, Kexin Shi, Yuchen Wang, Wenjie Wei, Jibin Wu, Malu Zhang

Here, we take advantage of spiking neural networks' energy efficiency and propose an end-to-end lightweight KWS model.

Keyword Spotting

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion

no code implementations12 Jun 2024 Ziqian Ning, Shuai Wang, Pengcheng Zhu, Zhichao Wang, Jixun Yao, Lei Xie, Mengxiao Bi

With speaker-independent semantic tokens to guide the training of the content encoder, the dependency on ASR is removed and the model can operate under extremely small chunks, with cascading errors eliminated.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

1 code implementation9 Jun 2024 Linhan Ma, Dake Guo, Kun Song, Yuepeng Jiang, Shuai Wang, Liumeng Xue, Weiming Xu, Huan Zhao, BinBin Zhang, Lei Xie

Furthermore, we have created subsets of varying sizes, categorized by segment quality scores to allow for TTS model training and fine-tuning.

text-to-speech Text to Speech

Autoregressive Diffusion Transformer for Text-to-Speech Synthesis

no code implementations8 Jun 2024 Zhijun Liu, Shuai Wang, Sho Inoue, Qibing Bai, Haizhou Li

Our experiments reveal that employing Integral Kullback-Leibler (IKL) divergence for distillation at each autoregressive step significantly boosts the perceived quality of the samples.

Audio Generation Decoder +4

SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner

no code implementations8 Jun 2024 Xunguang Wang, Daoyuan Wu, Zhenlan Ji, Zongjie Li, Pingchuan Ma, Shuai Wang, Yingjiu Li, Yang Liu, Ning Liu, Juergen Rahmel

Jailbreaking is an emerging adversarial attack that bypasses the safety alignment deployed in off-the-shelf large language models (LLMs) and has evolved into multiple categories: human-based, optimization-based, generation-based, and the recent indirect and multilingual jailbreaks.

Adversarial Attack LLM Jailbreak +1

Ada-HGNN: Adaptive Sampling for Scalable Hypergraph Neural Networks

no code implementations22 May 2024 Shuai Wang, David W. Zhang, Jia-Hong Huang, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks.

MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer

1 code implementation15 May 2024 Chengyu Wu, Chengkai Wang, Yaqi Wang, Huiyu Zhou, Yatao Zhang, Qifeng Wang, Shuai Wang

In addition, efficient and effective interactions between multi-modal representations need to be further explored, lacking insightful exploration of prognostic correlation in multi-modality features.

Representation Learning

Mamba-FETrack: Frame-Event Tracking via State Space Model

2 code implementations28 Apr 2024 Ju Huang, Shiao Wang, Shuai Wang, Zhe Wu, Xiao Wang, Bo Jiang

Specifically, our Mamba-based tracker achieves 43. 5/55. 6 on the SR/PR metric, while the ViT-S based tracker (OSTrack) obtains 40. 0/50. 9.

Mamba Object Localization

Testing and Understanding Erroneous Planning in LLM Agents through Synthesized User Inputs

no code implementations27 Apr 2024 Zhenlan Ji, Daoyuan Wu, Pingchuan Ma, Zongjie Li, Shuai Wang

These synthesized inputs are natural language paragraphs that specify the requirements for completing a series of tasks.

Integrated Sensing and Communication for Edge Inference with End-to-End Multi-View Fusion

no code implementations16 Apr 2024 Xibin Jin, Guoliang Li, Shuai Wang, Miaowen Wen, Chengzhong Xu, H. Vincent Poor

Integrated sensing and communication (ISAC) is a promising solution to accelerate edge inference via the dual use of wireless signals.

Integrated sensing and communication ISAC

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

no code implementations9 Apr 2024 Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, HUI ZHANG, Xie Chen, Kai Yu

Discrete speech tokens have been more and more popular in multiple speech processing fields, including automatic speech recognition (ASR), text-to-speech (TTS) and singing voice synthesis (SVS).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis

2 code implementations29 Mar 2024 Chao Pang, Xingxing Weng, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He

VHM is built on a large-scale remote sensing image-text dataset with rich-content captions (VersaD), and an honest instruction dataset comprising both factual and deceptive questions (HnstD).

Hallucination Image Captioning +8

Information Cascade Prediction under Public Emergencies: A Survey

no code implementations28 Mar 2024 Qi Zhang, Guang Wang, Li Lin, Kaiwen Xia, Shuai Wang

With the advent of the era of big data, massive information, expert experience, and high-accuracy models bring great opportunities to the information cascade prediction of public emergencies.

Prediction Survey

Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations

no code implementations21 Mar 2024 Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot

This is particularly true for medical image segmentation (MIS) datasets, where the processes of collection and fine-grained annotation are time-intensive and laborious.

Image Classification Image Generation +4

NN-Defined Modulator: Reconfigurable and Portable Software Modulator on IoT Gateways

1 code implementation14 Mar 2024 Jiazhao Wang, Wenchao Jiang, Ruofeng Liu, Bin Hu, Demin Gao, Shuai Wang

A physical-layer modulator is a vital component for an IoT gateway to map the symbols to signals.

From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification

no code implementations10 Mar 2024 Fei Wang, Chao Shang, Sarthak Jain, Shuai Wang, Qiang Ning, Bonan Min, Vittorio Castelli, Yassine Benajiba, Dan Roth

We investigate common constraints in NLP tasks, categorize them into three classes based on the types of their arguments, and propose a unified framework, ACT (Aligning to ConsTraints), to automatically produce supervision signals for user alignment with constraints.

Abstractive Text Summarization Entity Typing +3

LLMs Can Defend Themselves Against Jailbreaking in a Practical Manner: A Vision Paper

no code implementations24 Feb 2024 Daoyuan Wu, Shuai Wang, Yang Liu, Ning Liu

Our key insight is that regardless of the kind of jailbreak strategies employed, they eventually need to include a harmful prompt (e. g., "how to make a bomb") in the prompt sent to LLMs, and we found that existing LLMs can effectively recognize such harmful prompts that violate their safety policies.

Adversarial Attack Safety Alignment

Large Language Models for Stemming: Promises, Pitfalls and Failures

no code implementations19 Feb 2024 Shuai Wang, Shengyao Zhuang, Guido Zuccon

With this respect, we identify three avenues, each characterised by different trade-offs in terms of computational cost, effectiveness and robustness : (1) use LLMs to stem the vocabulary for a collection, i. e., the set of unique words that appear in the collection (vocabulary stemming), (2) use LLMs to stem each document separately (contextual stemming), and (3) use LLMs to extract from each document entities that should not be stemmed, then use vocabulary stemming to stem the rest of the terms (entity-based contextual stemming).

FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation

no code implementations19 Feb 2024 Shuai Wang, Ekaterina Khramtsova, Shengyao Zhuang, Guido Zuccon

Federated search systems aggregate results from multiple search engines, selecting appropriate sources to enhance result quality and align with user intent.

Benchmarking Chatbot +4

Eliminating Information Leakage in Hard Concept Bottleneck Models with Supervised, Hierarchical Concept Learning

no code implementations3 Feb 2024 Ao Sun, Yuanyuan Yuan, Pingchuan Ma, Shuai Wang

This paper alleviates the information leakage issue by introducing label supervision in concept predication and constructing a hierarchical concept set.

ReSLLM: Large Language Models are Strong Resource Selectors for Federated Search

no code implementations31 Jan 2024 Shuai Wang, Shengyao Zhuang, Bevan Koopman, Guido Zuccon

Our ReSLLM method exploits LLMs to drive the selection of resources in federated search in a zero-shot setting.

An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios

no code implementations27 Jan 2024 Zongjie Li, Wenying Qiu, Pingchuan Ma, Yichen Li, You Li, Sijia He, Baozheng Jiang, Shuai Wang, Weixi Gu

In this paper, we present a comprehensive empirical study on the accuracy and robustness of LLMs in the context of the Chinese industrial production area.

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

no code implementations25 Jan 2024 Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, HUI ZHANG, Xie Chen, Kai Yu

Recent TTS models with decoder-only Transformer architecture, such as SPEAR-TTS and VALL-E, achieve impressive naturalness and demonstrate the ability for zero-shot adaptation given a speech prompt.

Decoder Hallucination +3

Zero-shot Generative Large Language Models for Systematic Review Screening Automation

no code implementations12 Jan 2024 Shuai Wang, Harrisen Scells, Shengyao Zhuang, Martin Potthast, Bevan Koopman, Guido Zuccon

Systematic reviews are crucial for evidence-based medicine as they comprehensively analyse published research findings on specific questions.

OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models

1 code implementation12 Jan 2024 Shuai Wang, Liang Ding, Li Shen, Yong Luo, Bo Du, DaCheng Tao

Advancing automated programming necessitates robust and comprehensive code generation benchmarks, yet current evaluation frameworks largely neglect object-oriented programming (OOP) in favor of functional programming (FP), e. g., HumanEval and MBPP.

Code Generation HumanEval +1

The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge

no code implementations26 Dec 2023 Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng

This paper summarizes our team's efforts in both tracks of the ICMC-ASR Challenge for in-car multi-channel automatic speech recognition.

Automatic Speech Recognition Data Augmentation +2

VRPTEST: Evaluating Visual Referring Prompting in Large Multimodal Models

no code implementations7 Dec 2023 Zongjie Li, Chaozheng Wang, Chaowei Liu, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao

With recent advancements in Large Multimodal Models (LMMs) across various domains, a novel prompting method called visual referring prompting has emerged, showing significant potential in enhancing human-computer interaction within multimodal systems.

InstructTA: Instruction-Tuned Targeted Attack for Large Vision-Language Models

1 code implementation4 Dec 2023 Xunguang Wang, Zhenlan Ji, Pingchuan Ma, Zongjie Li, Shuai Wang

This practical setting poses challenges to the cross-prompt and cross-model transferability of targeted adversarial attack, which aims to confuse the LVLM to output a response that is semantically similar to the attacker's chosen target text.

Adversarial Attack Language Modelling +2

Accurate Segmentation of Optic Disc And Cup from Multiple Pseudo-labels by Noise-aware Learning

1 code implementation30 Nov 2023 Tengjin Weng, Yang shen, Zhidong Zhao, Zhiming Cheng, Shuai Wang

Optic disc and cup segmentation plays a crucial role in automating the screening and diagnosis of optic glaucoma.

Denoising Segmentation

PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal Disease Classification

no code implementations20 Nov 2023 Zhihan Yang, Zhiming Cheng, Tengjin Weng, Shucheng He, Yaqi Wang, Xin Ye, Shuai Wang

Specifically, we design a Patch Message Passing (PMP) module based on the Message Passing mechanism to establish global interaction for pathological semantic features and to exploit the subtle differences further between different diseases.

Multi-class Classification

FDNet: Feature Decoupled Segmentation Network for Tooth CBCT Image

no code implementations11 Nov 2023 Xiang Feng, Chengkai Wang, Chengyu Wu, Yunxiang Li, Yongbo He, Shuai Wang, Yaiqi Wang

Precise Tooth Cone Beam Computed Tomography (CBCT) image segmentation is crucial for orthodontic treatment planning.

Image Segmentation Segmentation +1

Evaluating Generative Ad Hoc Information Retrieval

no code implementations8 Nov 2023 Lukas Gienapp, Harrisen Scells, Niklas Deckers, Janek Bevendorff, Shuai Wang, Johannes Kiesel, Shahbaz Syed, Maik Fröbe, Guido Zuccon, Benno Stein, Matthias Hagen, Martin Potthast

To lay a foundation for developing new evaluation methods for generative retrieval systems, we survey the relevant literature from the fields of information retrieval and natural language processing, identify search tasks and system architectures in generative retrieval, develop a new user model, and study its operationalization.

Document Ranking Information Retrieval +1

Privacy-preserving Federated Primal-dual Learning for Non-convex and Non-smooth Problems with Model Sparsification

no code implementations30 Oct 2023 Yiwei Li, Chien-Wei Huang, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek

Federated learning (FL) has been recognized as a rapidly growing research area, where the model is trained over massively distributed clients under the orchestration of a parameter server (PS) without sharing clients' data.

Federated Learning Privacy Preserving

SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity

no code implementations30 Oct 2023 Haitao Xu, Songwei Liu, Yuyang Xu, Shuai Wang, Jiashi Li, Chenqian Yan, Liangqiang Li, Lean Fu, Xin Pan, Fangmin Chen

Our framework consists of two parts: (a) A fine-grained kernel sparsity schema with a sparsity granularity between structured pruning and unstructured pruning.

Network Pruning

PC-bzip2: a phase-space continuity enhanced lossless compression algorithm for light field microscopy data

no code implementations14 Oct 2023 Changqing Su, Zihan Lin, You Zhou, Shuai Wang, Yuhan Gao, Chenggang Yan, Bo Xiong

Moreover, by introducing the temporal continuity, our method shows the superior compression ratio on time series data of zebrafish blood vessels.

Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach

1 code implementation10 Oct 2023 Zhenlan Ji, Pingchuan Ma, Zongjie Li, Shuai Wang

We illustrate the insights that our framework can provide by studying over 3 popular LLMs with over 12 prompt adjustment strategies.

Benchmarking Code Generation +3

Forecasting Tropical Cyclones with Cascaded Diffusion Models

1 code implementation2 Oct 2023 Pritthijit Nath, Pancham Shukla, Shuai Wang, César Quilodrán-Casas

As tropical cyclones become more intense due to climate change, the rise of Al-based modelling provides a more affordable and accessible approach compared to traditional methods based on mathematical models.

SSIM Super-Resolution +1

Split and Merge: Aligning Position Biases in LLM-based Evaluators

no code implementations29 Sep 2023 Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu

Specifically, PORTIA splits the answers into multiple segments, aligns similar content across candidate answers, and then merges them back into a single prompt for evaluation by LLMs.

Language Modelling Large Language Model +1

Exposing Image Splicing Traces in Scientific Publications via Uncertainty-guided Refinement

1 code implementation28 Sep 2023 Xun Lin, Wenzhong Tang, Haoran Wang, Yizhong Liu, Yakun Ju, Shuai Wang, Zitong Yu

Compared to image duplication and synthesis, image splicing detection is more challenging due to the lack of reference images and the typically small tampered areas.

Image Forensics Image Manipulation

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

no code implementations25 Sep 2023 Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang

To address this issue, we introduce an automatic in-the-wild speech data preprocessing framework (AutoPrep) in this paper, which is designed to enhance speech quality, generate speaker labels, and produce transcriptions automatically.

Automatic Speech Recognition Speech Enhancement +3

Multiple Satellites Collaboration for Joint Code-aided CFOs and CPOs Estimation

no code implementations22 Sep 2023 Pingyue Yue, Yixuan Li, Yue Li, Rui Zhang, Shuai Wang, Jianping An

Low Earth Orbit (LEO) satellites are being extensively researched in the development of secure Internet of Remote Things (IoRT).

parameter estimation

Active Learning for Multilingual Fingerspelling Corpora

no code implementations21 Sep 2023 Shuai Wang, Eric Nalisnick

We apply active learning to help with data scarcity problems in sign languages.

Active Learning

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

1 code implementation21 Sep 2023 Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li

Current speaker recognition systems primarily rely on supervised approaches, constrained by the scale of labeled datasets.

Speaker Recognition

Agents: An Open-source Framework for Autonomous Language Agents

1 code implementation14 Sep 2023 Wangchunshu Zhou, Yuchen Eleanor Jiang, Long Li, Jialong Wu, Tiannan Wang, Shi Qiu, Jintian Zhang, Jing Chen, Ruipu Wu, Shuai Wang, Shiding Zhu, Jiyu Chen, Wentao Zhang, Xiangru Tang, Ningyu Zhang, Huajun Chen, Peng Cui, Mrinmaya Sachan

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language interfaces.

Enabling Runtime Verification of Causal Discovery Algorithms with Automated Conditional Independence Reasoning (Extended Version)

no code implementations11 Sep 2023 Pingchuan Ma, Zhenlan Ji, Peisen Yao, Shuai Wang, Kui Ren

Based on the decision procedure to CIR, CICheck includes two variants: ED-CICheck and ED-CICheck, which detect erroneous CI tests (to enhance reliability) and prune excessive CI tests (to enhance privacy), respectively.

Causal Discovery

Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation

1 code implementation11 Sep 2023 Shuai Wang, Harrisen Scells, Martin Potthast, Bevan Koopman, Guido Zuccon

Our best approach is not only viable based on the information available at the time of screening, but also has similar effectiveness to the final title.

Natural Language Queries

Integrated Robotics Networks with Co-optimization of Drone Placement and Air-Ground Communications

no code implementations9 Sep 2023 Menghao Hu, Tong Zhang, Shuai Wang, Guoliang Li, Yingyang Chen, Qiang Li, Gaojie Chen

Terrestrial robots, i. e., unmanned ground vehicles (UGVs), and aerial robots, i. e., unmanned aerial vehicles (UAVs), operate in separate spaces.

Parsing is All You Need for Accurate Gait Recognition in the Wild

1 code implementation31 Aug 2023 Jinkai Zheng, Xinchen Liu, Shuai Wang, Lihao Wang, Chenggang Yan, Wu Liu

Furthermore, due to the lack of suitable datasets, we build the first parsing-based dataset for gait recognition in the wild, named Gait3D-Parsing, by extending the large-scale and challenging Gait3D dataset.

All Gait Recognition in the Wild +1

Least Squares Maximum and Weighted Generalization-Memorization Machines

no code implementations31 Aug 2023 Shuai Wang, Zhen Wang, Yuan-Hai Shao

Furthermore, we propose some different memory impact functions for the MIMM and WIMM.

Memorization

Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models

no code implementations29 Aug 2023 Qingyue Wang, Yanhe Fu, Yanan Cao, Shuai Wang, Zhiliang Tian, Liang Ding

We evaluate our method on both open and closed LLMs, and the experiments on the widely-used public dataset show that our method can generate more consistent responses in a long-context conversation.

16k 8k +1

Deep Equilibrium Object Detection

1 code implementation ICCV 2023 Shuai Wang, Yao Teng, LiMin Wang

To be more specific to object decoding, we use a two-step unrolled equilibrium equation to explicitly capture the query vector refinement.

Decoder Object +3

Compound Attention and Neighbor Matching Network for Multi-contrast MRI Super-resolution

no code implementations5 Jul 2023 Wenxuan Chen, Sirui Wu, Shuai Wang, Zhongsen Li, Jia Yang, Huifeng Yao, Xiaolei Song

Multi-contrast magnetic resonance imaging (MRI) reflects information about human tissue from different perspectives and has many clinical applications.

Image Super-Resolution

Communication Resources Constrained Hierarchical Federated Learning for End-to-End Autonomous Driving

1 code implementation28 Jun 2023 Wei-Bin Kou, Shuai Wang, Guangxu Zhu, Bin Luo, Yingxian Chen, Derrick Wing Kwan Ng, Yik-Chung Wu

While federated learning (FL) improves the generalization of end-to-end autonomous driving by model aggregation, the conventional single-hop FL (SFL) suffers from slow convergence rate due to long-range communications among vehicles and cloud server.

Autonomous Driving Federated Learning

Wespeaker baselines for VoxSRC2023

no code implementations27 Jun 2023 Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen, Hongji Wang, Wen Ding

This report showcases the results achieved using the wespeaker toolkit for the VoxSRC2023 Challenge.

Geometric Pooling: maintaining more useful information

no code implementations21 Jun 2023 Hao Xu, Jia Liu, Yang shen, Kenan Lou, Yanxia Bao, Ruihua Zhang, Shuyue Zhou, Hongsen Zhao, Shuai Wang

However, by analyzing the statistical characteristic of activated units after pooling, we found that a large number of units dropped by sorting pooling are negative-value units that contain useful information and can contribute considerably to the final decision.

Node Classification

Towards Practical Federated Causal Structure Learning

1 code implementation15 Jun 2023 Zhaoyu Wang, Pingchuan Ma, Shuai Wang

Federated learning can solve this problem, but existing solutions for federated causal structure learning make unrealistic assumptions about data and lack convergence guarantees.

Federated Learning scientific discovery

Precise and Generalized Robustness Certification for Neural Networks

1 code implementation11 Jun 2023 Yuanyuan Yuan, Shuai Wang, Zhendong Su

We identify two key properties, independence and continuity, that convert the latent space into a precise and analysis-friendly input space representation for certification.

Autonomous Driving Style Transfer

Enhancing Point Annotations with Superpixel and Confidence Learning Guided for Improving Semi-Supervised OCT Fluid Segmentation

no code implementations5 Jun 2023 Tengjin Weng, Yang shen, Kai Jin, Zhiming Cheng, Yunxiang Li, Gewen Zhang, Shuai Wang, Yaqi Wang

Specifically, we use points to annotate fluid regions in unlabeled OCT images and the Superpixel-Guided Pseudo-Label Generation (SGPLG) module generates pseudo-labels and pixel-level label trust maps from the point annotations.

Denoising Pseudo Label +1

Taxonomy Expansion for Named Entity Recognition

no code implementations22 May 2023 Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha Anna John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros

We experiment with 6 diverse datasets and show that PLM consistently performs better than most other approaches (0. 5 - 2. 5 F1), including in novel settings for taxonomy expansion not considered in prior work.

named-entity-recognition Named Entity Recognition +2

Causality-Aided Trade-off Analysis for Machine Learning Fairness

no code implementations22 May 2023 Zhenlan Ji, Pingchuan Ma, Shuai Wang, Yanhui Li

This paper uses causality analysis as a principled method for analyzing trade-offs between fairness parameters and other crucial metrics in ML pipelines.

Causal Discovery Causal Inference +1

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding

no code implementations21 May 2023 Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi

Voice conversion is an increasingly popular technology, and the growing number of real-time applications requires models with streaming conversion capabilities.

Data Augmentation Decoder +2

A Weak Supervision Approach for Few-Shot Aspect Based Sentiment

no code implementations19 May 2023 Robert Vacareanu, Siddharth Varia, Kishaloy Halder, Shuai Wang, Giovanni Paolini, Neha Anna John, Miguel Ballesteros, Smaranda Muresan

We explore how weak supervision on abundant unlabeled data can be leveraged to improve few-shot performance in aspect-based sentiment analysis (ABSA) tasks.

Aspect-Based Sentiment Analysis Aspect Extraction +4

Explain Any Concept: Segment Anything Meets Concept-Based Explanation

1 code implementation NeurIPS 2023 Ao Sun, Pingchuan Ma, Yuanyuan Yuan, Shuai Wang

For computer vision tasks, mainstream pixel-based XAI methods explain DNN decisions by identifying important pixels, and emerging concept-based XAI explore forming explanations with concepts (e. g., a head in an image).

Instance Segmentation Semantic Segmentation

Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion

no code implementations16 May 2023 Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng

Experimental results show that our proposed method achieves comparable similarity and higher naturalness than the supervised method, which needs a huge amount of annotated corpora for training and is applicable to improve similarity for VC methods with other SSL representations as input.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Towards Generalizable Medical Image Segmentation with Pixel-wise Uncertainty Estimation

no code implementations13 May 2023 Shuai Wang, Zipei Yan, Daoan Zhang, Zhongsen Li, Sirui Wu, Wenxuan Chen, Rui Li

In contrast, the IID hypothesis is not universally guaranteed in numerous real-world applications, especially in medical image analysis.

Image Segmentation Medical Image Analysis +2

Black-box Source-free Domain Adaptation via Two-stage Knowledge Distillation

no code implementations13 May 2023 Shuai Wang, Daoan Zhang, Zipei Yan, Shitong Shao, Rui Li

In Stage \uppercase\expandafter{\romannumeral1}, we train the target model from scratch with soft pseudo-labels generated by the source model in a knowledge distillation manner.

Knowledge Distillation Source-Free Domain Adaptation +1

"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process

1 code implementation4 May 2023 Pingchuan Ma, Zongjie Li, Ao Sun, Shuai Wang

Moreover, we propose a novel on-the-fly (OTF) repairing scheme that repairs unethical suggestions made by LLMs in real-time.

Moral Scenarios

Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning

no code implementations29 Apr 2023 Mingyang Wang, Zhenshan Bing, Xiangtong Yao, Shuai Wang, Hang Su, Chenguang Yang, Kai Huang, Alois Knoll

On MuJoCo and Meta-World benchmarks, MoSS outperforms prior works in terms of asymptotic performance, sample efficiency (3-50x faster), adaptation efficiency, and generalization robustness on broad and diverse task distributions.

Meta Reinforcement Learning MuJoCo +3

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

1 code implementation26 Apr 2023 Shitong Shao, Xiaohan Yuan, Zhen Huang, Ziming Qiu, Shuai Wang, Kevin Zhou

Based on this insight, we propose an approach called DiffuseExpand for expanding datasets for 2D medical image segmentation using DPM, which first samples a variety of masks from Gaussian noise to ensure the diversity, and then synthesizes images to ensure the alignment of images and masks.

Diversity Image Generation +4

Towards Open-Vocabulary Video Instance Segmentation

1 code implementation ICCV 2023 Haochen Wang, Cilin Yan, Shuai Wang, XiaoLong Jiang, Xu Tang, Yao Hu, Weidi Xie, Efstratios Gavves

Video Instance Segmentation (VIS) aims at segmenting and categorizing objects in videos from a closed set of training categories, lacking the generalization ability to handle novel categories in real-world videos.

Instance Segmentation Segmentation +3

Demonstration of InsightPilot: An LLM-Empowered Automated Data Exploration System

no code implementations2 Apr 2023 Pingchuan Ma, Rui Ding, Shuai Wang, Shi Han, Dongmei Zhang

In brief, an IQuery is an abstraction and automation of data analysis operations, which mimics the approach of data analysts and simplifies the exploration process for users.

Language Modeling Language Modelling +1

Feature Alignment and Uniformity for Test Time Adaptation

1 code implementation CVPR 2023 Shuai Wang, Daoan Zhang, Zipei Yan, JianGuo Zhang, Rui Li

Test time adaptation (TTA) aims to adapt deep neural networks when receiving out of distribution test domain samples.

Domain Generalization Image Segmentation +3

Prototype Knowledge Distillation for Medical Segmentation with Missing Modality

1 code implementation17 Mar 2023 Shuai Wang, Zipei Yan, Daoan Zhang, Haining Wei, Zhongsen Li, Rui Li

Specifically, our ProtoKD can not only distillate the pixel-wise knowledge of multi-modality data to single-modality data but also transfer intra-class and inter-class feature variations, such that the student model could learn more robust feature representation from the teacher model and inference with only one single modality data.

Image Segmentation Knowledge Distillation +3

Bootstrap The Original Latent: Learning a Private Model from a Black-box Model

no code implementations7 Mar 2023 Shuai Wang, Daoan Zhang, JianGuo Zhang, Weiwei Zhang, Rui Li

In this paper, considering the balance of data/model privacy of model owners and user needs, we propose a new setting called Back-Propagated Black-Box Adaptation (BPBA) for users to better train their private models via the guidance of the back-propagated results of a Black-box foundation/source model.

model

Can ChatGPT Write a Good Boolean Query for Systematic Review Literature Search?

no code implementations3 Feb 2023 Shuai Wang, Harrisen Scells, Bevan Koopman, Guido Zuccon

The ability of ChatGPT to follow complex instructions and generate queries with high precision makes it a valuable tool for researchers conducting systematic reviews, particularly for rapid reviews where time is a constraint and often trading-off higher precision for lower recall is acceptable.

Predictions of photophysical properties of phosphorescent platinum(II) complexes based on ensemble machine learning approach

no code implementations8 Jan 2023 Shuai Wang, ChiYung Yam, Shuguang Chen, Lihong Hu, Liping Li, Faan-Fung Hung, Jiaqi Fan, Chi-Ming Che, Guanhua Chen

Here, we develop a general protocol for accurate predictions of emission wavelength, radiative decay rate constant, and PL quantum yield for phosphorescent Pt(II) emitters based on the combination of first-principles quantum mechanical method, machine learning (ML) and experimental calibration.

Ensemble Learning Triplet

SrTR: Self-reasoning Transformer with Visual-linguistic Knowledge for Scene Graph Generation

no code implementations19 Dec 2022 Yuxiang Zhang, Zhenbo Liu, Shuai Wang

The execution efficiency of the one-stage scene graph generation approaches are quite high, which infer the effective relation between entity pairs using sparse proposal sets and a few queries.

Decoder Graph Generation +5

MeSH Suggester: A Library and System for MeSH Term Suggestion for Systematic Review Boolean Query Construction

1 code implementation18 Dec 2022 Shuai Wang, Hang Li, Guido Zuccon

One challenge to creating an effective systematic review Boolean query is the selection of effective MeSH Terms to include in the query.

Teaching What You Should Teach: A Data-Based Distillation Method

no code implementations11 Dec 2022 Shitong Shao, Huanran Chen, Zhen Huang, Linrui Gong, Shuai Wang, Xinxiao wu

To be specific, we design a neural network-based data augmentation module with priori bias, which assists in finding what meets the teacher's strengths but the student's weaknesses, by learning magnitudes and probabilities to generate suitable data samples.

Data Augmentation Knowledge Distillation +1

Knowledge-Guided Exploration in Deep Reinforcement Learning

no code implementations26 Oct 2022 Sahisnu Mazumder, Bing Liu, Shuai Wang, Yingxuan Zhu, Xiaotian Yin, Lifeng Liu, Jian Li

This paper proposes a new method to drastically speed up deep reinforcement learning (deep RL) training for problems that have the property of state-action permissibility (SAP).

Deep Reinforcement Learning reinforcement-learning +1

Large-Scale Bandwidth and Power Optimization for Multi-Modal Edge Intelligence Autonomous Driving

no code implementations18 Oct 2022 Xinrao Li, Tong Zhang, Shuai Wang, Guangxu Zhu, Rui Wang, Tsung-Hui Chang

However, wireless channels between the edge server and the autonomous vehicles are time-varying due to the high-mobility of vehicles.

Autonomous Driving

Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis

1 code implementation12 Oct 2022 Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth

Aspect-based Sentiment Analysis (ABSA) is a fine-grained sentiment analysis task which involves four elements from user-generated texts: aspect term, aspect category, opinion term, and sentiment polarity.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

no code implementations11 Oct 2022 Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

The standard classification setting where categories are fixed during both training and testing falls short in dynamic environments where new document categories could potentially emerge.

Classification Document Classification +1

Decompiling x86 Deep Neural Network Executables

no code implementations3 Oct 2022 Zhibo Liu, Yuanyuan Yuan, Shuai Wang, Xiaofei Xie, Lei Ma

BTD takes DNN executables and outputs full model specifications, including types of DNN operators, network topology, dimensions, and parameters that are (nearly) identical to those of the input models.

Accelerated partial separable model using dimension-reduced optimization technique for ultra-fast cardiac MRI

no code implementations2 Oct 2022 Zhongsen Li, Aiqi Sun, Chuyu Liu, Haining Wei, Shuai Wang, Mingzhu Fu, Rui Li

The main objective of this study is to accelerate the PS model, shorten the time required for acquisition and reconstruction, and maintain good image quality simultaneously.

Dimensionality Reduction Image Reconstruction

Automated MeSH Term Suggestion for Effective Query Formulation in Systematic Reviews Literature Search

1 code implementation19 Sep 2022 Shuai Wang, Harrisen Scells, Bevan Koopman, Guido Zuccon

However, identifying the correct MeSH terms to include in a query is difficult: information experts are often unfamiliar with the MeSH database and unsure about the appropriateness of MeSH terms for a query.

Language-aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification

no code implementations6 Sep 2022 Yuxiang Zhang, Mengmeng Zhang, Wei Li, Shuai Wang, Ran Tao

Text information including extensive prior knowledge about land cover classes has been ignored in hyperspectral image classification (HSI) tasks.

Contrastive Learning Domain Generalization +1

Multi-Point Integrated Sensing and Communication: Fusion Model and Functionality Selection

no code implementations16 Aug 2022 Guoliang Li, Shuai Wang, Kejiang Ye, Miaowen Wen, Derrick Wing Kwan Ng, Marco Di Renzo

Integrated sensing and communication (ISAC) represents a paradigm shift, where previously competing wireless transmissions are jointly designed to operate in harmony via the shared use of the hardware platform for improving the spectral and energy efficiencies.

Integrated sensing and communication ISAC

XInsight: eXplainable Data Analysis Through The Lens of Causality

no code implementations26 Jul 2022 Pingchuan Ma, Rui Ding, Shuai Wang, Shi Han, Dongmei Zhang

XInsight is a three-module, end-to-end pipeline designed to extract causal graphs, translate causal primitives into XDA semantics, and quantify the quantitative contribution of each explanation to a data fact.

Decision Making

Cross Vision-RF Gait Re-identification with Low-cost RGB-D Cameras and mmWave Radars

no code implementations16 Jul 2022 Dongjiang Cao, Ruofeng Liu, Hao Li, Shuai Wang, Wenchao Jiang, Chris Xiaoxuan Lu

Human identification is a key requirement for many applications in everyday life, such as personalized services, automatic surveillance, continuous authentication, and contact tracing during pandemics, etc.

Metric Learning Person Re-Identification

A Framework Based on Generational and Environmental Response Strategies for Dynamic Multi-objective Optimization

no code implementations6 Jul 2022 Qingya Li, Xiangzhi Liu, Fuqiang Wang, Shuai Wang, Peng Zhang, Xiaoming Wu

In this paper, a novel framework based on generational and environmental response strategies (FGERS) is proposed, in which response strategies are run both in the environmental change stage and the environmental static stage to obtain population evolution information of those both stages.

VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

no code implementations4 Jul 2022 Danpeng Chen, Shuai Wang, Weijian Xie, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang

Even if the plane parameters are involved in the optimization, we effectively simplify the back-end map by using planar structures.

Cannot find the paper you are looking for? You can Submit a new open access paper.