Search Results for author: Pengyu Wang

Found 42 papers, 22 papers with code

WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer

no code implementations · 10 Apr 2025 · Huilin Yin, Pengyu Wang, Senmao Li, Jun Yan, Daniel Watzenig

Robust object detection for Unmanned Surface Vehicles (USVs) in complex water environments is essential for reliable navigation and operation.

Object Detection +1

CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

no code implementations · 27 Feb 2025 · Nian Shao, Rui Zhou, Pengyu Wang, Xian Li, Ying Fang, Yujie Yang, Xiaofei Li

Compared to linear-frequency domain or time-domain speech enhancement, the key advantage of Mel-spectrogram enhancement is that Mel-frequency presents speech in a more compact way and thus is easier to learn, which will benefit both speech quality and ASR.

Automatic Speech Recognition (ASR) +3

Optimal Actuator Attacks on Autonomous Vehicles Using Reinforcement Learning

no code implementations · 11 Feb 2025 · Pengyu Wang, Jialu Li, Ling Shi

With the increasing prevalence of autonomous vehicles (AVs), their vulnerability to various types of attacks has grown, presenting significant security challenges.

Autonomous Vehicles reinforcement-learning +2

VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification

1 code implementation · 11 Feb 2025 · Pengyu Wang, Ying Fang, Xiaofei Li

Reverberant speech, denoting the speech signal degraded by the process of reverberation, contains crucial knowledge of both anechoic source speech and room impulse response (RIR).

Automatic Speech Recognition (ASR) +4

Unit Region Encoding: A Unified and Compact Geometry-aware Representation for Floorplan Applications

no code implementations · 19 Jan 2025 · Huichao Zhang, Pengyu Wang, Manyi Li, Zuojun Li, Yaguang Wu

The floorplans are represented as the latent encodings on a set of boundary-adaptive unit region partition based on the clustering of the proposed geometry-aware density map.

Metric Learning

Learning-based Detection of GPS Spoofing Attack for Quadrotors

no code implementations · 10 Jan 2025 · Pengyu Wang, Zhaohua Yang, Jialu Li, Ling Shi

Safety-critical cyber-physical systems (CPS), such as quadrotor UAVs, are particularly prone to cyber attacks, which can result in significant consequences if not detected promptly and accurately.

ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle

1 code implementation · 4 Jan 2025 · Yinchuan Wang, Bin Ren, Xiang Zhang, Pengyu Wang, Chaoqun Wang, Rui Song, Yibin Li, Max Q.-H. Meng

In this article, a LiDAR-based SLAM method is presented to improve the accuracy of pose estimations for ground vehicles in rough terrains, which is termed Rotation-Optimized LiDAR-Only (ROLO) SLAM.

Pose Estimation

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models

no code implementations · 14 Nov 2024 · Wei Wang, Zhaowei Li, Qi Xu, Linfeng Li, Yiqing Cai, Botian Jiang, Hang Song, Xingcan Hu, Pengyu Wang, Li Xiao

Multi-modal large language models (MLLMs) have achieved remarkable success in fine-grained visual understanding across a range of tasks.

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

no code implementations · 11 Nov 2024 · Mianqiu Huang, Xiaoran Liu, Shaojun Zhou, Mozhi Zhang, Chenkun Tan, Pengyu Wang, Qipeng Guo, Zhe Xu, Linyang Li, Zhikai Lei, Linlin Li, Qun Liu, Yaqian Zhou, Xipeng Qiu, Xuanjing Huang

With the development of large language models (LLMs), the sequence length of these models continues to increase, drawing significant attention to long-context language models.

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

1 code implementation · 31 Oct 2024 · Xinghao Wang, Pengyu Wang, Bo Wang, Dong Zhang, Yunhua Zhou, Xipeng Qiu

By leveraging weight decomposition, BitStack can dynamically adjust the model size with minimal transmission between running memory and storage devices.

Quantization

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

1 code implementation · 18 Oct 2024 · Mozhi Zhang, Pengyu Wang, Chenkun Tan, Mianqiu Huang, Dong Zhang, Yaqian Zhou, Xipeng Qiu

Large Language Models (LLMs) acquire extensive knowledge and remarkable abilities from extensive text corpora, making them powerful tools for various applications.

Diversity

ChemDFM-X: Towards Large Multimodal Model for Chemistry

no code implementations · 20 Sep 2024 · Zihan Zhao, Bo Chen, Jingpiao Li, Lu Chen, Liyang Wen, Pengyu Wang, Zichen Zhu, Danyang Zhang, Ziping Wan, Yansi Li, Zhongyang Dai, Xin Chen, Kai Yu

Rapid developments of AI tools are expected to offer unprecedented assistance to the research of natural science including chemistry.

model

Xinyu: An Efficient LLM-based System for Commentary Generation

no code implementations · 21 Aug 2024 · Yiquan Wu, Bo Tang, Chenyang Xi, Yu Yu, Pengyu Wang, Yifei Liu, Kun Kuang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Jie Hu, Peng Cheng, Zhonghao Wang, Yi Wang, Yi Luo, MingChuan Yang

To address the advanced requirements, we present an argument ranking model for arguments and establish a comprehensive evidence database that includes up-to-date events and classic books, thereby strengthening the substantiation of the evidence with retrieval augmented generation (RAG) technology.

RAG Text Generation

Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving

1 code implementation · 19 Aug 2024 · Jun Yan, Pengyu Wang, Danni Wang, Weiquan Huang, Daniel Watzenig, Huilin Yin

In the task of semantic segmentation for autonomous driving, it is significant to study the zero-shot adversarial robustness of SAM.

Adversarial Robustness Autonomous Driving +5

Recent Advances in Data-driven Intelligent Control for Wireless Communication: A Comprehensive Survey

no code implementations · 6 Aug 2024 · Wei Huo, Huiwen Yang, Nachuan Yang, Zhaohua Yang, Jiuzhou Zhang, Fuhai Nan, Xingzhou Chen, Yifan Mao, Suyang Hu, Pengyu Wang, Xuanyu Zheng, Mingming Zhao, Ling Shi

As the volume of data continues to escalate, the integration of data-driven methods has become indispensable for enabling adaptive and intelligent control mechanisms in future wireless communication systems.

Scheduling

Case2Code: Learning Inductive Reasoning with Synthetic Data

1 code implementation · 17 Jul 2024 · Yunfan Shao, Linyang Li, Yichuan Ma, Peiji Li, Demin Song, Qinyuan Cheng, ShiMin Li, Xiaonan Li, Pengyu Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin

In this paper, we hope to focus on evaluating and teaching LLMs to conduct inductive reasoning, that is, LLMs are supposed to infer underlying rules by observing examples or sequential transformations.

Sparsity-Accelerated Training for Large Language Models

no code implementations · 3 Jun 2024 · Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu

Large language models (LLMs) have demonstrated proficiency across various natural language processing (NLP) tasks but often require additional training, such as continual pre-training and supervised fine-tuning.

SpeechAlign: Aligning Speech Generation to Human Preferences

1 code implementation · 8 Apr 2024 · Dong Zhang, Zhaowei Li, ShiMin Li, Xin Zhang, Pengyu Wang, Yaqian Zhou, Xipeng Qiu

However, the integration of human feedback to align speech outputs to human preferences is often neglected.

Language Modelling

Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting

no code implementations · 11 Mar 2024 · WenTing Chen, Pengyu Wang, Hui Ren, Lichao Sun, Quanzheng Li, Yixuan Yuan, Xiang Li

To address these challenges, we propose a novel medical image synthesis model that leverages fine-grained image-text alignment and anatomy-pathology prompts to generate highly detailed and accurate synthetic medical images.

Anatomy Descriptive +1

NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

1 code implementation · 29 Feb 2024 · Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

We present NewsBench, a novel evaluation framework to systematically assess the capabilities of Large Language Models (LLMs) for editorial capabilities in Chinese journalism.

Ethics Multiple-choice

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

1 code implementation · 24 Jan 2024 · Xinghao Wang, Junliang He, Pengyu Wang, Yunhua Zhou, Tianxiang Sun, Xipeng Qiu

These methods regularize the representation space by pulling similar sentence representations closer and pushing away the dissimilar ones and have been proven effective in various NLP tasks, e.g., semantic textual similarity (STS) tasks.

Contrastive Learning Denoising +4

InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

1 code implementation · 20 Jan 2024 · Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Ke Ren, Botian Jiang, Xipeng Qiu

With the rapid development of large language models (LLMs), they are not only used as general-purpose AI assistants but are also customized through further fine-tuning to meet the requirements of different applications.

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

1 code implementation · 8 Jan 2024 · Dong Zhang, Zhaowei Li, Pengyu Wang, Xin Zhang, Yaqian Zhou, Xipeng Qiu

In this paper, we propose SpeechAgents, a multi-modal LLM based multi-agent system designed for simulating human communication.

Language Modelling Large Language Model +1

Attitude Takeover Control for Noncooperative Space Targets Based on Gaussian Processes with Online Model Learning

no code implementations · 24 Oct 2023 · YuHan Liu, Pengyu Wang, Chang-Hun Lee, Roland Tóth

One major challenge for autonomous attitude takeover control for on-orbit servicing of spacecraft is that an accurate dynamic motion model of the combined vehicles is highly nonlinear, complex and often costly to identify online, which makes traditional model-based control impractical for this task.

Gaussian Processes

Watermarking LLMs with Weight Quantization

1 code implementation · 17 Oct 2023 · Linyang Li, Botian Jiang, Pengyu Wang, Ke Ren, Hang Yan, Xipeng Qiu

Abuse of large language models reveals high risks as large language models are being deployed at an astonishing speed.

Language Modelling +2

PerturbScore: Connecting Discrete and Continuous Perturbations in NLP

1 code implementation · 13 Oct 2023 · Linyang Li, Ke Ren, Yunfan Shao, Pengyu Wang, Xipeng Qiu

Through experimental results, we find that we can build a connection between discrete and continuous perturbations and use the proposed PerturbScore to learn such correlation, surpassing previous methods used in discrete perturbation measuring.

The Uncertainty-based Retrieval Framework for Ancient Chinese CWS and POS

1 code implementation · LT4HALA (LREC) 2022 · Pengyu Wang, Zhichen Ren

Automatic analysis for modern Chinese has greatly improved the accuracy of text mining in related fields, but the study of ancient Chinese is still relatively rare.

Chinese Word Segmentation Part-Of-Speech Tagging +2

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities

1 code implementation · 18 May 2023 · Dong Zhang, ShiMin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu

Multi-modal large language models are regarded as a crucial step towards Artificial General Intelligence (AGI) and have garnered significant interest with the emergence of ChatGPT.

Language Modelling +3

Origin Tracing and Detecting of LLMs

no code implementations · 27 Apr 2023 · Linyang Li, Pengyu Wang, Ke Ren, Tianxiang Sun, Xipeng Qiu

The extraordinary performance of large language models (LLMs) heightens the importance of detecting whether the context is generated by an AI system.

Learning For Predictive Control: A Dual Gaussian Process Approach

no code implementations · 7 Nov 2022 · YuHan Liu, Pengyu Wang, Roland Tóth

Gaussian process (GP) based estimation of system models is an effective tool to learn unknown dynamics directly from input/output data.

Model Predictive Control

The Open-World Lottery Ticket Hypothesis for OOD Intent Classification

1 code implementation · 13 Oct 2022 · Yunhua Zhou, Pengyu Wang, Peiju Liu, Yuxin Wang, Xipeng Qiu

Most existing methods of Out-of-Domain (OOD) intent classification rely on extensive auxiliary OOD corpora or specific training paradigms.

Intent Classification

EventHPE: Event-based 3D Human Pose and Shape Estimation

1 code implementation · ICCV 2021 · Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng

Event camera is an emerging imaging sensor for capturing dynamics of moving objects as events, which motivates our work in estimating 3D human pose and shape from the event signals.

3D human pose and shape estimation Optical Flow Estimation

3D Shape Segmentation via Shape Fully Convolutional Networks

no code implementations · 28 Feb 2017 · Pengyu Wang, Yuan Gan, Panpan Shui, Fenggen Yu, Yan Zhang, Songle Chen, Zhengxing Sun

3D shapes are represented as graph structures in the SFCN architecture, based on novel graph convolution and pooling operations, which are similar to convolution and pooling operations used on images.

Image Segmentation Segmentation +1

Stochastic Collapsed Variational Inference for Sequential Data

no code implementations · 5 Dec 2015 · Pengyu Wang, Phil Blunsom

Stochastic variational inference for collapsed models has recently been successfully applied to large scale topic modelling.

Variational Inference

Stochastic Collapsed Variational Inference for Hidden Markov Models

no code implementations · 5 Dec 2015 · Pengyu Wang, Phil Blunsom

In this paper, we propose a stochastic collapsed variational inference algorithm for hidden Markov models, in a sequential data setting.

Variational Inference
