Search Results for author: Boyang Zhang

Found 12 papers, 1 paper with code

Can the capability of Large Language Models be described by human ability? A Meta Study

no code implementations • 13 Apr 2025 • Mingrui Zan, Yunquan Zhang, Boyang Zhang, Fangming Liu, Daning Cheng

The evaluation benchmarks are categorized into 6 primary abilities and 11 sub-abilities from a human perspective.

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

no code implementations • 26 Feb 2025 • Ziyue Jiang, Yi Ren, RuiQi Li, Shengpeng Ji, Boyang Zhang, Zhenhui Ye, Chen Zhang, Bai Jionghao, Xiaoda Yang, Jialong Zuo, Yu Zhang, Rui Liu, Xiang Yin, Zhou Zhao

While recent zero-shot text-to-speech (TTS) models have significantly improved speech quality and expressiveness, mainstream systems still suffer from issues related to speech-text alignment modeling: 1) models without explicit speech-text alignment modeling exhibit less robustness, especially for hard sentences in practical applications; 2) predefined alignment-based models suffer from naturalness constraints of forced alignments.

Speech Synthesis, Text to Speech

A General Error-Theoretical Analysis Framework for Constructing Compression Strategies

no code implementations • 19 Feb 2025 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Meiqi Tu, Fangmin Liu, Jiake Tian

The exponential growth in parameter size and computational complexity of deep models poses significant challenges for efficient deployment.

Quantization

Lossless Model Compression via Joint Low-Rank Factorization Optimization

no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu, Jiake Tian

Low-rank factorization is a popular model compression technique that minimizes the error $\delta$ between approximated and original weight matrices.

Model Compression, Model Optimization

Compression for Better: A General and Stable Lossless Compression Framework

no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu, WenGuang Chen

A key challenge is effectively leveraging compression errors and defining the boundaries for lossless compression to minimize model loss.

Computational Efficiency, Model Compression

FP=xINT: A Low-Bit Series Expansion Algorithm for Post-Training Quantization

no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu

We introduce a deep model series expansion framework to address this issue, enabling rapid and accurate approximation of unquantized models without calibration sets or fine-tuning.

Quantization

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

no code implementations • 30 Jul 2024 • Boyang Zhang, Yicong Tan, Yun Shen, Ahmed Salem, Michael Backes, Savvas Zannettou, Yang Zhang

Through attacks on implemented and deployable agents in multi-agent scenarios, we accentuate the realistic risks associated with these vulnerabilities.

Language Modelling

Comprehensive Assessment of Toxicity in ChatGPT

no code implementations • 3 Nov 2023 • Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang

Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP.

A Plot is Worth a Thousand Words: Model Information Stealing Attacks via Scientific Plots

1 code implementation • 23 Feb 2023 • Boyang Zhang, Xinlei He, Yun Shen, Tianhao Wang, Yang Zhang

Given the simplicity and effectiveness of the attack method, our study indicates scientific plots indeed constitute a valid side channel for model information stealing attacks.

Two-Stage Co-Segmentation Network Based on Discriminative Representation for Recovering Human Mesh From Videos

no code implementations • CVPR 2023 • Boyang Zhang, Kehua Ma, Suping Wu, Zhixiang Yuan

However, most of the existing methods focus on the temporal consistency of videos, while ignoring the spatial representation in complex scenes, thus failing to recover a reasonable and smooth human mesh sequence under extreme illumination and chaotic backgrounds. To alleviate this problem, we propose a two-stage co-segmentation network based on discriminative representation for recovering human body meshes from videos.

D-CryptO: Deep learning-based analysis of colon organoid morphology from brightfield images

no code implementations • 12 Oct 2022 • Lyan Abdul, Jocelyn Xu, Alexander Sotra, Abbas Chaudary, Jerry Gao, Shravanthi Rajasekar, Nicky Anvari, Hamidreza Mahyar, Boyang Zhang

With D-CryptO, subtle variations in how colon organoids responded to the different chemotherapeutic drugs were detected, which suggest potentially distinct mechanisms of action.

Morphological Analysis

Spatio-temporal Tendency Reasoning for Human Body Pose and Shape Estimation from Videos

no code implementations • 7 Oct 2022 • Boyang Zhang, Suping Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin

Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal and spatial tendency and to fully excavate the spatio-temporal features of existing video data.

3D Human Pose Estimation, Temporal Sequences
