no code implementations • 13 Apr 2025 • Mingrui Zan, Yunquan Zhang, Boyang Zhang, Fangming Liu, Daning Cheng
The evaluation benchmarks are categorized into 6 primary abilities and 11 sub-abilities in human aspect.
no code implementations • 26 Feb 2025 • Ziyue Jiang, Yi Ren, RuiQi Li, Shengpeng Ji, Boyang Zhang, Zhenhui Ye, Chen Zhang, Bai Jionghao, Xiaoda Yang, Jialong Zuo, Yu Zhang, Rui Liu, Xiang Yin, Zhou Zhao
While recent zero-shot text-to-speech (TTS) models have significantly improved speech quality and expressiveness, mainstream systems still suffer from issues related to speech-text alignment modeling: 1) models without explicit speech-text alignment modeling exhibit less robustness, especially for hard sentences in practical applications; 2) predefined alignment-based models suffer from naturalness constraints of forced alignments.
no code implementations • 19 Feb 2025 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Meiqi Tu, Fangmin Liu, Jiake Tian
The exponential growth in parameter size and computational complexity of deep models poses significant challenges for efficient deployment.
no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu, Jiake Tian
Low-rank factorization is a popular model compression technique that minimizes the error $\delta$ between approximated and original weight matrices.
no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu, WenGuang Chen
A key challenge is effectively leveraging compression errors and defining the boundaries for lossless compression to minimize model loss.
no code implementations • 9 Dec 2024 • Boyang Zhang, Daning Cheng, Yunquan Zhang, Fangmin Liu
We introduce a deep model series expansion framework to address this issue, enabling rapid and accurate approximation of unquantized models without calibration sets or fine-tuning.
no code implementations • 30 Jul 2024 • Boyang Zhang, Yicong Tan, Yun Shen, Ahmed Salem, Michael Backes, Savvas Zannettou, Yang Zhang
Through attacks on implemented and deployable agents in multi-agent scenarios, we accentuate the realistic risks associated with these vulnerabilities.
no code implementations • 3 Nov 2023 • Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang
Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP.
1 code implementation • 23 Feb 2023 • Boyang Zhang, Xinlei He, Yun Shen, Tianhao Wang, Yang Zhang
Given the simplicity and effectiveness of the attack method, our study indicates scientific plots indeed constitute a valid side channel for model information stealing attacks.
no code implementations • CVPR 2023 • Boyang Zhang, Kehua Ma, Suping Wu, Zhixiang Yuan
However, most of the existing methods focus on the temporal consistency of videos, while ignoring the spatial representation in complex scenes, thus failing to recover a reasonable and smooth human mesh sequence under extreme illumination and chaotic backgrounds. To alleviate this problem, we propose a two-stage co-segmentation network based on discriminative representation for recovering human body meshes from videos.
no code implementations • 12 Oct 2022 • Lyan Abdul, Jocelyn Xu, Alexander Sotra, Abbas Chaudary, Jerry Gao, Shravanthi Rajasekar, Nicky Anvari, Hamidreza Mahyar, Boyang Zhang
With D-CryptO, subtle variations in how colon organoids responded to the different chemotherapeutic drugs were detected, which suggest potentially distinct mechanisms of action.
no code implementations • 7 Oct 2022 • Boyang Zhang, Suping Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin
Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal and spatial tendency and to fully excavate the spatio-temporal features of existing video data.
Ranked #61 on
3D Human Pose Estimation
on MPI-INF-3DHP