1 code implementation • 3 Apr 2025 • Haowei Lin, Xiangyu Wang, Ruilin Yan, Baizhou Huang, Haotian Ye, Jianhua Zhu, ZiHao Wang, James Zou, Jianzhu Ma, Yitao Liang
Moreover, LLM performance on KUMO tasks correlates strongly with results on newly released real-world reasoning benchmarks, underscoring KUMO's value as a robust, enduring assessment tool for genuine LLM reasoning capabilities.
no code implementations • 2 Nov 2024 • Baizhou Huang, Xiao Pu, Xiaojun Wan
Specifically, we formulate the watermark scrubbing attack as a constrained optimization problem by capturing its objectives with two distributions, a Watermark Distribution and a Fidelity Distribution.
no code implementations • 19 Jun 2024 • Junzhe Zhang, Huixuan Zhang, Xunjian Yin, Baizhou Huang, Xu Zhang, Xinyu Hu, Xiaojun Wan
Our benchmark facilitates independent correction of misreading and misrecognition errors by editing the corresponding knowledge component.
no code implementations • 22 May 2024 • Baizhou Huang, Xiaojun Wan
To this end, we introduce \textbf{WaterPool}, a simple yet effective key module that preserves a complete key sampling space required by imperceptibility while utilizing semantics-based search to improve the key restoration process.
1 code implementation • 4 Feb 2024 • Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, ZiHao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang
The ever-growing ecosystem of LLMs has posed a challenge in selecting the most appropriate pre-trained model to fine-tune amidst a sea of options.
1 code implementation • 23 Oct 2023 • Xunjian Yin, Baizhou Huang, Xiaojun Wan
With the rapid development of NLP, large-scale language models (LLMs) excel in various tasks across multiple domains now.
1 code implementation • 29 Sep 2023 • Baizhou Huang, Shuai Lu, Weizhu Chen, Xiaojun Wan, Nan Duan
We propose the Multi-Perspective Self-Consistency (MPSC) framework incorporating both inter- and intra-consistency across outputs from multiple perspectives.
no code implementations • 3 Sep 2022 • Baizhou Huang, Shikang Du, Xiaojun Wan
Crosstalk is a traditional Chinese theatrical performance art.