Search Results for author: Renxi Wang

Found 8 papers, 5 papers with code

Explore the Reasoning Capability of LLMs in the Chess Testbed

no code implementations11 Nov 2024 Shu Wang, Lei Ji, Renxi Wang, Wenxiao Zhao, Haokun Liu, Yifan Hou, Ying Nian Wu

Based on the observation that expert chess players employ a dual approach combining long-term strategic play with short-term tactical play along with language explanation, we propose improving the reasoning capability of large language models in chess by integrating annotated strategy and tactic.

ToolGen: Unified Tool Retrieval and Calling via Generation

1 code implementation4 Oct 2024 Renxi Wang, Xudong Han, Lei Ji, Shu Wang, Timothy Baldwin, Haonan Li

As large language models (LLMs) advance, their inability to autonomously execute tasks by directly interacting with external tools remains a critical limitation.

Retrieval Text Generation

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

1 code implementation31 Mar 2024 Lizhi Lin, Honglin Mu, Zenan Zhai, Minghan Wang, Yuxia Wang, Renxi Wang, Junjie Gao, Yixuan Zhang, Wanxiang Che, Timothy Baldwin, Xudong Han, Haonan Li

Generative models are rapidly gaining popularity and being integrated into everyday applications, raising concerns over their safe use as various vulnerabilities are exposed.

Red Teaming Survey

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents

1 code implementation18 Feb 2024 Renxi Wang, Haonan Li, Xudong Han, Yixuan Zhang, Timothy Baldwin

However, LLMs are optimized for language generation instead of tool use during training or alignment, limiting their effectiveness as agents.

Mathematical Reasoning Multi-hop Question Answering +2

Cannot find the paper you are looking for? You can Submit a new open access paper.