Search Results for author: Min Cai

Found 2 papers, 1 paper with code

AvalonBench: Evaluating LLMs Playing the Game of Avalon

1 code implementation • 8 Oct 2023 • Jonathan Light, Min Cai, Sheng Shen, Ziniu Hu

In this paper, we explore the potential of Large Language Model (LLM) agents in playing the strategic social deduction game Resistance Avalon.

Decision Making

Self-Convinced Prompting: Few-Shot Question Answering with Repeated Introspection

No code implementations • 8 Oct 2023 • Haodi Zhang, Min Cai, Xinhe Zhang, Chen Jason Zhang, Rui Mao, Kaishun Wu

While large language models (LLMs) such as ChatGPT and PaLM have demonstrated remarkable performance in various language understanding and generation tasks, their capabilities in complex reasoning and intricate knowledge utilization still fall short of human-level proficiency.

Miscellaneous, Question Answering
