Search Results for author: Ruoxi Ning

Found 4 papers, 4 papers with code

NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens

1 code implementation • 18 Mar 2024 • Cunxiang Wang, Ruoxi Ning, Boqi Pan, Tonghui Wu, Qipeng Guo, Cheng Deng, Guangsheng Bao, Xiangkun Hu, Zheng Zhang, Qian Wang, Yue Zhang

The rapid advancement of Large Language Models (LLMs) has introduced a new frontier in natural language processing, particularly in understanding and processing long-context information.

Benchmarking, Question Answering

GLoRE: Evaluating Logical Reasoning of Large Language Models

1 code implementation • 13 Oct 2023 • Hanmeng Liu, Zhiyang Teng, Ruoxi Ning, Jian Liu, Qiji Zhou, Yue Zhang

Recently, large language models (LLMs), including notable models such as GPT-4 and burgeoning community models, have showcased significant general language understanding abilities.

Logical Reasoning, Natural Language Understanding

Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4

1 code implementation • 7 Apr 2023 • Hanmeng Liu, Ruoxi Ning, Zhiyang Teng, Jian Liu, Qiji Zhou, Yue Zhang

With the release of Generative Pretrained Transformer 4 (GPT-4), highlighted as "advanced" at reasoning tasks, we are eager to learn the GPT-4 performance on various logical reasoning tasks.

Logical Reasoning, Natural Language Inference, +2
