Search Results for author: Shengnan An

Found 8 papers, 4 papers with code

Make Your LLM Fully Utilize the Context

1 code implementation • 25 Apr 2024 • Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou

While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, a problem known as the lost-in-the-middle challenge. (A minimal probe sketch follows this entry.)

4k • Information Retrieval • +1
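The lost-in-the-middle phenomenon described above is commonly demonstrated with a needle-in-a-haystack probe: bury one key fact at varying depths of a long distractor context and check whether the model still retrieves it. The sketch below is illustrative only, not this paper's evaluation or training recipe, and `query_llm` is a hypothetical stand-in for a real model client:

```python
# Minimal needle-in-a-haystack probe for the lost-in-the-middle effect
# (an illustrative assumption, not the paper's method).

FILLER = "The sky was clear and the weather was mild that day. "
NEEDLE = "The secret code is 7c9f."
QUESTION = "What is the secret code?"

def build_prompt(depth: float, total_sentences: int = 200) -> str:
    """Insert the needle at a relative depth in [0, 1] within filler text."""
    pos = int(depth * total_sentences)
    sentences = [FILLER] * total_sentences
    sentences.insert(pos, NEEDLE + " ")
    return "".join(sentences) + "\n\n" + QUESTION

def query_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM call (e.g., an API client)."""
    raise NotImplementedError

if __name__ == "__main__":
    for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
        prompt = build_prompt(depth)
        # answer = query_llm(prompt)
        # print(depth, "7c9f" in answer)  # retrieval often dips at mid depths
        print(f"depth={depth:.2f}, prompt length={len(prompt)} chars")
```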

Compositional API Recommendation for Library-Oriented Code Generation

no code implementations • 29 Feb 2024 • Zexiong Ma, Shengnan An, Bing Xie, Zeqi Lin

However, performance remains unsatisfactory when generating library-oriented code, especially for libraries not present in the LLMs' training data.

Library-Oriented Code Generation

Learning From Mistakes Makes LLM Better Reasoner

1 code implementation • 31 Oct 2023 • Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

To further improve their reasoning capabilities, this work explores whether LLMs can LEarn from MistAkes (LEMA), akin to the human learning process. (A data-construction sketch follows this entry.)

GPT-4 • GSM8K • +2
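Read literally, the LEMA summary above hinges on supervised data that pairs a model's flawed reasoning with a correction. Here is a minimal sketch of that data construction under my own simplifying assumptions; the `CorrectionExample` record and prompt format are illustrative, not the authors' exact pipeline:

```python
# Sketch of mistake-correction pairs for supervised fine-tuning
# (my reading of the one-line summary, not the authors' pipeline).

from dataclasses import dataclass

@dataclass
class CorrectionExample:
    question: str
    wrong_rationale: str      # reasoning sampled from the model that led to an error
    corrected_rationale: str  # revised reasoning (e.g., written by a stronger model)

def to_finetune_record(ex: CorrectionExample) -> dict:
    """Format one mistake-correction pair as a fine-tuning record."""
    prompt = (
        f"Question: {ex.question}\n"
        f"Incorrect reasoning: {ex.wrong_rationale}\n"
        "Identify the mistake and give the corrected reasoning:"
    )
    return {"prompt": prompt, "completion": ex.corrected_rationale}

example = CorrectionExample(
    question="If 3 pens cost $6, how much do 5 pens cost?",
    wrong_rationale="3 pens cost $6, so 5 pens cost $6 + $5 = $11.",
    corrected_rationale="Each pen costs $6 / 3 = $2, so 5 pens cost 5 * $2 = $10.",
)
print(to_finetune_record(example))
```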

Skill-Based Few-Shot Selection for In-Context Learning

no code implementations • 23 May 2023 • Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Weizhu Chen, Jian-Guang Lou

Few-shot selection -- selecting appropriate examples for each test instance separately -- is important for in-context learning. (A toy selection sketch follows this entry.)

In-Context Learning • Semantic Parsing • +1
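Per-instance demonstration selection can be illustrated with a toy retriever: score each candidate example against the test input and keep the top-k. The sketch below substitutes plain token overlap for the paper's skill-based representation, so treat it as an assumption-laden stand-in rather than the proposed method:

```python
# Toy per-instance few-shot selection. A real skill-based selector would
# score examples with a learned skill representation, not surface tokens.

def overlap(a: str, b: str) -> float:
    """Jaccard similarity over lowercased tokens."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / max(1, len(ta | tb))

def select_examples(test_input: str, pool: list[str], k: int = 2) -> list[str]:
    """Rank the candidate pool by similarity to this test instance."""
    return sorted(pool, key=lambda ex: overlap(test_input, ex), reverse=True)[:k]

pool = [
    "Parse 'list flights from NYC to LA' into a SQL query.",
    "Translate 'walk twice after jump' into action symbols.",
    "Parse 'show hotels in Paris under $200' into a SQL query.",
]
print(select_examples("Parse 'find trains from Boston to DC' into SQL.", pool))
```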

How Do In-Context Examples Affect Compositional Generalization?

no code implementations • 8 May 2023 • Shengnan An, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Jian-Guang Lou, Dongmei Zhang

Compositional generalization--understanding unseen combinations of seen primitives--is an essential reasoning capability in human intelligence.

In-Context Learning

Does Deep Learning Learn to Abstract? A Systematic Probing Framework

1 code implementation • 23 Feb 2023 • Shengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng, Jian-Guang Lou

Abstraction is a desirable capability for deep learning models: the ability to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context.

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

no code implementations • 7 Mar 2022 • Shengnan An, Yifei Li, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou

This motivates us to propose input-tuning, which fine-tunes both the continuous prompts and the input representations, leading to a more effective way to adapt unfamiliar inputs to frozen PLMs. (A minimal sketch follows this entry.)

Language Modelling • Natural Language Understanding • +1
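The input-tuning summary names two trainable pieces around a frozen PLM: a continuous prompt and the input representations. Below is a minimal PyTorch sketch of that division of labor; `FrozenToyPLM`, the adapter shape, and all hyperparameters are illustrative assumptions, not the paper's architecture:

```python
# Sketch of input-tuning: freeze the pretrained model, train (1) a continuous
# prompt and (2) a small adapter over the input embeddings.

import torch
import torch.nn as nn

VOCAB, DIM, PROMPT_LEN = 100, 32, 4

class FrozenToyPLM(nn.Module):
    """Stand-in for a frozen pretrained LM: embeddings plus a linear head."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        self.head = nn.Linear(DIM, VOCAB)
    def forward(self, inputs_embeds):                 # (B, T, DIM)
        return self.head(inputs_embeds.mean(dim=1))  # (B, VOCAB)

plm = FrozenToyPLM()
for p in plm.parameters():
    p.requires_grad = False  # the pretrained model stays frozen

prompt = nn.Parameter(torch.randn(PROMPT_LEN, DIM) * 0.02)  # continuous prompt
adapter = nn.Linear(DIM, DIM)  # reshapes unfamiliar input embeddings

opt = torch.optim.Adam([prompt, *adapter.parameters()], lr=1e-3)
tokens = torch.randint(0, VOCAB, (8, 10))  # toy batch of "unfamiliar" inputs
labels = torch.randint(0, VOCAB, (8,))

for _ in range(3):  # a few toy training steps
    x = adapter(plm.embed(tokens))                       # tuned input representations
    x = torch.cat([prompt.expand(8, -1, -1), x], dim=1)  # prepend the prompt
    loss = nn.functional.cross_entropy(plm(x), labels)
    opt.zero_grad()
    loss.backward()  # gradients reach only the prompt and the adapter
    opt.step()
    print(f"loss: {loss.item():.3f}")
```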

Compositional Generalization by Learning Analytical Expressions

1 code implementation • NeurIPS 2020 • Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Compositional generalization is a basic and essential intellectual capability of human beings, allowing us to readily recombine known parts.

Hierarchical Reinforcement Learning
