Search Results for author: Shansan Gong

Found 9 papers, 7 papers with code

Training-Free Long-Context Scaling of Large Language Models

1 code implementation • 27 Feb 2024 • Chenxin An, Fei Huang, Jun Zhang, Shansan Gong, Xipeng Qiu, Chang Zhou, Lingpeng Kong

The ability of Large Language Models (LLMs) to process and generate coherent text is markedly weakened when the number of input tokens exceeds their pretraining length.

16k

214

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

no code implementations • 21 Feb 2024 • Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong

Multimodal reasoning stands as a pivotal capability for large vision-language models (LVLMs).

Geometry Problem Solving Molecular Property Prediction +2

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

1 code implementation • 12 Feb 2024 • Jiacheng Ye, Shansan Gong, Liheng Chen, Lin Zheng, Jiahui Gao, Han Shi, Chuan Wu, Zhenguo Li, Wei Bi, Lingpeng Kong

This work explores the integration of diffusion models and Chain-of-Thought (CoT), a well-established technique to improve the reasoning ability in autoregressive language models.

Math

DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models

1 code implementation • 9 Oct 2023 • Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong

Diffusion models have gained prominence in generating high-quality sequences of text.

669

L-Eval: Instituting Standardized Evaluation for Long Context Language Models

3 code implementations • 20 Jul 2023 • Chenxin An, Shansan Gong, Ming Zhong, Xingjian Zhao, Mukai Li, Jun Zhang, Lingpeng Kong, Xipeng Qiu

Recently, there has been growing interest in extending the context length of large language models (LLMs), aiming to effectively process long inputs of one turn or conversations with more extensive histories.

Instruction Following

11,092

In-Context Learning with Many Demonstration Examples

1 code implementation • 9 Feb 2023 • Mukai Li, Shansan Gong, Jiangtao Feng, Yiheng Xu, Jun Zhang, Zhiyong Wu, Lingpeng Kong

Based on EVALM, we scale up the size of examples efficiently in both instruction tuning and in-context learning to explore the boundary of the benefits from more annotated data.

16k 8k +2

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

1 code implementation • 17 Oct 2022 • Shansan Gong, Mukai Li, Jiangtao Feng, Zhiyong Wu, Lingpeng Kong

Bringing together theoretical analysis and empirical evidence, we demonstrate the great potential of diffusion models in complex conditional language generation tasks.

Text Generation

669

Few-Shot Natural Language Inference Generation with PDD: Prompt and Dynamic Demonstration

no code implementations • 21 May 2022 • Kaijian Li, Shansan Gong, Kenny Q. Zhu

The Natural Language Inference Generation task is to generate a text hypothesis given a text premise and a logical relation between the two.

Data Augmentation Natural Language Inference +1

Positive, Negative and Neutral: Modeling Implicit Feedback in Session-based News Recommendation

1 code implementation • 12 May 2022 • Shansan Gong, Kenny Q. Zhu

News recommendation for anonymous readers is a useful but challenging task for many news portals, where interactions between readers and articles are limited within a temporary login session.

News Recommendation Session-Based Recommendations

