Search Results for author: Shan Dong

Found 2 papers, 1 paper with code

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

No code implementations • 7 Oct 2024 • Lei Wang, Shan Dong, Yuhui Xu, Hanze Dong, Yalu Wang, Amrita Saha, Ee-Peng Lim, Caiming Xiong, Doyen Sahoo

Although some recent benchmarks have been developed to evaluate the long-context capabilities of LLMs, there is a lack of benchmarks evaluating the mathematical reasoning abilities of LLMs over long contexts, which is crucial for LLMs' application in real-world scenarios.

Information Retrieval, Mathematical Reasoning

All in an Aggregated Image for In-Image Learning

1 code implementation • 28 Feb 2024 • Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and chain-of-thought reasoning into an aggregated image to enhance the capabilities of Large Multimodal Models (e.g., GPT-4V) in multimodal reasoning tasks.

Hallucination, In-Context Learning, +1
