1 code implementation • 5 Feb 2024 • Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang
To understand how pre-training with a next-token prediction objective contributes to the emergence of such reasoning capability, we propose that we can view an LM as deriving new conclusions by aggregating indirect reasoning paths seen at pre-training time.
no code implementations • 23 May 2023 • Alfonso Amayuelas, Liangming Pan, Wenhu Chen, William Wang
This paper investigates the capabilities of Large Language Models (LLMs) in the context of understanding their own knowledge and measuring their uncertainty.
1 code implementation • ICLR 2022 • Alfonso Amayuelas, Shuai Zhang, Susie Xi Rao, Ce Zhang
We introduce a set of models that use Neural Networks to create one-point vector embeddings to answer the queries.
no code implementations • 21 Dec 2020 • Naman Goel, Alfonso Amayuelas, Amit Deshpande, Amit Sharma
For example, in multi-stage settings where decisions are made in multiple screening rounds, we use our framework to derive the minimal distributions required to design a fair algorithm.