Search Results for author: Nathanael Schärli

Found 8 papers, 4 papers with code

Measuring Compositional Generalization: A Comprehensive Method on Realistic Data

3 code implementations • ICLR 2020 • Daniel Keysers, Nathanael Schärli, Nathan Scales, Hylke Buisman, Daniel Furrer, Sergii Kashubin, Nikola Momchev, Danila Sinopalnikov, Lukasz Stafiniak, Tibor Tihon, Dmitry Tsarkov, Xiao Wang, Marc van Zee, Olivier Bousquet

We present a large and realistic natural language question answering dataset that is constructed according to this method, and we use it to analyze the compositional generalization ability of three machine learning architectures.

Ranked #5 on Semantic Parsing on CFQ

BIG-bench Machine Learning Question Answering +1

32,806

Paper
Code

Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures

1 code implementation • 17 Jul 2020 • Daniel Furrer, Marc van Zee, Nathan Scales, Nathanael Schärli

While mainstream machine learning methods are known to have limited ability to compositionally generalize, new architectures and techniques continue to be proposed to address this limitation.

Language Modelling Semantic Parsing

32,804

Paper
Code

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

1 code implementation • 17 Oct 2022 • Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, Jason Wei

BIG-Bench (Srivastava et al., 2022) is a diverse evaluation suite that focuses on tasks believed to be beyond the capabilities of current language models.

Language Modelling

375

Paper
Code

Large Language Models Can Be Easily Distracted by Irrelevant Context

1 code implementation • 31 Jan 2023 • Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou

We use this benchmark to measure the distractibility of cutting-edge prompting techniques for large language models, and find that the model performance is dramatically decreased when irrelevant information is included.

Arithmetic Reasoning Language Modelling +1

Paper
Code

*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task

no code implementations • 15 Dec 2020 • Dmitry Tsarkov, Tibor Tihon, Nathan Scales, Nikola Momchev, Danila Sinopalnikov, Nathanael Schärli

We present *-CFQ ("star-CFQ"): a suite of large-scale datasets of varying scope based on the CFQ semantic parsing benchmark, designed for principled investigation of the scalability of machine learning systems in a realistic compositional task setting.

BIG-bench Machine Learning Semantic Parsing

Paper
Add Code

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models

no code implementations • 21 May 2022 • Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi

Chain-of-thought prompting has demonstrated remarkable performance on various natural language reasoning tasks.

Ranked #96 on Arithmetic Reasoning on GSM8K

Arithmetic Reasoning Math

Paper
Add Code

Compositional Semantic Parsing with Large Language Models

no code implementations • 29 Sep 2022 • Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou

Humans can reason compositionally when presented with new tasks.

Ranked #1 on Semantic Parsing on CFQ

Semantic Parsing

Paper
Add Code

Teaching Large Language Models to Self-Debug

no code implementations • 11 Apr 2023 • Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou

In particular, we demonstrate that Self-Debugging can teach the large language model to perform rubber duck debugging; i. e., without any human feedback on the code correctness or error messages, the model is able to identify its mistakes by investigating the execution results and explaining the generated code in natural language.

Ranked #10 on Code Generation on MBPP

Code Generation Language Modelling +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.