Search Results for author: Federico Cassano

Found 10 papers, 9 papers with code

SelfCodeAlign: Self-Alignment for Code Generation

2 code implementations31 Oct 2024 Yuxiang Wei, Federico Cassano, Jiawei Liu, Yifeng Ding, Naman jain, Zachary Mueller, Harm de Vries, Leandro von Werra, Arjun Guha, Lingming Zhang

In our primary experiments, we use SelfCodeAlign with CodeQwen1. 5-7B to generate a dataset of 74k instruction-response pairs.

Code Generation HumanEval

Planning In Natural Language Improves LLM Search For Code Generation

2 code implementations5 Sep 2024 Evan Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, Will Song, Vaskar Nath, Ziwen Han, Sean Hendryx, Summer Yue, Hugh Zhang

We empirically demonstrate that this lack of diversity can be mitigated by searching over candidate plans for solving a problem in natural language.

Code Generation Diversity +1

DafnyBench: A Benchmark for Formal Software Verification

1 code implementation12 Jun 2024 Chloe Loughridge, Qinyi Sun, Seth Ahrenbach, Federico Cassano, Chuyue Sun, Ying Sheng, Anish Mudide, Md Rakib Hossain Misu, Nada Amin, Max Tegmark

We introduce DafnyBench, the largest benchmark of its kind for training and evaluating machine learning systems for formal software verification.

Type Prediction With Program Decomposition and Fill-in-the-Type Training

1 code implementation25 May 2023 Federico Cassano, Ming-Ho Yee, Noah Shinn, Arjun Guha, Steven Holtzen

TypeScript and Python are two programming languages that support optional type annotations, which are useful but tedious to introduce and maintain.

Type prediction

Reflexion: Language Agents with Verbal Reinforcement Learning

4 code implementations NeurIPS 2023 Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao

Large language models (LLMs) have been increasingly used to interact with external environments (e. g., games, compilers, APIs) as goal-driven agents.

Decision Making HumanEval +3

Cannot find the paper you are looking for? You can Submit a new open access paper.