Search Results for author: Armando Solar-Lezama

Found 47 papers, 20 papers with code

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

no code implementations12 Mar 2024 Naman jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica

Large Language Models (LLMs) applied to code-related applications have emerged as a prominent field, attracting significant interest from both academia and industry.

Code Generation

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

no code implementations29 Feb 2024 Alex Gu, Wen-Ding Li, Naman jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama

In this work, we focus on these counterfeit samples: programs sampled from a language model that 1) have a high enough log-probability to be generated at a moderate temperature and 2) pass weak correctness checks.

Code Generation Language Modelling

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

no code implementations5 Jan 2024 Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang

The best setup, GPT-4 with chain of thought (CoT), achieves a pass@1 of 75% and 81% on input and output prediction, respectively.

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

1 code implementation23 Oct 2023 Theo X. Olausson, Alex Gu, Benjamin Lipkin, Cedegao E. Zhang, Armando Solar-Lezama, Joshua B. Tenenbaum, Roger Levy

Logical reasoning, i. e., deductively inferring the truth value of a conclusion from a set of premises, is an important task for artificial intelligence with wide potential impacts on science, mathematics, and society.

Logical Reasoning

Learning a Hierarchical Planner from Humans in Multiple Generations

no code implementations17 Oct 2023 Leonardo Hernandez Cano, Yewen Pu, Robert D. Hawkins, Josh Tenenbaum, Armando Solar-Lezama

Compared to learning from demonstrations or experiences, programmatic learning allows the machine to acquire a novel skill as soon as the program is written, and, by building a library of programs, a machine can quickly learn how to perform complex tasks.

Is Self-Repair a Silver Bullet for Code Generation?

1 code implementation16 Jun 2023 Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao, Armando Solar-Lezama

We hypothesize that this is because self-repair is bottlenecked by the model's ability to provide feedback on its own code; using a stronger model to artificially boost the quality of the feedback, we observe substantially larger performance gains.

Code Generation

Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

no code implementations15 Jun 2023 Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

We curate a comprehensive dataset of 4, 550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree.

Electrical Engineering Few-Shot Learning +3

Top-Down Synthesis for Library Learning

1 code implementation29 Nov 2022 Matthew Bowers, Theo X. Olausson, Lionel Wong, Gabriel Grand, Joshua B. Tenenbaum, Kevin Ellis, Armando Solar-Lezama

This paper introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL).

Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

no code implementations22 Nov 2022 Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret, Tonio Buonassisi, Kate Saenko, Armando Solar-Lezama, Iddo Drori

For example, asking a model to generate a varying number of the same object to measure its ability to count or providing a text prompt with several objects that each have a different attribute to identify its ability to match objects and attributes correctly.

Attribute Text-to-Image Generation

ObSynth: An Interactive Synthesis System for Generating Object Models from Natural Language Specifications

no code implementations20 Oct 2022 Alex Gu, Tamara Mitrovska, Daniela Velez, Jacob Andreas, Armando Solar-Lezama

We introduce ObSynth, an interactive system leveraging the domain knowledge embedded in large language models (LLMs) to help users design object models from high level natural language prompts.

Object

Neurosymbolic Programming for Science

no code implementations10 Oct 2022 Jennifer J. Sun, Megan Tjandrasuwita, Atharva Sehgal, Armando Solar-Lezama, Swarat Chaudhuri, Yisong Yue, Omar Costilla-Reyes

Neurosymbolic Programming (NP) techniques have the potential to accelerate scientific discovery.

AutumnSynth: Synthesis of Reactive Programs with Structured Latent State

no code implementations NeurIPS Workshop AIPLANS 2021 Ria Das, Joshua B. Tenenbaum, Armando Solar-Lezama, Zenna Tavares

The human ability to efficiently discover causal theories of their environments from observations is a feat of nature that remains elusive in machines.

Program Synthesis

Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

1 code implementation NeurIPS 2021 Yichen David Yang, Jeevana Priya Inala, Osbert Bastani, Yewen Pu, Armando Solar-Lezama, Martin Rinard

Our results demonstrate that our approach can obtain the benefits of program-guided reinforcement learning without requiring the user to provide a new guiding program for every new task.

Program Synthesis reinforcement-learning +1

Neurosymbolic Transformers for Multi-Agent Communication

1 code implementation NeurIPS 2020 Jeevana Priya Inala, Yichen Yang, James Paulos, Yewen Pu, Osbert Bastani, Vijay Kumar, Martin Rinard, Armando Solar-Lezama

We study the problem of inferring communication structures that can solve cooperative multi-agent planning problems while minimizing the amount of communication.

Representing Partial Programs with Blended Abstract Semantics

no code implementations ICLR 2021 Maxwell Nye, Yewen Pu, Matthew Bowers, Jacob Andreas, Joshua B. Tenenbaum, Armando Solar-Lezama

In this search process, a key challenge is representing the behavior of a partially written program before it can be executed, to judge if it is on the right track and predict where to search next.

Program Synthesis

Causal Inductive Synthesis Corpus

no code implementations NeurIPS Workshop CAP 2020 Zenna Tavares, Ria Das, Elizabeth Weeks, Kate Lin, Joshua B. Tenenbaum, Armando Solar-Lezama

We introduce the Causal Inductive Synthesis Corpus (CISC) -- a manually constructed collection of interactive domains.

Model Discovery

Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Construction from Human Design Sequences

1 code implementation5 Oct 2020 Karl D. D. Willis, Yewen Pu, Jieliang Luo, Hang Chu, Tao Du, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech Matusik

Parametric computer-aided design (CAD) is a standard paradigm used to design manufactured objects, where a 3D shape is represented as a program supported by the CAD software.

CAD Reconstruction Program Synthesis

Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Reconstruction

no code implementations28 Sep 2020 Karl Willis, Yewen Pu, Jieliang Luo, Hang Chu, Tao Du, Joseph Lambourne, Armando Solar-Lezama, Wojciech Matusik

We provide a dataset of 8, 625 designs, comprising sequential sketch and extrude modeling operations, together with a complementary environment called the Fusion 360 Gym, to assist with performing CAD reconstruction.

CAD Reconstruction

Program Synthesis with Pragmatic Communication

no code implementations NeurIPS 2020 Yewen Pu, Kevin Ellis, Marta Kryven, Josh Tenenbaum, Armando Solar-Lezama

Given a specification, we score a candidate program both on its consistency with the specification, and also whether a rational speaker would chose this particular specification to communicate that program.

Inductive Bias Program Synthesis

Verifiably Safe Exploration for End-to-End Reinforcement Learning

1 code implementation2 Jul 2020 Nathan Hunt, Nathan Fulton, Sara Magliacane, Nghia Hoang, Subhro Das, Armando Solar-Lezama

We also prove that our method of enforcing the safety constraints preserves all safe policies from the original environment.

object-detection Object Detection +3

DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

3 code implementations15 Jun 2020 Kevin Ellis, Catherine Wong, Maxwell Nye, Mathias Sable-Meyer, Luc Cary, Lucas Morales, Luke Hewitt, Armando Solar-Lezama, Joshua B. Tenenbaum

It builds expertise by creating programming languages for expressing domain concepts, together with neural networks to guide the search for programs within these languages.

Drawing Pictures Program induction +1

Synthesizing Programmatic Policies that Inductively Generalize

no code implementations ICLR 2020 Jeevana Priya Inala, Osbert Bastani, Zenna Tavares, Armando Solar-Lezama

We show that our algorithm can be used to learn policies that inductively generalize to novel environments, whereas traditional neural network policies fail to do so.

Imitation Learning Reinforcement Learning (RL)

Write, Execute, Assess: Program Synthesis with a REPL

no code implementations NeurIPS 2019 Kevin Ellis, Maxwell Nye, Yewen Pu, Felix Sosa, Josh Tenenbaum, Armando Solar-Lezama

We present a neural program synthesis approach integrating components which write, execute, and assess code to navigate the search space of possible programs.

Navigate Program Synthesis

Deductive Optimization of Relational Data Storage

1 code implementation8 Mar 2019 John K. Feser, Samuel Madden, Nan Tang, Armando Solar-Lezama

Optimizing the physical data storage and retrieval of data are two key database management problems.

Programming Languages Databases

Learning to Infer Program Sketches

1 code implementation17 Feb 2019 Maxwell Nye, Luke Hewitt, Joshua Tenenbaum, Armando Solar-Lezama

Our goal is to build systems which write code automatically from the kinds of specifications humans can most easily provide, such as examples and natural language instruction.

Memorization Program Synthesis

Probabilistic Verification of Fairness Properties via Concentration

1 code implementation2 Dec 2018 Osbert Bastani, Xin Zhang, Armando Solar-Lezama

As machine learning systems are increasingly used to make real world legal and financial decisions, it is of paramount importance that we develop algorithms to verify that these systems do not discriminate against minorities.

BIG-bench Machine Learning Fairness

Library Learning for Neurally-Guided Bayesian Program Induction

no code implementations1 Dec 2018 Kevin Ellis, Lucas Morales, Mathias Sablé-Meyer, Armando Solar-Lezama, Joshua B. Tenenbaum

Successful approaches to program induction require a hand-engineered domain-specific language (DSL), constraining the space of allowed programs and imparting prior knowledge of the domain.

Program induction regression +1

Learning Libraries of Subroutines for Neurally–Guided Bayesian Program Induction

no code implementations NeurIPS 2018 Kevin Ellis, Lucas Morales, Mathias Sablé-Meyer, Armando Solar-Lezama, Josh Tenenbaum

Successful approaches to program induction require a hand-engineered domain-specific language (DSL), constraining the space of allowed programs and imparting prior knowledge of the domain.

Program induction regression +1

Verifiable Reinforcement Learning via Policy Extraction

1 code implementation NeurIPS 2018 Osbert Bastani, Yewen Pu, Armando Solar-Lezama

While deep reinforcement learning has successfully solved many challenging control tasks, its real-world applicability has been limited by the inability to ensure the safety of learned policies.

Imitation Learning Model Compression +2

The Three Pillars of Machine Programming

no code implementations20 Mar 2018 Justin Gottschlich, Armando Solar-Lezama, Nesime Tatbul, Michael Carbin, Martin Rinard, Regina Barzilay, Saman Amarasinghe, Joshua B. Tenenbaum, Tim Mattson

In this position paper, we describe our vision of the future of machine programming through a categorical examination of three pillars of research.

BIG-bench Machine Learning Position

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

no code implementations NeurIPS 2018 Xin Zhang, Armando Solar-Lezama, Rishabh Singh

We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output.

REAS: Combining Numerical Optimization with SAT Solving

no code implementations13 Feb 2018 Jeevana Priya Inala, Sicun Gao, Soonho Kong, Armando Solar-Lezama

In this paper, we present ReaS, a technique that combines numerical optimization with SAT solving to synthesize unknowns in a program that involves discrete and floating point computation.

Learning to select examples for program synthesis

no code implementations ICLR 2018 Yewen Pu, Zachery Miranda, Armando Solar-Lezama, Leslie Pack Kaelbling

In this paper we address this challenge by constructing a representative subset of examples that is both small and is able to constrain the solver sufficiently.

Program Synthesis

SyGuS-Comp 2017: Results and Analysis

no code implementations29 Nov 2017 Rajeev Alur, Dana Fisman, Rishabh Singh, Armando Solar-Lezama

Syntax-Guided Synthesis (SyGuS) is the computational problem of finding an implementation f that meets both a semantic constraint given by a logical formula phi in a background theory T, and a syntactic constraint given by a grammar G, which specifies the allowed set of candidate implementations.

Selecting Representative Examples for Program Synthesis

1 code implementation ICML 2018 Yewen Pu, Zachery Miranda, Armando Solar-Lezama, Leslie Pack Kaelbling

Program synthesis is a class of regression problems where one seeks a solution, in the form of a source-code program, mapping the inputs to their corresponding outputs exactly.

Program Synthesis

Capturing the Future by Replaying the Past

2 code implementations28 Oct 2017 James Koppel, Gabriel Scherer, Armando Solar-Lezama

Along the way, we explain delimited continuations and monadic reflection, show how the Filinski construction lets thermometer continuations express any monadic effect, share an elegant special-case for nondeterminism, and discuss why our construction is not prevented by theoretical results that exceptions and state cannot macro-express continuations.

Programming Languages

One Tool, Many Languages: Language-Parametric Transformation with Incremental Parametric Syntax

3 code implementations14 Jul 2017 James Koppel, Varot Premtoon, Armando Solar-Lezama

We present a new approach for building source-to-source transformations that can run on multiple programming languages, based on a new way of representing programs called incremental parametric syntax.

Programming Languages D.3.4; D.3.1

Learning to Acquire Information

1 code implementation20 Apr 2017 Yewen Pu, Leslie P. Kaelbling, Armando Solar-Lezama

Finding the optimal subset of observations is intractable in general, thus we focus on the problem of active diagnosis, where the agent selects the next most-informative observation based on the results of previous observations.

Sampling for Bayesian Program Learning

no code implementations NeurIPS 2016 Kevin Ellis, Armando Solar-Lezama, Josh Tenenbaum

Towards learning programs from data, we introduce the problem of sampling programs from posterior distributions conditioned on that data.

Program Synthesis

SyGuS-Comp 2016: Results and Analysis

no code implementations23 Nov 2016 Rajeev Alur, Dana Fisman, Rishabh Singh, Armando Solar-Lezama

Syntax-Guided Synthesis (SyGuS) is the computational problem of finding an implementation f that meets both a semantic constraint given by a logical formula $\varphi$ in a background theory T, and a syntactic constraint given by a grammar G, which specifies the allowed set of candidate implementations.

sk_p: a neural program corrector for MOOCs

no code implementations11 Jul 2016 Yewen Pu, Karthik Narasimhan, Armando Solar-Lezama, Regina Barzilay

We present a novel technique for automatic program correction in MOOCs, capable of fixing both syntactic and semantic errors without manual, problem specific correction strategies.

Machine Translation Translation

Unsupervised Learning by Program Synthesis

no code implementations NeurIPS 2015 Kevin Ellis, Armando Solar-Lezama, Josh Tenenbaum

We introduce an unsupervised learning algorithmthat combines probabilistic modeling with solver-based techniques for program synthesis. We apply our techniques to both a visual learning domain and a language learning problem, showing that our algorithm can learn many visual concepts from only a few examplesand that it can recover some English inflectional morphology. Taken together, these results give both a new approach to unsupervised learning of symbolic compositional structures, and a technique for applying program synthesis tools to noisy data.

Program Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.