Search Results for author: José Cambronero

Found 14 papers, 2 papers with code

Solving Data-centric Tasks using Large Language Models

no code implementations18 Feb 2024 Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams

Large language models (LLMs) are rapidly replacing help forums like StackOverflow, and are especially helpful for non-professional programmers and end users.

CodeFusion: A Pre-trained Diffusion Model for Code Generation

no code implementations26 Oct 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

Imagine a developer who can only change their last line of code, how often would they have to start writing a function from scratch before it is correct?

Code Generation Denoising

Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

no code implementations16 Oct 2023 Ananya Singha, José Cambronero, Sumit Gulwani, Vu Le, Chris Parnin

Inspired by prior work, we generate a collection of self-supervised structural tasks (e. g. navigate to a cell and row; transpose the table) and evaluate the performance differences when using 8 formats.

In-Context Learning Navigate

DataVinci: Learning Syntactic and Semantic String Repairs

no code implementations21 Aug 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

DataVinci learns regular-expression-based patterns that cover a majority of values in a column and reports values that do not satisfy such patterns as data errors.

Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors

no code implementations29 Jun 2023 Tung Phung, Victor-Alexandru Pădurean, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

In our work, we systematically evaluate two models, ChatGPT (based on GPT-3. 5) and GPT-4, and compare their performance with human tutors for a variety of scenarios.

Benchmarking

Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

1 code implementation24 Jan 2023 Tung Phung, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming.

Repairing Bugs in Python Assignments Using Large Language Models

no code implementations29 Sep 2022 Jialu Zhang, José Cambronero, Sumit Gulwani, Vu Le, Ruzica Piskac, Gustavo Soares, Gust Verbruggen

We propose to use a large language model trained on code, such as Codex, to build an APR system -- MMAPR -- for introductory Python programming assignments.

Chunking Language Modelling +2

CORNET: Learning Table Formatting Rules By Example

no code implementations11 Aug 2022 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Mohammad Raza, Gust Verbruggen

Since we are the first to introduce conditional formatting, we compare CORNET to a wide range of symbolic and neural baselines adapted from related domains.

Program Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.