Search Results for author: Sumit Gulwani

Found 34 papers, 5 papers with code

Exploring Interaction Patterns for Debugging: Enhancing Conversational Capabilities of AI-assistants

no code implementations9 Feb 2024 Bhavya Chopra, Yasharth Bajpai, Param Biyani, Gustavo Soares, Arjun Radhakrishna, Chris Parnin, Sumit Gulwani

The widespread availability of Large Language Models (LLMs) within Integrated Development Environments (IDEs) has led to their speedy adoption.

Fault localization

Generative AI for Education (GAIED): Advances, Opportunities, and Challenges

no code implementations2 Feb 2024 Paul Denny, Sumit Gulwani, Neil T. Heffernan, Tanja Käser, Steven Moore, Anna N. Rafferty, Adish Singla

This survey article has grown out of the GAIED (pronounced "guide") workshop organized by the authors at the NeurIPS 2023 conference.

CodeFusion: A Pre-trained Diffusion Model for Code Generation

no code implementations26 Oct 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

Imagine a developer who can only change their last line of code, how often would they have to start writing a function from scratch before it is correct?

Code Generation Denoising

TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World

no code implementations26 Oct 2023 Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Ananya Singha, Mukul Singh, Gust Verbruggen

Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance.

Code Generation Sentence +2

Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

no code implementations16 Oct 2023 Ananya Singha, José Cambronero, Sumit Gulwani, Vu Le, Chris Parnin

Inspired by prior work, we generate a collection of self-supervised structural tasks (e. g. navigate to a cell and row; transpose the table) and evaluate the performance differences when using 8 formats.

In-Context Learning Navigate

Augmented Embeddings for Custom Retrievals

no code implementations9 Oct 2023 Anirudh Khatry, Yasharth Bajpai, Priyanshu Gupta, Sumit Gulwani, Ashish Tiwari

The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is large, such as 10, 25, 50 or even 100 (relaxed).

Information Retrieval Retrieval

DataVinci: Learning Syntactic and Semantic String Repairs

no code implementations21 Aug 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

DataVinci learns regular-expression-based patterns that cover a majority of values in a column and reports values that do not satisfy such patterns as data errors.

Demonstration of CORNET: A System For Learning Spreadsheet Formatting Rules By Example

no code implementations14 Aug 2023 Mukul Singh, Jose Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

After the user provides one or two formatted cells as examples, CORNET generates formatting rule suggestions for the user to apply to the spreadsheet.

Management Program Synthesis

Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors

no code implementations29 Jun 2023 Tung Phung, Victor-Alexandru Pădurean, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

In our work, we systematically evaluate two models, ChatGPT (based on GPT-3. 5) and GPT-4, and compare their performance with human tutors for a variety of scenarios.

Benchmarking

GrACE: Generation using Associated Code Edits

no code implementations23 May 2023 Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

In our experiments with two datasets, the knowledge of prior edits boosts the performance of the LLMs significantly and enables them to generate 29% and 54% more correctly edited code in top-1 suggestions relative to the current state-of-the-art symbolic and neural approaches, respectively.

Bug fixing Code Generation

From Words to Code: Harnessing Data for Program Synthesis from Natural Language

no code implementations2 May 2023 Anirudh Khatry, Joyce Cahoon, Jordan Henkel, Shaleen Deep, Venkatesh Emani, Avrilia Floratou, Sumit Gulwani, Vu Le, Mohammad Raza, Sherry Shi, Mukul Singh, Ashish Tiwari

Existing approaches have utilized data context in a limited way by simply adding relevant information from the input data into the prompts sent to the LLM.

Program Synthesis

Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

1 code implementation24 Jan 2023 Tung Phung, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming.

Repairing Bugs in Python Assignments Using Large Language Models

no code implementations29 Sep 2022 Jialu Zhang, José Cambronero, Sumit Gulwani, Vu Le, Ruzica Piskac, Gustavo Soares, Gust Verbruggen

We propose to use a large language model trained on code, such as Codex, to build an APR system -- MMAPR -- for introductory Python programming assignments.

Chunking Language Modelling +2

CORNET: Learning Table Formatting Rules By Example

no code implementations11 Aug 2022 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Mohammad Raza, Gust Verbruggen

Since we are the first to introduce conditional formatting, we compare CORNET to a wide range of symbolic and neural baselines adapted from related domains.

Program Synthesis

Overwatch: Learning Patterns in Code Edit Sequences

no code implementations25 Jul 2022 Yuhao Zhang, Yasharth Bajpai, Priyanshu Gupta, Ameya Ketkar, Miltiadis Allamanis, Titus Barik, Sumit Gulwani, Arjun Radhakrishna, Mohammad Raza, Gustavo Soares, Ashish Tiwari

Our experiments show that Overwatch has 78% precision and that Overwatch not only completed edits when developers missed the opportunity to use the IDE tool support but also predicted new edits that have no tool support in the IDE.

Neurosymbolic Repair for Low-Code Formula Languages

no code implementations24 Jul 2022 Rohan Bavishi, Harshit Joshi, José Pablo Cambronero Sánchez, Anna Fariha, Sumit Gulwani, Vu Le, Ivan Radicek, Ashish Tiwari

To address this problem, we developed LaMirage, a LAst-MIle RepAir-engine GEnerator that combines symbolic and neural techniques to perform last-mile repair in low-code formula languages.

Synchromesh: Reliable code generation from pre-trained language models

1 code implementation ICLR 2022 Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

Then, Synchromesh feeds the examples to a pre-trained language model and samples programs using Constrained Semantic Decoding (CSD): a general framework for constraining the output to a set of valid programs in the target language.

Code Generation Language Modelling +1

Multi-modal Program Inference: a Marriage of Pre-trainedLanguage Models and Component-based Synthesis

no code implementations3 Sep 2021 Kia Rahmani, Mohammad Raza, Sumit Gulwani, Vu Le, Daniel Morris, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description.

Program Synthesis

Programming by Rewards

no code implementations14 Jul 2020 Nagarajan Natarajan, Ajaykrishna Karthikeyan, Prateek Jain, Ivan Radicek, Sriram Rajamani, Sumit Gulwani, Johannes Gehrke

The goal of the synthesizer is to synthesize a "decision function" $f$ which transforms the features to a decision value for the black-box component so as to maximize the expected reward $E[r \circ f (x)]$ for executing decisions $f(x)$ for various values of $x$.

Program Synthesis

Information-theoretic User Interaction: Significant Inputs for Program Synthesis

no code implementations22 Jun 2020 Ashish Tiwari, Arjun Radhakrishna, Sumit Gulwani, Daniel Perelman

In the context of interactive program synthesis, we use the above result to develop an {\em{active program learner}} that generates the significant inputs to pose as queries to the user in each iteration.

Clustering Program Synthesis

Quantitative Programming by Examples

no code implementations12 Sep 2019 Sumit Gulwani, Kunal Pathak, Arjun Radhakrishna, Ashish Tiwari, Abhishek Udupa

Programming-by-Example (PBE) systems synthesize an intended program in some (relatively constrained) domain-specific language from a small number of input-output examples provided by the user.

Neural-Guided Deductive Search for Real-Time Program Synthesis from Examples

no code implementations ICLR 2018 Ashwin Kalyan, Abhishek Mohta, Oleksandr Polozov, Dhruv Batra, Prateek Jain, Sumit Gulwani

In this work, we propose Neural Guided Deductive Search (NGDS), a hybrid synthesis technique that combines the best of both symbolic logic techniques and statistical models.

Program Synthesis

FlashProfile: A Framework for Synthesizing Data Profiles

no code implementations17 Sep 2017 Saswat Padhi, Prateek Jain, Daniel Perelman, Oleksandr Polozov, Sumit Gulwani, Todd Millstein

However, manual inspection of data to identify the different formats is infeasible in standard big-data scenarios.

Clustering

Learning Syntactic Program Transformations from Examples

no code implementations31 Aug 2016 Reudismam Rolim, Gustavo Soares, Loris D'Antoni, Oleksandr Polozov, Sumit Gulwani, Rohit Gheyi, Ryo Suzuki, Bjoern Hartmann

In the second domain, we use repetitive edits applied by developers to the same project to synthesize a program transformation that applies these edits to other locations in the code.

Prutor: A System for Tutoring CS1 and Collecting Student Programs for Analysis

3 code implementations12 Aug 2016 Rajdeep Das, Umair Z. Ahmed, Amey Karkare, Sumit Gulwani

Apart from the code snapshots at regular intervals, Prutor also collects other valuable data such as the time taken by the students to solve the problems, the number of compile and execution events, and the errors made.

Computers and Society Programming Languages Software Engineering

Automated Clustering and Program Repair for Introductory Programming Assignments

3 code implementations10 Mar 2016 Sumit Gulwani, Ivan Radiček, Florian Zuleger

We obtain promising initial results (the average usefulness grade 3. 4 on a scale from 1 to 5), and conclude that our approach can be used in an interactive setting.

Programming Languages

Automatic Synthesis of Geometry Problems for an Intelligent Tutoring System

no code implementations29 Oct 2015 Chris Alvin, Sumit Gulwani, Rupak Majumdar, Supratik Mukhopadhyay

This paper presents an intelligent tutoring system, GeoTutor, for Euclidean Geometry that is automatically able to synthesize proof problems and their respective solutions given a geometric figure together with a set of properties true of it.

Automatic Generation of Alternative Starting Positions for Simple Traditional Board Games

no code implementations14 Nov 2014 Umair Z. Ahmed, Krishnendu Chatterjee, Sumit Gulwani

Simple board games, like Tic-Tac-Toe and CONNECT-4, play an important role not only in the development of mathematical and logical skills, but also in the emotional and social development.

Board Games

Automating string processing in spreadsheets using input-output examples

no code implementations POPL 2011 2011 Sumit Gulwani

We describe the design of a string programming/expression lan- guage that supports restricted forms of regular expressions, condi- tionals and loops.

Novel Concepts

Cannot find the paper you are looking for? You can Submit a new open access paper.