Search Results for author: Vu Le

Found 18 papers, 3 papers with code

Solving Data-centric Tasks using Large Language Models

no code implementations18 Feb 2024 Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams

Large language models (LLMs) are rapidly replacing help forums like StackOverflow, and are especially helpful for non-professional programmers and end users.

Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers

1 code implementation6 Nov 2023 Hai Phan, Cindy Le, Vu Le, Yihui He, Anh Totti Nguyen

DeepFace-EMD (Phan et al. 2022) reaches state-of-the-art accuracy on out-of-distribution data by first comparing two images at the image level, and then at the patch level.

Face Identification Re-Ranking

CodeFusion: A Pre-trained Diffusion Model for Code Generation

no code implementations26 Oct 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

Imagine a developer who can only change their last line of code, how often would they have to start writing a function from scratch before it is correct?

Code Generation Denoising

TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World

no code implementations26 Oct 2023 Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Ananya Singha, Mukul Singh, Gust Verbruggen

Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance.

Code Generation Sentence +2

Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

no code implementations16 Oct 2023 Ananya Singha, José Cambronero, Sumit Gulwani, Vu Le, Chris Parnin

Inspired by prior work, we generate a collection of self-supervised structural tasks (e. g. navigate to a cell and row; transpose the table) and evaluate the performance differences when using 8 formats.

In-Context Learning Navigate

DataVinci: Learning Syntactic and Semantic String Repairs

no code implementations21 Aug 2023 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

DataVinci learns regular-expression-based patterns that cover a majority of values in a column and reports values that do not satisfy such patterns as data errors.

Demonstration of CORNET: A System For Learning Spreadsheet Formatting Rules By Example

no code implementations14 Aug 2023 Mukul Singh, Jose Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

After the user provides one or two formatted cells as examples, CORNET generates formatting rule suggestions for the user to apply to the spreadsheet.

Management Program Synthesis

Efficiently-Verifiable Strong Uniquely Solvable Puzzles and Matrix Multiplication

1 code implementation12 Jul 2023 Matthew Anderson, Vu Le

We advance the Cohn-Umans framework for developing fast matrix multiplication algorithms.

From Words to Code: Harnessing Data for Program Synthesis from Natural Language

no code implementations2 May 2023 Anirudh Khatry, Joyce Cahoon, Jordan Henkel, Shaleen Deep, Venkatesh Emani, Avrilia Floratou, Sumit Gulwani, Vu Le, Mohammad Raza, Sherry Shi, Mukul Singh, Ashish Tiwari

Existing approaches have utilized data context in a limited way by simply adding relevant information from the input data into the prompts sent to the LLM.

Program Synthesis

Repairing Bugs in Python Assignments Using Large Language Models

no code implementations29 Sep 2022 Jialu Zhang, José Cambronero, Sumit Gulwani, Vu Le, Ruzica Piskac, Gustavo Soares, Gust Verbruggen

We propose to use a large language model trained on code, such as Codex, to build an APR system -- MMAPR -- for introductory Python programming assignments.

Chunking Language Modelling +2

CORNET: Learning Table Formatting Rules By Example

no code implementations11 Aug 2022 Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Mohammad Raza, Gust Verbruggen

Since we are the first to introduce conditional formatting, we compare CORNET to a wide range of symbolic and neural baselines adapted from related domains.

Program Synthesis

Neurosymbolic Repair for Low-Code Formula Languages

no code implementations24 Jul 2022 Rohan Bavishi, Harshit Joshi, José Pablo Cambronero Sánchez, Anna Fariha, Sumit Gulwani, Vu Le, Ivan Radicek, Ashish Tiwari

To address this problem, we developed LaMirage, a LAst-MIle RepAir-engine GEnerator that combines symbolic and neural techniques to perform last-mile repair in low-code formula languages.

Synchromesh: Reliable code generation from pre-trained language models

1 code implementation ICLR 2022 Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

Then, Synchromesh feeds the examples to a pre-trained language model and samples programs using Constrained Semantic Decoding (CSD): a general framework for constraining the output to a set of valid programs in the target language.

Code Generation Language Modelling +1

Multi-modal Program Inference: a Marriage of Pre-trainedLanguage Models and Component-based Synthesis

no code implementations3 Sep 2021 Kia Rahmani, Mohammad Raza, Sumit Gulwani, Vu Le, Daniel Morris, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description.

Program Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.