Search Results for author: Varun Kumar

Found 21 papers, 5 papers with code

Fewer Truncations Improve Language Modeling

no code implementations • 16 Apr 2024 • Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens.

Combinatorial Optimization Hallucination +4
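
As a rough illustration of the concatenate-then-split preprocessing this abstract describes, here is a minimal Python sketch. The function name `concat_and_chunk`, the `eos_id` separator token, and the choice to drop the trailing remainder are assumptions made for the example, not details taken from the paper.

```python
from typing import List


def concat_and_chunk(docs: List[List[int]], seq_len: int, eos_id: int = 0) -> List[List[int]]:
    """Concatenate tokenized documents, then cut the stream into fixed-length sequences."""
    stream: List[int] = []
    for doc in docs:
        stream.extend(doc)
        stream.append(eos_id)  # hypothetical separator marking a document boundary
    # Keep only full sequences so no padding tokens are needed.
    n_full = len(stream) // seq_len
    return [stream[i * seq_len:(i + 1) * seq_len] for i in range(n_full)]


# Three toy "documents" of token ids.
docs = [[11, 12, 13], [21, 22, 23, 24, 25], [31, 32]]
chunks = concat_and_chunk(docs, seq_len=4)
for chunk in chunks:
    print(chunk)
```

In this toy run the second document is split across two sequences, which is the kind of truncation the paper's title refers to.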

MyCrunchGPT: A chatGPT assisted framework for scientific machine learning

no code implementations • 27 Jun 2023 • Varun Kumar, Leonard Gleyzer, Adar Kahana, Khemraj Shukla, George Em Karniadakis

To demonstrate the flow of MyCrunchGPT and create an infrastructure that can facilitate a broader vision, we built a webapp-based guided user interface that includes options for a comprehensive summary report.

Code Generation Geophysics

A Static Evaluation of Code Completion by Large Language Models

no code implementations • 5 Jun 2023 • Hantian Ding, Varun Kumar, Yuchen Tian, Zijian Wang, Rob Kwiatkowski, Xiaopeng Li, Murali Krishna Ramanathan, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

Large language models trained on code have shown great potential to increase productivity of software developers.

Code Completion Code Generation

Real-Time Prediction of Gas Flow Dynamics in Diesel Engines using a Deep Neural Operator Framework

no code implementations • 2 Apr 2023 • Varun Kumar, Somdatta Goswami, Daniel J. Smith, George Em Karniadakis

As an alternative to physics-based models, we develop an operator-based regression model (DeepONet) to learn the relevant output states for a mean-value gas flow engine model using the engine operating conditions as input variables.

Greener yet Powerful: Taming Large Code Generation Models with Quantization

no code implementations • 9 Mar 2023 • Xiaokai Wei, Sujan Gonugondla, Wasi Ahmad, Shiqi Wang, Baishakhi Ray, Haifeng Qian, Xiaopeng Li, Varun Kumar, Zijian Wang, Yuchen Tian, Qing Sun, Ben Athiwaratkun, Mingyue Shang, Murali Krishna Ramanathan, Parminder Bhatia, Bing Xiang

Such large models incur significant resource usage (in terms of memory, latency, and dollars) as well as carbon footprint.

Code Generation Code Summarization +2

ReCode: Robustness Evaluation of Code Generation Models

2 code implementations • 20 Dec 2022 • Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang

Most existing works on robustness in text or code tasks have focused on classification, while robustness in generation tasks is an uncharted area and to date there is no comprehensive benchmark for robustness in code generation.

Code Generation

Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

no code implementations • 17 Nov 2022 • Ninareh Mehrabi, Palash Goyal, Apurv Verma, Jwala Dhamala, Varun Kumar, Qian Hu, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Rahul Gupta

Natural language often contains ambiguities that can lead to misinterpretation and miscommunication.

Common Sense Reasoning

Multi-lingual Evaluation of Code Generation Models

2 code implementations • 26 Oct 2022 • Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

Using these benchmarks, we are able to assess the performance of code generation models in a multi-lingual fashion, and we discover the generalization ability of language models on out-of-domain languages, the advantages of multi-lingual models over mono-lingual ones, the ability of few-shot prompting to teach the model new languages, and zero-shot translation abilities even in mono-lingual settings.

Code Completion Code Translation +1

An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation

no code implementations • 7 Oct 2022 • Jwala Dhamala, Varun Kumar, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

We present a systematic analysis of the impact of decoding algorithms on LM fairness, and analyze the trade-off between fairness, diversity and quality.

Fairness Text Generation

On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations

no code implementations • ACL 2022 • Yang Trista Cao, Yada Pruksachatkun, Kai-Wei Chang, Rahul Gupta, Varun Kumar, Jwala Dhamala, Aram Galstyan

Multiple metrics have been introduced to measure fairness in various natural language processing tasks.

Fairness

Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal

no code implementations • Findings (ACL) 2022 • Umang Gupta, Jwala Dhamala, Varun Kumar, Apurv Verma, Yada Pruksachatkun, Satyapriya Krishna, Rahul Gupta, Kai-Wei Chang, Greg Ver Steeg, Aram Galstyan

Language models excel at generating coherent text, and model compression techniques such as knowledge distillation have enabled their use in resource-constrained settings.

counterfactual Fairness +3

Industry Scale Semi-Supervised Learning for Natural Language Understanding

no code implementations • NAACL 2021 • Luoxin Chen, Francisco Garcia, Varun Kumar, He Xie, Jianhua Lu

This paper presents a production Semi-Supervised Learning (SSL) pipeline based on the student-teacher framework, which leverages millions of unlabeled examples to improve Natural Language Understanding (NLU) tasks.

Intent Classification +6

ProtoDA: Efficient Transfer Learning for Few-Shot Intent Classification

no code implementations • 28 Jan 2021 • Manoj Kumar, Varun Kumar, Hadrien Glaude, Cyprien de Lichy, Aman Alok, Rahul Gupta

We make use of a conditional generator for data augmentation that is trained directly using the meta-learning objective and simultaneously with prototypical networks, hence ensuring that data augmentation is customized to the task.

Classification Data Augmentation +9

BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation

1 code implementation • 27 Jan 2021 • Jwala Dhamala, Tony Sun, Varun Kumar, Satyapriya Krishna, Yada Pruksachatkun, Kai-Wei Chang, Rahul Gupta

To systematically study and benchmark social biases in open-ended language generation, we introduce the Bias in Open-Ended Language Generation Dataset (BOLD), a large-scale dataset that consists of 23,679 English text generation prompts for bias benchmarking across five domains: profession, gender, race, religion, and political ideology.

Benchmarking Text Generation

Data Augmentation using Pre-trained Transformer Models

3 code implementations • AACL (lifelongnlp) 2020 • Varun Kumar, Ashutosh Choudhary, Eunah Cho

Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks.

Data Augmentation Language Modelling

A Closer Look At Feature Space Data Augmentation For Few-Shot Intent Classification

no code implementations • WS 2019 • Varun Kumar, Hadrien Glaude, Cyprien de Lichy, William Campbell

In particular, we show that (a) upsampling in latent space is a competitive baseline for feature space augmentation, and (b) adding the difference between two examples to a new example is a simple yet effective data augmentation method.

Data Augmentation General Classification +4
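
The second finding in this abstract, adding the difference between two examples to a third, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `add_delta` and `delta_augment`, the use of plain lists as feature vectors, and sampling the three examples uniformly from one class are hypothetical choices, not the authors' implementation.

```python
import random
from typing import List

Vector = List[float]


def add_delta(a: Vector, b: Vector, c: Vector) -> Vector:
    """Return c + (a - b), computed element-wise."""
    return [ci + (ai - bi) for ai, bi, ci in zip(a, b, c)]


def delta_augment(features: List[Vector], n_new: int, seed: int = 0) -> List[Vector]:
    """Create n_new synthetic vectors from the feature vectors of a single class."""
    if len(features) < 3:
        raise ValueError("need at least three examples to sample a, b, and c")
    rng = random.Random(seed)
    out: List[Vector] = []
    for _ in range(n_new):
        a, b, c = rng.sample(features, 3)  # three distinct examples
        out.append(add_delta(a, b, c))
    return out


# Toy 2-dimensional feature vectors for one intent class.
feats = [[0.0, 1.0], [1.0, 1.0], [0.5, 2.0]]
single = add_delta([1.0, 1.0], [0.0, 1.0], [0.5, 2.0])
augmented = delta_augment(feats, n_new=4)
print(single, len(augmented))
```

The synthetic vectors would be added to the few-shot class before training a classifier on the feature space.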

Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity

no code implementations • 9 Oct 2019 • Eunah Cho, He Xie, John P. Lalor, Varun Kumar, William M. Campbell

In addition, methods optimizing diversity can reduce training data in many cases to 50% with little impact on performance.

Natural Language Understanding Task-Oriented Dialogue Systems

Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models

no code implementations • ACL 2019 • Varun Kumar, Alison Smith-Renner, Leah Findlater, Kevin Seppi, Jordan Boyd-Graber

To address the lack of comparative evaluation of Human-in-the-Loop Topic Modeling (HLTM) systems, we implement and evaluate three contrasting HLTM modeling approaches using simulation experiments.

Topic Models

SpaceNet MVOI: a Multi-View Overhead Imagery Dataset

no code implementations • ICCV 2019 • Nicholas Weir, David Lindenbaum, Alexei Bastidas, Adam Van Etten, Sean McPherson, Jacob Shermeyer, Varun Kumar, Hanlin Tang

To address this problem, we present an open source Multi-View Overhead Imagery dataset, termed SpaceNet MVOI, with 27 unique looks from a broad range of viewing angles (-32.5 degrees to 54.0 degrees).

Object Detection +1

Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning

1 code implementation • 24 Jan 2018 • Scott Cyphers, Arjun K. Bansal, Anahita Bhiwandiwalla, Jayaram Bobba, Matthew Brookhart, Avijit Chakraborty, Will Constable, Christian Convey, Leona Cook, Omar Kanawi, Robert Kimball, Jason Knight, Nikolay Korovaiko, Varun Kumar, Yixing Lao, Christopher R. Lishka, Jaikrishnan Menon, Jennifer Myers, Sandeep Aswath Narayana, Adam Procter, Tristan J. Webb

The current approach, which we call "direct optimization", requires deep changes within each framework to improve the training performance for each hardware backend (CPUs, GPUs, FPGAs, ASICs) and requires $\mathcal{O}(fp)$ effort; where $f$ is the number of frameworks and $p$ is the number of platforms.

graph partitioning Management +1

The GW/UMD CLPsych 2016 Shared Task System

no code implementations • WS 2016 • Ayah Zirikly, Varun Kumar, Philip Resnik

Lemmatization
