Search Results for author: Kai-Wei Chang

Found 231 papers, 130 papers with code

Co-training Embeddings of Knowledge Graphs and Entity Descriptions for Cross-lingual Entity Alignment

no code implementations • 18 Jun 2018 • Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo

Since many multilingual KGs also provide literal descriptions of entities, in this paper, we introduce an embedding-based approach which leverages a weakly aligned multilingual KG for semi-supervised cross-lingual learning using entity descriptions.

Entity Alignment Knowledge Graphs

Paper
Add Code

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

no code implementations • 21 Apr 2018 • Wasi Uddin Ahmad, Xueying Bai, Zhechao Huang, Chao Jiang, Nanyun Peng, Kai-Wei Chang

Learning distributed sentence representations is one of the key challenges in natural language processing.

Multi-Task Learning Natural Language Inference +2

Paper
Add Code

Counterexamples for Robotic Planning Explained in Structured Language

no code implementations • 23 Mar 2018 • Lu Feng, Mahsa Ghasemi, Kai-Wei Chang, Ufuk Topcu

Automated techniques such as model checking have been used to verify models of robotic mission plans based on Markov decision processes (MDPs) and generate counterexamples that may help diagnose requirement violations.

Paper
Add Code

Beyond Bilingual: Multi-sense Word Embeddings using Multilingual Context

no code implementations • WS 2017 • Shyam Upadhyay, Kai-Wei Chang, Matt Taddy, Adam Kalai, James Zou

We present a multi-view Bayesian non-parametric algorithm which improves multi-sense word embeddings by (a) using multilingual (i. e., more than two languages) corpora to significantly improve sense embeddings beyond what one achieves with bilingual information, and (b) uses a principled approach to learn a variable number of senses per word, in a data-driven manner.

Representation Learning Word Embeddings

Paper
Add Code

Quantifying and Reducing Stereotypes in Word Embeddings

no code implementations • 20 Jun 2016 • Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai

Machine learning algorithms are optimized to model statistical properties of the training data.

Word Embeddings

Paper
Add Code

Resource Constrained Structured Prediction

no code implementations • 28 Feb 2016 • Tolga Bolukbasi, Kai-Wei Chang, Joseph Wang, Venkatesh Saligrama

We study the problem of structured prediction under test-time budget constraints.

Dependency Parsing Optical Character Recognition +2

Paper
Add Code

A Credit Assignment Compiler for Joint Prediction

no code implementations • NeurIPS 2016 • Kai-Wei Chang, He He, Hal Daumé III, John Langford, Stephane Ross

Many machine learning applications involve jointly predicting multiple mutually dependent output variables.

Paper
Add Code

Distributed Training of Structured SVM

no code implementations • 8 Jun 2015 • Ching-pei Lee, Kai-Wei Chang, Shyam Upadhyay, Dan Roth

Training structured prediction models is time-consuming.

Structured Prediction

Paper
Add Code

IllinoisSL: A JAVA Library for Structured Prediction

no code implementations • 23 Sep 2015 • Kai-Wei Chang, Shyam Upadhyay, Ming-Wei Chang, Vivek Srikumar, Dan Roth

IllinoisSL is a Java library for learning structured prediction models.

Structured Prediction

Paper
Add Code

Learning to Search Better Than Your Teacher

no code implementations • 8 Feb 2015 • Kai-Wei Chang, Akshay Krishnamurthy, Alekh Agarwal, Hal Daumé III, John Langford

Methods for learning to search for structured prediction typically imitate a reference policy, with existing theoretical guarantees demonstrating low regret compared to that reference.

Multi-Armed Bandits Structured Prediction

Paper
Add Code

Learning to Search for Dependencies

no code implementations • 18 Mar 2015 • Kai-Wei Chang, He He, Hal Daumé III, John Langford

We demonstrate that a dependency parser can be built using a credit assignment compiler which removes the burden of worrying about low-level machine learning details from the parser implementation.

BIG-bench Machine Learning

Paper
Add Code

Quantification and Analysis of Scientific Language Variation Across Research Fields

no code implementations • 4 Dec 2018 • Pei Zhou, Muhao Chen, Kai-Wei Chang, Carlo Zaniolo

Quantifying differences in terminologies from various academic domains has been a longstanding problem yet to be solved.

Language Modelling

Paper
Add Code

Learning Word Embeddings for Low-Resource Languages by PU Learning

no code implementations • NAACL 2018 • Chao Jiang, Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang

In such a situation, the co-occurrence matrix is sparse as the co-occurrences of many word pairs are unobserved.

Document Ranking Image Captioning +4

Paper
Add Code

Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems

no code implementations • EMNLP 2016 • Shyam Upadhyay, Ming-Wei Chang, Kai-Wei Chang, Wen-tau Yih

Ranked #1 on Math Word Problem Solving on ALG514

Math Word Problem Solving

Paper
Add Code

A Corpus to Learn Refer-to-as Relations for Nominals

no code implementations • LREC 2018 • Wasi Ahmad, Kai-Wei Chang

Coreference Resolution Learning Semantic Representations +2

Paper
Add Code

A Corpus of Drug Usage Guidelines Annotated with Type of Advice

no code implementations • LREC 2018 • Sarah Masud Preum, Md. Rizwan Parvez, Kai-Wei Chang, John Stankovic

Vocal Bursts Type Prediction

Paper
Add Code

Hands-on Learning to Search for Structured Prediction

no code implementations • HLT 2015 • Hal Daumé III, John Langford, Kai-Wei Chang, He He, Sudha Rao

Decision Making Dependency Parsing +2

Paper
Add Code

Typed Tensor Decomposition of Knowledge Bases for Relation Extraction

no code implementations • EMNLP 2014 • Kai-Wei Chang, Wen-tau Yih, Bishan Yang, Christopher Meek

Relation Relation Extraction +1

Paper
Add Code

A Constrained Latent Variable Model for Coreference Resolution

no code implementations • EMNLP 2013 • Kai-Wei Chang, Rajhans Samdani, Dan Roth

coreference-resolution Structured Prediction

Paper
Add Code

Multi-Relational Latent Semantic Analysis

no code implementations • EMNLP 2013 • Kai-Wei Chang, Wen-tau Yih, Christopher Meek

Word Sense Disambiguation

Paper
Add Code

A Joint Framework for Coreference Resolution and Mention Head Detection

no code implementations • CONLL 2015 • Haoruo Peng, Kai-Wei Chang, Dan Roth

Clustering coreference-resolution +1

Paper
Add Code

The Illinois-Columbia System in the CoNLL-2014 Shared Task

no code implementations • WS 2014 • Alla Rozovskaya, Kai-Wei Chang, Mark Sammons, Dan Roth, Nizar Habash

Grammatical Error Correction

Paper
Add Code

The University of Illinois System in the CoNLL-2013 Shared Task

no code implementations • WS 2013 • Alla Rozovskaya, Kai-Wei Chang, Mark Sammons, Dan Roth

Paper
Add Code

Illinois-Coref: The UI System in the CoNLL-2012 Shared Task

no code implementations • WS 2012 • Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, Dan Roth

Coreference Resolution Named Entity Recognition (NER)

Paper
Add Code

Efficient Contextual Representation Learning Without Softmax Layer

no code implementations • 28 Feb 2019 • Liunian Harold Li, Patrick H. Chen, Cho-Jui Hsieh, Kai-Wei Chang

Our framework reduces the time spent on the output layer to a negligible level, eliminates almost all the trainable parameters of the softmax layer and performs language modeling without truncating the vocabulary.

Dimensionality Reduction Language Modelling +2

Paper
Add Code

Dynamically Expanded CNN Array for Video Coding

no code implementations • 10 May 2019 • Everett Fall, Kai-Wei Chang, Liang-Gee Chen

Marked progress has been made in video quality, compression, and computational efficiency.

Computational Efficiency

Paper
Add Code

Pre-Training Graph Neural Networks for Generic Structural Feature Extraction

no code implementations • 31 May 2019 • Ziniu Hu, Changjun Fan, Ting Chen, Kai-Wei Chang, Yizhou Sun

With the proposed pre-training procedure, the generic structural information is learned and preserved, thus the pre-trained GNN requires less amount of labeled data and fewer domain-specific features to achieve high performance on different downstream tasks.

Denoising

Paper
Add Code

Learning Bilingual Word Embeddings Using Lexical Definitions

no code implementations • WS 2019 • Weijia Shi, Muhao Chen, Yingtao Tian, Kai-Wei Chang

Bilingual word embeddings, which representlexicons of different languages in a shared em-bedding space, are essential for supporting se-mantic and knowledge transfers in a variety ofcross-lingual NLP tasks.

Translation Word Alignment +1

Paper
Add Code

BOSH: An Efficient Meta Algorithm for Decision-based Attacks

no code implementations • 10 Sep 2019 • Zhenxin Xiao, Puyudi Yang, Yuchen Jiang, Kai-Wei Chang, Cho-Jui Hsieh

Adversarial example generation becomes a viable method for evaluating the robustness of a machine learning model.

Adversarial Attack Bayesian Optimization

Paper
Add Code

Retrofitting Contextualized Word Embeddings with Paraphrases

no code implementations • IJCNLP 2019 • Weijia Shi, Muhao Chen, Pei Zhou, Kai-Wei Chang

Contextualized word embedding models, such as ELMo, generate meaningful representations of words and their context.

Sentence Sentence Classification +1

Paper
Add Code

``The Boating Store Had Its Best Sail Ever'': Pronunciation-attentive Contextualized Pun Recognition

no code implementations • ACL 2020 • Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, Wei Wang

In this paper, we propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor, detect if a sentence contains puns and locate them in the sentence.

Sentence

Paper
Add Code

What Does BERT with Vision Look At?

no code implementations • ACL 2020 • Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang

Pre-trained visually grounded language models such as ViLBERT, LXMERT, and UNITER have achieved significant performance improvement on vision-and-language tasks but what they learn during pre-training remains unclear.

Language Modelling

Paper
Add Code

Efficient Contextual Representation Learning With Continuous Outputs

no code implementations • TACL 2019 • Liunian Harold Li, Patrick H. Chen, Cho-Jui Hsieh, Kai-Wei Chang

Contextual representation models have achieved great success in improving various downstream natural language processing tasks.

Language Modelling Representation Learning +1

Paper
Add Code

Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention

no code implementations • ACL 2021 • Wasi Uddin Ahmad, Xiao Bai, Soomin Lee, Kai-Wei Chang

Natural language processing techniques have demonstrated promising results in keyphrase generation.

Keyphrase Extraction Keyphrase Generation

Paper
Add Code

On the Transferability of Adversarial Attacksagainst Neural Text Classifier

no code implementations • 17 Nov 2020 • Liping Yuan, Xiaoqing Zheng, Yi Zhou, Cho-Jui Hsieh, Kai-Wei Chang

Based on these studies, we propose a genetic algorithm to find an ensemble of models that can be used to induce adversarial examples to fool almost all existing models.

text-classification Text Classification

Paper
Add Code

CREATe: Clinical Report Extraction and Annotation Technology

no code implementations • 28 Feb 2021 • Yichao Zhou, Wei-Ting Chen, BoWen Zhang, David Lee, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang

Clinical case reports are written descriptions of the unique aspects of a particular clinical case, playing an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies.

Paper
Add Code

Leveraging Unlabeled Data for Entity-Relation Extraction through Probabilistic Constraint Satisfaction

no code implementations • 20 Mar 2021 • Kareem Ahmed, Eric Wang, Guy Van Den Broeck, Kai-Wei Chang

We study the problem of entity-relation extraction in the presence of symbolic domain knowledge.

Relation Relation Extraction +1

Paper
Add Code

Adapting Coreference Resolution for Processing Violent Death Narratives

no code implementations • NAACL 2021 • Ankith Uppunda, Susan D. Cochran, Jacob G. Foster, Alina Arseniev-Koehler, Vickie M. Mays, Kai-Wei Chang

Coreference resolution is an important component in analyzing narrative text from administrative data (e. g., clinical or police sources).

coreference-resolution Data Augmentation

Paper
Add Code

``Nice Try, Kiddo'': Investigating Ad Hominems in Dialogue Responses

no code implementations • NAACL 2021 • Emily Sheng, Kai-Wei Chang, Prem Natarajan, Nanyun Peng

Ad hominem attacks are those that target some feature of a person{'}s character instead of the position the person is maintaining.

Abusive Language

Paper
Add Code

Cross-Lingual Dependency Parsing by POS-Guided Word Reordering

no code implementations • Findings of the Association for Computational Linguistics 2020 • Lu Liu, Yi Zhou, Jianhan Xu, Xiaoqing Zheng, Kai-Wei Chang, Xuanjing Huang

The words in each sentence of a source language corpus are rearranged to meet the word order in a target language under the guidance of a part-of-speech based language model (LM).

Dependency Parsing Language Modelling +2

Paper
Add Code

Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification

no code implementations • Findings (ACL) 2021 • Yada Pruksachatkun, Satyapriya Krishna, Jwala Dhamala, Rahul Gupta, Kai-Wei Chang

Existing bias mitigation methods to reduce disparities in model outcomes across cohorts have focused on data augmentation, debiasing model embeddings, or adding fairness-based optimization objectives during training.

Data Augmentation Fairness +2

Paper
Add Code

Clinical Named Entity Recognition using Contextualized Token Representations

no code implementations • 23 Jun 2021 • Yichao Zhou, Chelsea Ju, J. Harry Caufield, Kevin Shih, Calvin Chen, Yizhou Sun, Kai-Wei Chang, Peipei Ping, Wei Wang

To facilitate various downstream applications using clinical case reports (CCRs), we pre-train two deep contextualized language models, Clinical Embeddings from Language Model (C-ELMo) and Clinical Contextual String Embeddings (C-Flair) using the clinical-related corpus from the PubMed Central.

Language Modelling named-entity-recognition +3

Paper
Add Code

On Measures of Biases and Harms in NLP

no code implementations • 7 Aug 2021 • Sunipa Dev, Emily Sheng, Jieyu Zhao, Aubrie Amstutz, Jiao Sun, Yu Hou, Mattie Sanseverino, Jiin Kim, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang

Recent studies show that Natural Language Processing (NLP) technologies propagate societal biases about demographic groups associated with attributes such as gender, race, and nationality.

Paper
Add Code

Relation-Guided Pre-Training for Open-Domain Question Answering

no code implementations • Findings (EMNLP) 2021 • Ziniu Hu, Yizhou Sun, Kai-Wei Chang

Answering complex open-domain questions requires understanding the latent relations between involving entities.

Natural Questions Open-Domain Question Answering +2

Paper
Add Code

Toward Degradation-Robust Voice Conversion

no code implementations • 14 Oct 2021 • Chien-yu Huang, Kai-Wei Chang, Hung-Yi Lee

However, in real-world scenarios, it is difficult to collect clean utterances of a speaker, and they are usually degraded by noises or reverberations.

Denoising Speech Enhancement +1

Paper
Add Code

On the Transferability of Adversarial Attacks against Neural Text Classifier

no code implementations • EMNLP 2021 • Liping Yuan, Xiaoqing Zheng, Yi Zhou, Cho-Jui Hsieh, Kai-Wei Chang

Based on these studies, we propose a genetic algorithm to find an ensemble of models that can be used to induce adversarial examples to fool almost all existing models.

text-classification Text Classification

Paper
Add Code

Robustness and Adversarial Examples in Natural Language Processing

no code implementations • EMNLP (ACL) 2021 • Kai-Wei Chang, He He, Robin Jia, Sameer Singh

In particular, we will review recent studies on analyzing the weakness of NLP systems when facing adversarial inputs and data with a distribution shift.

Paper
Add Code

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

no code implementations • 16 Dec 2021 • Zhecan Wang, Haoxuan You, Liunian Harold Li, Alireza Zareian, Suji Park, Yiqing Liang, Kai-Wei Chang, Shih-Fu Chang

As for pre-training, a scene-graph-aware pre-training method is proposed to leverage structure knowledge extracted in the visual scene graph.

Visual Commonsense Reasoning

Paper
Add Code

Neuro-Symbolic Entropy Regularization

no code implementations • 25 Jan 2022 • Kareem Ahmed, Eric Wang, Kai-Wei Chang, Guy Van Den Broeck

We propose a loss, neuro-symbolic entropy regularization, that encourages the model to confidently predict a valid object.

Structured Prediction valid

Paper
Add Code

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models

no code implementations • 17 Feb 2022 • Da Yin, Li Dong, Hao Cheng, Xiaodong Liu, Kai-Wei Chang, Furu Wei, Jianfeng Gao

With the increasing of model capacity brought by pre-trained language models, there emerges boosting needs for more knowledgeable natural language processing (NLP) models with advanced functionalities including providing and making flexible use of encyclopedic and commonsense knowledge.

Language Modelling

Paper
Add Code

Measuring Fairness of Text Classifiers via Prediction Sensitivity

no code implementations • ACL 2022 • Satyapriya Krishna, Rahul Gupta, Apurv Verma, Jwala Dhamala, Yada Pruksachatkun, Kai-Wei Chang

With the rapid growth in language processing applications, fairness has emerged as an important consideration in data-driven solutions.

Attribute counterfactual +3

Paper
Add Code

Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal

no code implementations • Findings (ACL) 2022 • Umang Gupta, Jwala Dhamala, Varun Kumar, Apurv Verma, Yada Pruksachatkun, Satyapriya Krishna, Rahul Gupta, Kai-Wei Chang, Greg Ver Steeg, Aram Galstyan

Language models excel at generating coherent text, and model compression techniques such as knowledge distillation have enabled their use in resource-constrained settings.

counterfactual Fairness +3

Paper
Add Code

On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations

no code implementations • ACL 2022 • Yang Trista Cao, Yada Pruksachatkun, Kai-Wei Chang, Rahul Gupta, Varun Kumar, Jwala Dhamala, Aram Galstyan

Multiple metrics have been introduced to measure fairness in various natural language processing tasks.

Fairness

Paper
Add Code

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies

no code implementations • 19 Apr 2022 • Md Rizwan Parvez, Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian, Kai-Wei Chang

Prior studies in privacy policies frame the question answering (QA) task as identifying the most relevant text segment or a list of sentences from a policy document given a user query.

Data Augmentation Question Answering +1

Paper
Add Code

Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks

no code implementations • 22 Apr 2022 • Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Xiyang Dai, Bin Xiao, Jianwei Yang, Haoxuan You, Kai-Wei Chang, Shih-Fu Chang, Lu Yuan

Experiments demonstrate that MAD leads to consistent gains in the low-shot, domain-shifted, and fully-supervised conditions on VCR, SNLI-VE, and VQA, achieving SOTA performance on VCR compared to other single models pretrained with image-text data.

Ranked #4 on Visual Question Answering (VQA) on VCR (Q-A) test

Question Answering Visual Commonsense Reasoning +2

Paper
Add Code

Towards Adversarially Robust Text Classifiers by Learning to Reweight Clean Examples

no code implementations • Findings (ACL) 2022 • Jianhan Xu, Cenyuan Zhang, Xiaoqing Zheng, Linyang Li, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang

Most of the existing defense methods improve the adversarial robustness by making the models adapt to the training set augmented with some adversarial examples.

Adversarial Robustness

Paper
Add Code

DisinfoMeme: A Multimodal Dataset for Detecting Meme Intentionally Spreading Out Disinformation

no code implementations • 25 May 2022 • Jingnong Qu, Liunian Harold Li, Jieyu Zhao, Sunipa Dev, Kai-Wei Chang

Disinformation has become a serious problem on social media.

Multimodal Reasoning Optical Character Recognition (OCR)

Paper
Add Code

Using Item Response Theory to Measure Gender and Racial Bias of a BERT-based Automated English Speech Assessment System

no code implementations • NAACL (BEA) 2022 • Alexander Kwako, Yixin Wan, Jieyu Zhao, Kai-Wei Chang, Li Cai, Mark Hansen

This study addresses the need to examine potential biases of transformer-based models in the context of automated English speech assessment.

Paper
Add Code

An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation

no code implementations • 7 Oct 2022 • Jwala Dhamala, Varun Kumar, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

We present a systematic analysis of the impact of decoding algorithms on LM fairness, and analyze the trade-off between fairness, diversity and quality.

Fairness Text Generation

Paper
Add Code

Watermarking Pre-trained Language Models with Backdooring

no code implementations • 14 Oct 2022 • Chenxi Gu, Chengsong Huang, Xiaoqing Zheng, Kai-Wei Chang, Cho-Jui Hsieh

Large pre-trained language models (PLMs) have proven to be a crucial component of modern natural language processing systems.

Multi-Task Learning

Paper
Add Code

SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning

no code implementations • 16 Oct 2022 • Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-Yi Lee

We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised speech representation for better performance, generalization, and efficiency.

Audio Generation Representation Learning +2

Paper
Add Code

The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

1 code implementation • 18 Oct 2022 • Nikil Roashan Selvam, Sunipa Dev, Daniel Khashabi, Tushar Khot, Kai-Wei Chang

How reliably can we trust the scores obtained from social bias benchmarks as faithful indicators of problematic social biases in a given language model?

Language Modelling

Paper
Code

Investigating Ensemble Methods for Model Robustness Improvement of Text Classifiers

no code implementations • 28 Oct 2022 • Jieyu Zhao, Xuezhi Wang, Yao Qin, Jilin Chen, Kai-Wei Chang

Large pre-trained language models have shown remarkable performance over the past few years.

Paper
Add Code

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

no code implementations • 2 Nov 2022 • Kuan-Hao Huang, Varun Iyer, Anoop Kumar, Sriram Venkatapathy, Kai-Wei Chang, Aram Galstyan

In this paper, we demonstrate that leveraging Abstract Meaning Representations (AMR) can greatly improve the performance of unsupervised syntactically controlled paraphrase generation.

Data Augmentation Paraphrase Generation +1

Paper
Add Code

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

no code implementations • 10 Nov 2022 • Zhecan Wang, Haoxuan You, Yicheng He, Wenhao Li, Kai-Wei Chang, Shih-Fu Chang

Visual commonsense understanding requires Vision Language (VL) models to not only understand image and text but also cross-reference in-between to fully integrate and achieve comprehension of the visual scene described.

Paper
Add Code

Empowering Language Models with Knowledge Graph Reasoning for Question Answering

no code implementations • 15 Nov 2022 • Ziniu Hu, Yichong Xu, Wenhao Yu, Shuohang Wang, ZiYi Yang, Chenguang Zhu, Kai-Wei Chang, Yizhou Sun

Answering open-domain questions requires world knowledge about in-context entities.

Knowledge Graphs Language Modelling +2

Paper
Add Code

Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN

no code implementations • 16 Nov 2022 • Anaelia Ovalle, Sunipa Dev, Jieyu Zhao, Majid Sarrafzadeh, Kai-Wei Chang

Therefore, ML auditing tools must be (1) better aligned with ML4H auditing principles and (2) able to illuminate and characterize communities vulnerable to the most harm.

Bias Detection Clustering +1

Paper
Add Code

Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

no code implementations • 17 Nov 2022 • Ninareh Mehrabi, Palash Goyal, Apurv Verma, Jwala Dhamala, Varun Kumar, Qian Hu, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Rahul Gupta

Natural language often contains ambiguities that can lead to misinterpretation and miscommunication.

Common Sense Reasoning

Paper
Add Code

Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding

no code implementations • 14 Dec 2022 • Haoxuan You, Rui Sun, Zhecan Wang, Kai-Wei Chang, Shih-Fu Chang

We present a new commonsense task, Human-centric Commonsense Grounding, that tests the models' ability to ground individuals given the context descriptions about what happened before, and their mental/physical states or intentions.

Paper
Add Code

GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods

no code implementations • CVPR 2023 • Da Yin, Feng Gao, Govind Thattai, Michael Johnston, Kai-Wei Chang

A key goal for the advancement of AI is to develop technologies that serve the needs not just of one group but of all communities regardless of their geographical region.

Paper
Add Code

Ensemble knowledge distillation of self-supervised speech models

no code implementations • 24 Feb 2023 • Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-Yi Lee

We tried two different aggregation techniques, layerwise-average and layerwise-concatenation, to the representations of different teacher models and found that the former was more effective.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Semantic Strengthening of Neuro-Symbolic Learning

no code implementations • 28 Feb 2023 • Kareem Ahmed, Kai-Wei Chang, Guy Van Den Broeck

Numerous neuro-symbolic approaches have recently been proposed typically with the goal of adding symbolic knowledge to the output layer of a neural network.

Paper
Add Code

SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

no code implementations • 1 Mar 2023 • Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, Wei-Cheng Tseng, Shang-Wen Li, Hung-Yi Lee

For speech processing, SpeechPrompt shows its high parameter efficiency and competitive performance on a few speech classification tasks.

Ranked #17 on Spoken Language Understanding on Fluent Speech Commands (using extra training data)

Classification Language Modelling +1

Paper
Add Code

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

no code implementations • 23 Mar 2023 • Haoxuan You, Mandy Guo, Zhecan Wang, Kai-Wei Chang, Jason Baldridge, Jiahui Yu

The field of vision and language has witnessed a proliferation of pre-trained foundation models.

Retrieval Text Generation +2

Paper
Add Code

Factoring the Matrix of Domination: A Critical Review and Reimagination of Intersectionality in AI Fairness

no code implementations • 16 Mar 2023 • Anaelia Ovalle, Arjun Subramonian, Vagrant Gautam, Gilbert Gee, Kai-Wei Chang

Through a critical review of how intersectionality is discussed in 30 papers from the AI fairness literature, we deductively and inductively: 1) map how intersectionality tenets operate within the AI fairness paradigm and 2) uncover gaps between the conceptualization and operationalization of intersectionality.

Fairness

Paper
Add Code

"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

no code implementations • 17 May 2023 • Anaelia Ovalle, Palash Goyal, Jwala Dhamala, Zachary Jaggers, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life.

Fairness Text Generation

Paper
Add Code

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis

1 code implementation • 23 May 2023 • Oscar Chew, Hsuan-Tien Lin, Kai-Wei Chang, Kuan-Hao Huang

Recent research has revealed that machine learning models have a tendency to leverage spurious correlations that exist in the training set but may not hold true in general circumstances.

text-classification Text Classification

Paper
Code

PIP: Parse-Instructed Prefix for Syntactically Controlled Paraphrase Generation

1 code implementation • 26 May 2023 • Yixin Wan, Kuan-Hao Huang, Kai-Wei Chang

Existing fine-tuning methods for this task are costly as all the parameters of the model need to be updated during the training process.

Paraphrase Generation

Paper
Code

ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression

no code implementations • 26 May 2023 • Yixin Wan, Yuan Zhou, Xiulian Peng, Kai-Wei Chang, Yan Lu

To begin with, we are among the first to comprehensively investigate mainstream KD techniques on DNS models to resolve the two challenges.

Knowledge Distillation

Paper
Add Code

MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

no code implementations • 2 Jun 2023 • Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin F. Yang, Kai-Wei Chang

In this paper, we study an interesting hypothesis: can we transfer the in-context learning ability from the language domain to VL domain?

In-Context Learning Language Modelling +1

Paper
Add Code

SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts

no code implementations • 3 Jun 2023 • Haibin Wu, Kai-Wei Chang, Yuan-Kuei Wu, Hung-Yi Lee

In this paper, we present pioneering research that explores the application of prompt tuning to stimulate speech LMs for various generation tasks, within a unified framework called SpeechGen, with around 10M trainable parameters.

Open-Ended Question Answering

Paper
Add Code

ChatGPT for Us: Preserving Data Privacy in ChatGPT via Dialogue Text Ambiguation to Expand Mental Health Care Delivery

no code implementations • 19 May 2023 • Anaelia Ovalle, Mehrab Beikzadeh, Parshan Teimouri, Kai-Wei Chang, Majid Sarrafzadeh

Large language models have been useful in expanding mental health care delivery.

Paper
Add Code

FLIRT: Feedback Loop In-context Red Teaming

no code implementations • 8 Aug 2023 • Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

Here we propose an automatic red teaming framework that evaluates a given model and exposes its vulnerabilities against unsafe and inappropriate content generation.

In-Context Learning Response Generation

Paper
Add Code

Contextual Label Projection for Cross-Lingual Structured Prediction

1 code implementation • 16 Sep 2023 • Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

Label projection, which involves obtaining translated labels and texts jointly, is essential for leveraging machine translation to facilitate cross-lingual transfer in structured prediction tasks.

Event Argument Extraction Machine Translation +6

Paper
Code

Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer

no code implementations • 19 Sep 2023 • Fei Wang, Kuan-Hao Huang, Kai-Wei Chang, Muhao Chen

In this paper, we propose a simple yet effective method, SALT, to improve the zero-shot cross-lingual transfer of the multilingual pretrained language models without the help of such external data.

Multilingual NLP Zero-Shot Cross-Lingual Transfer

Paper
Add Code

Towards General-Purpose Text-Instruction-Guided Voice Conversion

no code implementations • 25 Sep 2023 • Chun-Yi Kuan, Chen An Li, Tsu-Yuan Hsu, Tse-Yang Lin, Ho-Lam Chung, Kai-Wei Chang, Shuo-Yiin Chang, Hung-Yi Lee

This paper introduces a novel voice conversion (VC) model, guided by text instructions such as "articulate slowly with a deep tone" or "speak in a cheerful boyish voice".

Language Modelling Specificity +1

Paper
Add Code

Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model

no code implementations • 4 Oct 2023 • Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-yu Huang, Shang-Wen Li, Hung-Yi Lee

Notably, in the low-resource scenario, prompting consistently outperforms adapter tuning.

Cross-Lingual ASR slot-filling +1

Paper
Add Code

Mitigating Bias for Question Answering Models by Tracking Bias Influence

no code implementations • 13 Oct 2023 • Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng

Based on the intuition that a model would lean to be more biased if it learns from a biased example, we measure the bias level of a query instance by observing its influence on another instance.

Multiple-choice Multi-Task Learning +1

Paper
Add Code

Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts

no code implementations • 16 Oct 2023 • Christina Chance, Da Yin, Dakuo Wang, Kai-Wei Chang

Using counterfactual data augmentation to the FairytaleQA dataset, we evaluate model robustness against swapped gender character information, and then mitigate learned biases by introducing counterfactual gender stereotypes during training time.

counterfactual Data Augmentation +1

Paper
Add Code

An Exploration of In-Context Learning for Speech Language Model

no code implementations • 19 Oct 2023 • Ming-Hao Hsu, Kai-Wei Chang, Shang-Wen Li, Hung-Yi Lee

Despite the success of ICL in NLP, little work is exploring the possibility of ICL in speech processing.

Few-Shot Learning In-Context Learning +1

Paper
Add Code

Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond

no code implementations • 23 Oct 2023 • Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

Vision-language (VL) understanding tasks evaluate models' comprehension of complex visual scenes through multiple-choice questions.

counterfactual Multiple-choice +2

Paper
Add Code

On the steerability of large language models toward data-driven personas

no code implementations • 8 Nov 2023 • Junyi Li, Ninareh Mehrabi, Charith Peris, Palash Goyal, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

Large language models (LLMs) are known to generate biased responses where the opinions of certain groups and populations are underrepresented.

Collaborative Filtering Language Modelling +1

Paper
Add Code

JAB: Joint Adversarial Prompting and Belief Augmentation

no code implementations • 16 Nov 2023 • Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Jwala Dhamala, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

With the recent surge of language models in different applications, attention to safety and robustness of these models has gained significant importance.

Paper
Add Code

A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints

no code implementations • NeurIPS 2023 • Kareem Ahmed, Kai-Wei Chang, Guy Van Den Broeck

Under such distributions, computing the likelihood of even simple constraints is #P-hard.

Paper
Add Code

Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies

no code implementations • 19 Dec 2023 • Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta

Our paper is the first to link LLM misgendering to tokenization and deficient neopronoun grammar, indicating that LLMs unable to correctly treat neopronouns as pronouns are more prone to misgender.

Paper
Add Code

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

no code implementations • 24 Jan 2024 • Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang, Nanyun Peng

Our findings reveal a significant performance gap of 30. 8% between the best-performing LMM, GPT-4V(ision), and human capabilities using human evaluation indicating substantial room for improvement in context-sensitive text-rich visual reasoning.

Visual Reasoning

Paper
Add Code

The Male CEO and the Female Assistant: Probing Gender Biases in Text-To-Image Models Through Paired Stereotype Test

no code implementations • 16 Feb 2024 • Yixin Wan, Kai-Wei Chang

Recent large-scale Text-To-Image (T2I) models such as DALLE-3 demonstrate great potential in new applications, but also face unprecedented fairness challenges.

Fairness Image Generation

Paper
Add Code

Towards audio language modeling - an overview

no code implementations • 20 Feb 2024 • Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-Wei Chang, Ho-Lam Chung, Alexander H. Liu, Hung-Yi Lee

Neural audio codecs are initially introduced to compress audio data into compact codes to reduce transmission latency.

Language Modelling

Paper
Add Code

Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension

no code implementations • 28 Feb 2024 • Fan Yin, Jayanth Srinivasa, Kai-Wei Chang

We study how to characterize and predict the truthfulness of texts generated from large language models (LLMs), which serves as a crucial step in building trust between humans and LLMs.

Language Modelling Large Language Model +1

Paper
Add Code

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

no code implementations • 21 Mar 2024 • Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

To this end, we introduce MathVerse, an all-around visual math benchmark designed for an equitable and in-depth evaluation of MLLMs.

Math Mathematical Reasoning

Paper
Add Code

Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

no code implementations • 1 Apr 2024 • Yixin Wan, Arjun Subramonian, Anaelia Ovalle, Zongyu Lin, Ashima Suvarna, Christina Chance, Hritik Bansal, Rebecca Pattichis, Kai-Wei Chang

In this survey, we review prior studies on dimensions of bias: Gender, Skintone, and Geo-Culture.

Text-to-Image Generation

Paper
Add Code

Event Detection from Social Media for Epidemic Prediction

1 code implementation • 2 Apr 2024 • Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, Kai-Wei Chang

In our work, we pioneer exploiting Event Detection (ED) for better preparedness and early warnings of any upcoming epidemic by developing a framework to extract and analyze epidemic-related events from social media posts.

Event Detection

Paper
Code

Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought

no code implementations • 4 Apr 2024 • Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut, Kai-Wei Chang, Chengwei Su

The Frozen large LM is then prompted to predict a task output based on the rationale generated by the lightweight LM.

Extractive Question-Answering Knowledge Distillation +3

Paper
Add Code

GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling

no code implementations • 7 Apr 2024 • Hritik Bansal, Po-Nien Kung, P. Jeffrey Brantingham, Kai-Wei Chang, Nanyun Peng

In this paper, we propose GenEARL, a training-free generative framework that harness the power of the modern generative models to understand event task descriptions given image contexts to perform the EARL task.

Language Modelling Large Language Model +1

Paper
Add Code

LLMs in Biomedicine: A study on clinical Named Entity Recognition

no code implementations • 10 Apr 2024 • Masoud Monajatipoor, Jiaxin Yang, Joel Stremmel, Melika Emami, Fazlolah Mohaghegh, Mozhdeh Rouhsedaghat, Kai-Wei Chang

Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency

no code implementations • 16 Apr 2024 • Yixin Wan, Kai-Wei Chang

Social biases can manifest in language agency.

Language Modelling Large Language Model +1

Paper
Add Code

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

1 code implementation • 10 Aug 2021 • Masoud Monajatipoor, Mozhdeh Rouhsedaghat, Liunian Harold Li, Aichi Chien, C. -C. Jay Kuo, Fabien Scalzo, Kai-Wei Chang

Vision-and-language(V&L) models take image and text as input and learn to capture the associations between them.

Language Modelling Question Answering +1

Paper
Code

Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies

1 code implementation • EMNLP 2021 • Sunipa Dev, Masoud Monajatipoor, Anaelia Ovalle, Arjun Subramonian, Jeff M Phillips, Kai-Wei Chang

Gender is widely discussed in the context of language tasks and when examining the stereotypes propagated by language models.

Paper
Code

ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty Estimation

1 code implementation • 22 Oct 2022 • Fan Yin, Yao Li, Cho-Jui Hsieh, Kai-Wei Chang

Finally, our analysis shows that the two types of uncertainty provided by \textbf{ADDMU} can be leveraged to characterize adversarial examples and identify the ones that contribute most to model's robustness in adversarial training.

Paper
Code

ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation

1 code implementation • 26 May 2023 • Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang, Aram Galstyan

Paraphrase generation is a long-standing task in natural language processing (NLP).

Data Augmentation Few-Shot Learning +6

Paper
Code

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step

1 code implementation • 24 Jun 2023 • Liunian Harold Li, Jack Hessel, Youngjae Yu, Xiang Ren, Kai-Wei Chang, Yejin Choi

We release our corpus of chain-of-thought samples and code.

Paper
Code

Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems

1 code implementation • 8 Oct 2023 • Yixin Wan, Jieyu Zhao, Aman Chadha, Nanyun Peng, Kai-Wei Chang

Recent advancements in Large Language Models empower them to follow freeform instructions, including imitating generic or specific demographic personas in conversations.

Benchmarking

Paper
Code

Counterfactual Language Model Adaptation for Suggesting Phrases

1 code implementation • IJCNLP 2017 • Kenneth C. Arnold, Kai-Wei Chang, Adam T. Kalai

Mobile devices use language models to suggest words and phrases for use in text entry.

counterfactual Language Modelling

Paper
Code

Robust Text Classifier on Test-Time Budgets

1 code implementation • IJCNLP 2019 • Md. Rizwan Parvez, Tolga Bolukbasi, Kai-Wei Chang, Venkatesh Saligrama

We propose a generic and interpretable learning framework for building robust text classification model that achieves accuracy comparable to full models under test-time budget constraints.

General Classification text-classification +1

Paper
Code

Mitigating Gender Bias in Natural Language Processing: Literature Review

1 code implementation • ACL 2019 • Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

In this paper, we review contemporary studies on recognizing and mitigating gender bias in NLP.

Paper
Code

Towards Understanding Gender Bias in Relation Extraction

1 code implementation • ACL 2020 • Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

We use WikiGenderBias to evaluate systems for bias and find that NRE systems exhibit gender biased predictions and lay groundwork for future evaluation of bias in NRE.

counterfactual Data Augmentation +3

Paper
Code

Socially Aware Bias Measurements for Hindi Language Representations

2 code implementations • NAACL 2022 • Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang

Language representations are efficient tools used across NLP applications, but they are strife with encoded societal biases.

Cultural Vocal Bursts Intensity Prediction

Paper
Code

Conditional Supervised Contrastive Learning for Fair Text Classification

1 code implementation • 23 May 2022 • Jianfeng Chi, William Shand, Yaodong Yu, Kai-Wei Chang, Han Zhao, Yuan Tian

Contrastive representation learning has gained much attention due to its superior performance in learning representations from both image and sequential data.

Contrastive Learning Fairness +3

Paper
Code

LearningWord Embeddings for Low-resource Languages by PU Learning

1 code implementation • 9 May 2018 • Chao Jiang, Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang

In such a situation, the co-occurrence matrix is sparse as the co-occurrences of many word pairs are unobserved.

Paper
Code

An Integer Linear Programming Framework for Mining Constraints from Data

1 code implementation • 18 Jun 2020 • Tao Meng, Kai-Wei Chang

This raises a question -- \emph{can we mine constraints and rules from data based on a learning algorithm?}

Multi-class Classification Multi-Label Classification

Paper
Code

Revealing Persona Biases in Dialogue Systems

1 code implementation • 18 Apr 2021 • Emily Sheng, Josh Arnold, Zhou Yu, Kai-Wei Chang, Nanyun Peng

Dialogue systems in the form of chatbots and personal assistants are being increasingly integrated into people's lives.

Paper
Code

Evaluating the Values of Sources in Transfer Learning

1 code implementation • NAACL 2021 • Md Rizwan Parvez, Kai-Wei Chang

Transfer learning that adapts a model trained on data-rich sources to low-resource targets has been widely applied in natural language processing (NLP).

Ranked #1 on Cross-Lingual POS Tagging on Universal Dependency Treebank

Cross-Lingual POS Tagging Transfer Learning

Paper
Code

MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models

1 code implementation • 30 May 2023 • Yu-Hsiang Wang, Huang-Yu Chen, Kai-Wei Chang, Winston Hsu, Hung-Yi Lee

In this paper, we introduce MiniSUPERB, a lightweight benchmark that efficiently evaluates SSL speech models with comparable results to SUPERB but lower computational costs significantly.

Self-Supervised Learning

Paper
Code

DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation

1 code implementation • 4 Mar 2024 • Xueqing Wu, Rui Zheng, Jingzhen Sha, Te-Lin Wu, Hanyu Zhou, Mohan Tang, Kai-Wei Chang, Nanyun Peng, Haoran Huang

We construct the DACO dataset, containing (1) 440 databases (of tabular data) collected from real-world scenarios, (2) ~2k query-answer pairs that can serve as weak supervision for model training, and (3) a concentrated but high-quality test set with human refined annotations that serves as our main evaluation benchmark.

2k Code Generation

Paper
Code

Visualizing Trends of Key Roles in News Articles

1 code implementation • IJCNLP 2019 • Chen Xia, Haoxiang Zhang, Jacob Moghtader, Allen Wu, Kai-Wei Chang

There are tons of news articles generated every day reflecting the activities of key roles such as people, organizations and political parties.

Paper
Code

"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition

1 code implementation • 29 Apr 2020 • Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang, Wei Wang

In this paper, we propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor, detect if a sentence contains puns and locate them in the sentence.

Sentence

Paper
Code

CASA: Causality-driven Argument Sufficiency Assessment

1 code implementation • 10 Jan 2024 • Xiao Liu, Yansong Feng, Kai-Wei Chang

Motivated by the definition of probability of sufficiency (PS) in the causal literature, we proposeCASA, a zero-shot causality-driven argument sufficiency assessment framework.

Logical Fallacy Detection

Paper
Code

Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing

1 code implementation • IJCNLP 2019 • Tao Meng, Nanyun Peng, Kai-Wei Chang

Experiments show that the Lagrangian relaxation and posterior regularization inference improve the performances on 15 and 17 out of 19 target languages, respectively.

Dependency Parsing

Paper
Code

Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training

1 code implementation • EMNLP 2021 • Kuan-Hao Huang, Wasi Uddin Ahmad, Nanyun Peng, Kai-Wei Chang

Pre-trained multilingual language encoders, such as multilingual BERT and XLM-R, show great potential for zero-shot cross-lingual transfer.

Sentence text-classification +4

Paper
Code

Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?

1 code implementation • Findings (ACL) 2021 • Jieyu Zhao, Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Kai-Wei Chang

We investigate the effectiveness of natural language interventions for reading-comprehension systems, studying this in the context of social stereotypes.

Ethics Few-Shot Learning +2

Paper
Code

TAGPRIME: A Unified Framework for Relational Structure Extraction

1 code implementation • 25 May 2022 • I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

In this work, we propose to take a unified view of all these tasks and introduce TAGPRIME to address relational structure extraction problems.

Event Argument Extraction Language Modelling +2

Paper
Code

LOGAN: Local Group Bias Detection by Clustering

1 code implementation • EMNLP 2020 • Jieyu Zhao, Kai-Wei Chang

Machine learning techniques have been widely used in natural language processing (NLP).

Bias Detection BIG-bench Machine Learning +1

Paper
Code

Cross-lingual Dependency Parsing with Unlabeled Auxiliary Languages

1 code implementation • CONLL 2019 • Wasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, Kai-Wei Chang, Nanyun Peng

We conduct experiments on cross-lingual dependency parsing where we train a dependency parser on a source language and transfer it to a wide range of target languages.

Cross-Lingual Transfer Dependency Parsing +2

Paper
Code

Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation

1 code implementation • NAACL 2021 • Chong Zhang, Jieyu Zhao, huan zhang, Kai-Wei Chang, Cho-Jui Hsieh

Our method is able to reveal the hidden model biases not directly shown in the test dataset.

counterfactual

Paper
Code

On the Sensitivity and Stability of Model Interpretations in NLP

1 code implementation • ACL 2022 • Fan Yin, Zhouxing Shi, Cho-Jui Hsieh, Kai-Wei Chang

We propose two new criteria, sensitivity and stability, that provide complementary notions of faithfulness to the existed removal-based criteria.

Adversarial Robustness Dependency Parsing +2

Paper
Code

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

1 code implementation • 25 May 2022 • Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

We utilize this ontology to further introduce GENEVA, a diverse generalizability benchmarking dataset comprising four test suites, aimed at evaluating models' ability to handle limited data and unseen event type generalization.

Benchmarking Event Argument Extraction +1

Paper
Code

Improving the Adversarial Robustness of NLP Models by Information Bottleneck

1 code implementation • Findings (ACL) 2022 • Cenyuan Zhang, Xiang Zhou, Yixin Wan, Xiaoqing Zheng, Kai-Wei Chang, Cho-Jui Hsieh

Existing studies have demonstrated that adversarial examples can be directly attributed to the presence of non-robust features, which are highly predictive, but can be easily manipulated by adversaries to fool NLP models.

Adversarial Robustness SST-2

Paper
Code

KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation

1 code implementation • 27 Mar 2023 • Di wu, Da Yin, Kai-Wei Chang

Despite the significant advancements in keyphrase extraction and keyphrase generation methods, the predominant approach for evaluation mainly relies on exact matching with human references.

Keyphrase Extraction Keyphrase Generation

Paper
Code

"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters

1 code implementation • 13 Oct 2023 • Yixin Wan, George Pu, Jiao Sun, Aparna Garimella, Kai-Wei Chang, Nanyun Peng

Through benchmarking evaluation on 2 popular LLMs- ChatGPT and Alpaca, we reveal significant gender biases in LLM-generated recommendation letters.

Benchmarking Fairness +1

Paper
Code

BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation

1 code implementation • 27 Jan 2021 • Jwala Dhamala, Tony Sun, Varun Kumar, Satyapriya Krishna, Yada Pruksachatkun, Kai-Wei Chang, Rahul Gupta

To systematically study and benchmark social biases in open-ended language generation, we introduce the Bias in Open-Ended Language Generation Dataset (BOLD), a large-scale dataset that consists of 23, 679 English text generation prompts for bias benchmarking across five domains: profession, gender, race, religion, and political ideology.

Benchmarking Text Generation

Paper
Code

Generating Sports News from Live Commentary: A Chinese Dataset for Sports Game Summarization

1 code implementation • Asian Chapter of the Association for Computational Linguistics 2020 • Kuan-Hao Huang, Chen Li, Kai-Wei Chang

To deeply study this task, we present SportsSum, a Chinese sports game summarization dataset which contains 5, 428 soccer games of live commentaries and the corresponding news articles.

Paper
Code

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following

1 code implementation • 18 Oct 2023 • Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang

Additional analysis shows that the contrastive objective and meta-actions are complementary in achieving the best results, and the resulting agent better aligns its states with corresponding instructions, making it more suitable for real-world embodied agents.

Contrastive Learning Instruction Following

Paper
Code

DeepEdit: Knowledge Editing as Decoding with Constraints

1 code implementation • 19 Jan 2024 • Yiwei Wang, Muhao Chen, Nanyun Peng, Kai-Wei Chang

To enforce these constraints, we utilize a depth-first search to adaptively substitute new knowledge for the LLMs' original reasoning steps, greedily seeking the optimal path of multi-hop reasoning with new knowledge.

Informativeness knowledge editing +2

Paper
Code

"Nice Try, Kiddo": Investigating Ad Hominems in Dialogue Responses

1 code implementation • 24 Oct 2020 • Emily Sheng, Kai-Wei Chang, Premkumar Natarajan, Nanyun Peng

Ad hominem attacks are those that target some feature of a person's character instead of the position the person is maintaining.

Abusive Language

Paper
Code

Efficient Shapley Values Estimation by Amortization for Text Classification

1 code implementation • 31 May 2023 • Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma, Bing Xiang

In practice, Shapley Values are often estimated with a small number of stochastic model evaluations.

text-classification Text Classification

Paper
Code

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

1 code implementation • 3 Jul 2023 • Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

However, we find visual and textual fine-grained information, e. g., keywords in the sentence and objects in the image, can be fairly informative for semantics understanding.

Image-text matching Sentence +2

Paper
Code

Learning to Represent Bilingual Dictionaries

1 code implementation • CONLL 2019 • Muhao Chen, Yingtao Tian, Haochen Chen, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo

Bilingual word embeddings have been widely used to capture the similarity of lexical semantics in different human languages.

Multi-Task Learning Paraphrase Identification +5

Paper
Code

Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer

1 code implementation • ACL 2020 • Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini, Kai-Wei Chang, Ahmed Hassan Awadallah

In this paper, we study gender bias in multilingual embeddings and how it affects transfer learning for NLP applications.

Cross-Lingual Transfer Transfer Learning

Paper
Code

On the Robustness of Language Encoders against Grammatical Errors

1 code implementation • ACL 2020 • Fan Yin, Quanyu Long, Tao Meng, Kai-Wei Chang

We conduct a thorough study to diagnose the behaviors of pre-trained language encoders (ELMo, BERT, and RoBERTa) when confronted with natural grammatical errors.

Cloze Test Linguistic Acceptability +1

Paper
Code

Intent Classification and Slot Filling for Privacy Policies

1 code implementation • ACL 2021 • Wasi Uddin Ahmad, Jianfeng Chi, Tu Le, Thomas Norton, Yuan Tian, Kai-Wei Chang

We refer to predicting the privacy practice explained in a sentence as intent classification and identifying the text spans sharing specific information as slot filling.

General Classification intent-classification +3

Paper
Code

PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English

1 code implementation • 20 Dec 2022 • Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian, Kai-Wei Chang

Privacy policies provide individuals with information about their rights and how their personal information is handled.

Language Modelling Natural Language Understanding

Paper
Code

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

1 code implementation • 31 Mar 2024 • Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover

A common technique for aligning large language models (LLMs) relies on acquiring human preferences by comparing multiple generations conditioned on a fixed context.

Paper
Code

Gender Bias in Contextualized Word Embeddings

2 code implementations • NAACL 2019 • Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordonez, Kai-Wei Chang

In this paper, we quantify, analyze and mitigate gender bias exhibited in ELMo's contextualized word vectors.

Word Embeddings

Paper
Code

PolicyQA: A Reading Comprehension Dataset for Privacy Policies

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Wasi Uddin Ahmad, Jianfeng Chi, Yuan Tian, Kai-Wei Chang

Prior studies in this domain frame the QA task as retrieving the most relevant text segment or a list of sentences from the policy document given a question.

Question Answering Reading Comprehension

Paper
Code

Representation Learning for Resource-Constrained Keyphrase Generation

1 code implementation • 15 Mar 2022 • Di wu, Wasi Uddin Ahmad, Sunipa Dev, Kai-Wei Chang

State-of-the-art keyphrase generation methods generally depend on large annotated datasets, limiting their performance in domains with limited annotated data.

Denoising Domain Adaptation +4

Paper
Code

Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages

1 code implementation • 23 May 2022 • Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

In code generation, the model learns to do the opposite.

Code Generation Code Summarization +2

Paper
Code

GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models

1 code implementation • 24 May 2022 • Da Yin, Hritik Bansal, Masoud Monajatipoor, Liunian Harold Li, Kai-Wei Chang

In this paper, we introduce a benchmark dataset, Geo-Diverse Commonsense Multilingual Language Models Analysis (GeoMLAMA), for probing the diversity of the relational knowledge in multilingual PLMs.

Language Modelling

Paper
Code

Text encoders bottleneck compositionality in contrastive vision-language models

1 code implementation • 24 May 2023 • Amita Kamath, Jack Hessel, Kai-Wei Chang

We first curate CompPrompts, a set of increasingly compositional image captions that VL models should be able to capture (e. g., single object, to object+property, to multiple interacting objects).

Attribute Image Captioning +1

Paper
Code

Red Teaming Language Model Detectors with Language Models

2 code implementations • 31 May 2023 • Zhouxing Shi, Yihan Wang, Fan Yin, Xiangning Chen, Kai-Wei Chang, Cho-Jui Hsieh

The prevalence and strong capability of large language models (LLMs) present significant safety and ethical risks if exploited by malicious users.

Adversarial Robustness Language Modelling +2

Paper
Code

How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?

1 code implementation • 27 Oct 2022 • Hritik Bansal, Da Yin, Masoud Monajatipoor, Kai-Wei Chang

To this end, we introduce an Ethical NaTural Language Interventions in Text-to-Image GENeration (ENTIGEN) benchmark dataset to evaluate the change in image generations conditional on ethical interventions across three social axes -- gender, skin color, and culture.

Cultural Vocal Bursts Intensity Prediction Text-to-Image Generation

Paper
Code

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty

1 code implementation • 2 Dec 2023 • Cheng-Fu Yang, Haoyang Xu, Te-Lin Wu, Xiaofeng Gao, Kai-Wei Chang, Feng Gao

In this paper, we aim to tackle this problem with a unified framework consisting of an end-to-end trainable method and a planning algorithm.

Denoising Vision-Language Navigation

Paper
Code

Societal Biases in Language Generation: Progress and Challenges

1 code implementation • ACL 2021 • Emily Sheng, Kai-Wei Chang, Premkumar Natarajan, Nanyun Peng

Technology for language generation has advanced rapidly, spurred by advancements in pre-training large models on massive amounts of data and the need for intelligent agents to communicate in a natural manner.

Fairness Text Generation

Paper
Code

Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification

1 code implementation • IJCNLP 2019 • Yichao Zhou, Jyun-Yu Jiang, Kai-Wei Chang, Wei Wang

To identify adversarial attacks, a perturbation discriminator validates how likely a token in the text is perturbed and provides a set of potential perturbations.

Blocking General Classification +3

Paper
Code

Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble

1 code implementation • 20 Jun 2020 • Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang

Despite neural networks have achieved prominent performance on many natural language processing (NLP) tasks, they are vulnerable to adversarial examples.

Sentence

Paper
Code

Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble

1 code implementation • ACL 2021 • Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang

Although deep neural networks have achieved prominent performance on many NLP tasks, they are vulnerable to adversarial examples.

Sentence

Paper
Code

Syntax-augmented Multilingual BERT for Cross-lingual Transfer

1 code implementation • ACL 2021 • Wasi Uddin Ahmad, Haoran Li, Kai-Wei Chang, Yashar Mehdad

In recent years, we have seen a colossal effort in pre-training multilingual text encoders using large-scale corpora in many languages to facilitate cross-lingual transfer learning.

Cross-Lingual Transfer named-entity-recognition +7

Paper
Code

Controllable Text Generation with Neurally-Decomposed Oracle

1 code implementation • 27 May 2022 • Tao Meng, Sidi Lu, Nanyun Peng, Kai-Wei Chang

We propose a general and efficient framework to control auto-regressive generation models with NeurAlly-Decomposed Oracle (NADO).

Language Modelling Machine Translation +1

Paper
Code

Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks

1 code implementation • 1 Nov 2023 • Po-Nien Kung, Fan Yin, Di wu, Kai-Wei Chang, Nanyun Peng

Instruction tuning (IT) achieves impressive zero-shot generalization results by training large language models (LLMs) on a massive amount of diverse tasks with instructions.

Informativeness Out-of-Distribution Generalization +1

Paper
Code

On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing

2 code implementations • NAACL 2019 • Wasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, Eduard Hovy, Kai-Wei Chang, Nanyun Peng

Different languages might have different word orders.

Cross-Lingual Transfer Dependency Parsing

Paper
Code

Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction

1 code implementation • ACL 2022 • Kuan-Hao Huang, I-Hung Hsu, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

We present a study on leveraging multilingual pre-trained generative language models for zero-shot cross-lingual event argument extraction (EAE).

Event Argument Extraction Text Generation +1

Paper
Code

Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study

1 code implementation • 20 Dec 2022 • Di wu, Wasi Uddin Ahmad, Kai-Wei Chang

However, there lacks a systematic study of how the two types of approaches compare and how different design choices can affect the performance of PLM-based models.

Keyphrase Extraction Keyphrase Generation

Paper
Code

Rethinking Model Selection and Decoding for Keyphrase Generation with Pre-trained Sequence-to-Sequence Models

1 code implementation • 10 Oct 2023 • Di wu, Wasi Uddin Ahmad, Kai-Wei Chang

DeSel improves greedy search by an average of 4. 7% semantic F1 across five datasets.

Keyphrase Generation Model Selection

Paper
Code

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

1 code implementation • 20 Feb 2024 • Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-Yi Lee

Speech emotion recognition (SER) is a pivotal technology for human-computer interaction systems.

Self-Supervised Learning Speech Emotion Recognition

Paper
Code

On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation

1 code implementation • 21 Feb 2024 • Di wu, Wasi Uddin Ahmad, Kai-Wei Chang

This study addresses the application of encoder-only Pre-trained Language Models (PLMs) in keyphrase generation (KPG) amidst the broader availability of domain-tailored encoder-only models compared to encoder-decoder models.

Keyphrase Generation

Paper
Code

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

1 code implementation • 7 May 2021 • Yi-Chen Chen, Po-Han Chi, Shu-wen Yang, Kai-Wei Chang, Jheng-Hao Lin, Sung-Feng Huang, Da-Rong Liu, Chi-Liang Liu, Cheng-Kuang Lee, Hung-Yi Lee

The multi-task learning of a wide variety of speech processing tasks with a universal model has not been studied.

Multi-Task Learning

Paper
Code

Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data

1 code implementation • 27 Feb 2024 • Xiao Liu, Zirui Wu, Xueqing Wu, Pan Lu, Kai-Wei Chang, Yansong Feng

To address this gap, we introduce the Quantitative Reasoning with Data (QRData) benchmark, aiming to evaluate Large Language Models' capability in statistical and causal reasoning with real-world data.

Benchmarking

Paper
Code

Examining Gender Bias in Languages with Grammatical Gender

1 code implementation • IJCNLP 2019 • Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang

Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora.

Translation Word Embeddings +2

Paper
Code

Semantic Probabilistic Layers for Neuro-Symbolic Learning

1 code implementation • 1 Jun 2022 • Kareem Ahmed, Stefano Teso, Kai-Wei Chang, Guy Van Den Broeck, Antonio Vergari

We design a predictive layer for structured-output prediction (SOP) that can be plugged into any neural network guaranteeing its predictions are consistent with a set of predefined symbolic constraints.

Hierarchical Multi-label Classification Logical Reasoning

Paper
Code

Model Editing Can Hurt General Abilities of Large Language Models

1 code implementation • 9 Jan 2024 • Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng

One critical challenge that has emerged is the presence of hallucinations in the output of large language models (LLMs) due to false or outdated knowledge.

Model Editing Question Answering

Paper
Code

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models

1 code implementation • NAACL 2021 • James Y. Huang, Kuan-Hao Huang, Kai-Wei Chang

In this work, we present ParaBART, a semantic sentence embedding model that learns to disentangle semantics and syntax in sentence embeddings obtained by pre-trained language models.

Paper
Code

CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning

1 code implementation • ICCV 2023 • Hritik Bansal, Nishad Singhi, Yu Yang, Fan Yin, Aditya Grover, Kai-Wei Chang

Multimodal contrastive pretraining has been used to train multimodal representation models, such as CLIP, on large amounts of paired image-text data.

Backdoor Attack Contrastive Learning +1

Paper
Code

Robustness Verification for Transformers

1 code implementation • ICLR 2020 • Zhouxing Shi, huan zhang, Kai-Wei Chang, Minlie Huang, Cho-Jui Hsieh

Robustness verification that aims to formally certify the prediction behavior of neural networks has become an important tool for understanding model behavior and obtaining safety guarantees.

Position Sentiment Analysis

Paper
Code

Integrating topic modeling and word embedding to characterize violent deaths

1 code implementation • 28 Jun 2021 • Alina Arseniev-Koehler, Susan D. Cochran, Vickie M. Mays, Kai-Wei Chang, Jacob Gates Foster

Our method offers a flexible and broadly applicable approach to model topics in text data.

Paper
Code

TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction

1 code implementation • 16 Nov 2023 • Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji

In this work, we identify and address evaluation challenges, including inconsistency due to varying data assumptions or preprocessing steps, the insufficiency of current evaluation frameworks that may introduce dataset or data split bias, and the low reproducibility of some previous approaches.

Benchmarking Event Extraction

Paper
Code

On Prompt-Driven Safeguarding for Large Language Models

1 code implementation • 31 Jan 2024 • Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng

Prepending model inputs with safety prompts is a common practice for safeguarding large language models (LLMs) from complying with queries that contain harmful intents.

Paper
Code

Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution

1 code implementation • EMNLP 2021 • Zongyi Li, Jianhan Xu, Jiehang Zeng, Linyang Li, Xiaoqing Zheng, Qi Zhang, Kai-Wei Chang, Cho-Jui Hsieh

Recent studies have shown that deep neural networks are vulnerable to intentionally crafted adversarial examples, and various methods have been proposed to defend against adversarial word-substitution attacks for neural NLP models.

Benchmarking

Paper
Code

Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning

1 code implementation • EMNLP 2021 • Da Yin, Liunian Harold Li, Ziniu Hu, Nanyun Peng, Kai-Wei Chang

Commonsense is defined as the knowledge that is shared by everyone.

Ranked #1 on Visual Commonsense Reasoning on GD-VCR

Cultural Vocal Bursts Intensity Prediction Visual Commonsense Reasoning

Paper
Code

On the Paradox of Learning to Reason from Data

1 code implementation • 23 May 2022 • Honghua Zhang, Liunian Harold Li, Tao Meng, Kai-Wei Chang, Guy Van Den Broeck

Logical reasoning is needed in a wide range of NLP tasks.

Logical Reasoning

Paper
Code

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

1 code implementation • 24 May 2023 • Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-Wei Chang, Shih-Fu Chang

Specifically, IdealGPT utilizes an LLM to generate sub-questions, a VLM to provide corresponding sub-answers, and another LLM to reason to achieve the final answer.

Paper
Code

Building Language Models for Text with Named Entities

2 code implementations • ACL 2018 • Md. Rizwan Parvez, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

Text in many domains involves a significant amount of named entities.

Ranked #1 on Recipe Generation on Now You're Cooking!

Code Generation Language Modelling +1

Paper
Code

What's "up" with vision-language models? Investigating their struggle with spatial reasoning

1 code implementation • 30 Oct 2023 • Amita Kamath, Jack Hessel, Kai-Wei Chang

Recent vision-language (VL) models are powerful, but can they reliably distinguish "right" from "left"?

Paper
Code

Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations

2 code implementations • ICCV 2019 • Tianlu Wang, Jieyu Zhao, Mark Yatskar, Kai-Wei Chang, Vicente Ordonez

In this work, we present a framework to measure and mitigate intrinsic biases with respect to protected variables --such as gender-- in visual recognition tasks.

Temporal Action Localization

Paper
Code

Retrieval Augmented Code Generation and Summarization

1 code implementation • Findings (EMNLP) 2021 • Md Rizwan Parvez, Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

To mimic developers' code or summary generation behavior, we propose a retrieval augmented framework, REDCODER, that retrieves relevant code or summaries from a retrieval database and provides them as a supplement to code generation or summarization models.

Ranked #1 on Code Generation on CodeXGLUE - CodeSearchNet (using extra training data)

Code Generation Code Summarization +1

Paper
Code

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

8 code implementations • NeurIPS 2016 • Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai

Geometrically, gender bias is first shown to be captured by a direction in the word embedding.

BIG-bench Machine Learning Word Embeddings

Paper
Code

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

1 code implementation • EACL 2021 • Kuan-Hao Huang, Kai-Wei Chang

We also demonstrate that the performance of SynPG is competitive or even better than supervised models when the unannotated data is large.

Data Augmentation Disentanglement +2

Paper
Code

Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference

2 code implementations • 16 Dec 2020 • Yichao Zhou, Yu Yan, Rujun Han, J. Harry Caufield, Kai-Wei Chang, Yizhou Sun, Peipei Ping, Wei Wang

There has been a steady need in the medical community to precisely extract the temporal relations between clinical events.

Feature Engineering Question Answering +3

Paper
Code

Learning Gender-Neutral Word Embeddings

1 code implementation • EMNLP 2018 • Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, Kai-Wei Chang

Word embedding models have become a fundamental component in a wide range of Natural Language Processing (NLP) applications.

Word Embeddings

Paper
Code

VideoCon: Robust Video-Language Alignment via Contrast Captions

1 code implementation • 15 Nov 2023 • Hritik Bansal, Yonatan Bitton, Idan Szpektor, Kai-Wei Chang, Aditya Grover

Despite being (pre)trained on a massive amount of data, state-of-the-art video-language alignment models are not robust to semantically-plausible contrastive changes in the video captions.

Language Modelling Large Language Model +5

Paper
Code

GATE: Graph Attention Transformer Encoder for Cross-lingual Relation and Event Extraction

1 code implementation • 6 Oct 2020 • Wasi Uddin Ahmad, Nanyun Peng, Kai-Wei Chang

Recent progress in cross-lingual relation and event extraction use graph convolutional networks (GCNs) with universal dependency parses to learn language-agnostic sentence representations such that models trained on one language can be applied to other languages.

Event Extraction Graph Attention +2

Paper
Code

AVATAR: A Parallel Corpus for Java-Python Program Translation

1 code implementation • 26 Aug 2021 • Wasi Uddin Ahmad, Md Golam Rahman Tushar, Saikat Chakraborty, Kai-Wei Chang

Automating program translation is of paramount importance in software migration, and recently researchers explored unsupervised approaches due to the unavailability of parallel corpora.

Translation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.