Search Results for author: Kuan-Hao Huang

Found 26 papers, 20 papers with code

Text-Based Reasoning About Vector Graphics

no code implementations • 9 Apr 2024 • Zhenhailong Wang, Joy Hsu, Xingyao Wang, Kuan-Hao Huang, Manling Li, Jiajun Wu, Heng Ji

By casting an image to a text-based representation, we can leverage the power of language models to learn alignment from SVG to visual primitives and generalize to unseen question-answering tasks.

Descriptive Language Modelling +2

Paper
Add Code

Event Detection from Social Media for Epidemic Prediction

1 code implementation • 2 Apr 2024 • Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, Kai-Wei Chang

In our work, we pioneer exploiting Event Detection (ED) for better preparedness and early warnings of any upcoming epidemic by developing a framework to extract and analyze epidemic-related events from social media posts.

Event Detection

Paper
Code

TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction

1 code implementation • 16 Nov 2023 • Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji

In this work, we identify and address evaluation challenges, including inconsistency due to varying data assumptions or preprocessing steps, the insufficiency of current evaluation frameworks that may introduce dataset or data split bias, and the low reproducibility of some previous approaches.

Benchmarking Event Extraction

Paper
Code

Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer

no code implementations • 19 Sep 2023 • Fei Wang, Kuan-Hao Huang, Kai-Wei Chang, Muhao Chen

In this paper, we propose a simple yet effective method, SALT, to improve the zero-shot cross-lingual transfer of the multilingual pretrained language models without the help of such external data.

Multilingual NLP Zero-Shot Cross-Lingual Transfer

Paper
Add Code

Contextual Label Projection for Cross-Lingual Structured Prediction

1 code implementation • 16 Sep 2023 • Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

Label projection, which involves obtaining translated labels and texts jointly, is essential for leveraging machine translation to facilitate cross-lingual transfer in structured prediction tasks.

Event Argument Extraction Machine Translation +6

Paper
Code

ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation

1 code implementation • 26 May 2023 • Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang, Aram Galstyan

Paraphrase generation is a long-standing task in natural language processing (NLP).

Data Augmentation Few-Shot Learning +6

Paper
Code

AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model

1 code implementation • 26 May 2023 • I-Hung Hsu, Zhiyu Xie, Kuan-Hao Huang, Prem Natarajan, Nanyun Peng

However, existing generation-based EAE models mostly focus on problem re-formulation and prompt design, without incorporating additional information that has been shown to be effective for classification-based models, such as the abstract meaning representation (AMR) of the input passages.

Event Argument Extraction

Paper
Code

PIP: Parse-Instructed Prefix for Syntactically Controlled Paraphrase Generation

1 code implementation • 26 May 2023 • Yixin Wan, Kuan-Hao Huang, Kai-Wei Chang

Existing fine-tuning methods for this task are costly as all the parameters of the model need to be updated during the training process.

Paraphrase Generation

Paper
Code

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis

1 code implementation • 23 May 2023 • Oscar Chew, Hsuan-Tien Lin, Kai-Wei Chang, Kuan-Hao Huang

Recent research has revealed that machine learning models have a tendency to leverage spurious correlations that exist in the training set but may not hold true in general circumstances.

text-classification Text Classification

Paper
Code

Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes

no code implementations • 22 May 2023 • Kuan-Hao Huang, Liang Tan, Rui Hou, Sinong Wang, Amjad Almahairi, Ruty Rinott

Fine-tuning a large pre-trained language model for each downstream task causes computational burdens in the inference time due to several times of forward passes.

Language Modelling

Paper
Add Code

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

no code implementations • 2 Nov 2022 • Kuan-Hao Huang, Varun Iyer, Anoop Kumar, Sriram Venkatapathy, Kai-Wei Chang, Aram Galstyan

In this paper, we demonstrate that leveraging Abstract Meaning Representations (AMR) can greatly improve the performance of unsupervised syntactically controlled paraphrase generation.

Data Augmentation Paraphrase Generation +1

Paper
Add Code

TAGPRIME: A Unified Framework for Relational Structure Extraction

1 code implementation • 25 May 2022 • I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

In this work, we propose to take a unified view of all these tasks and introduce TAGPRIME to address relational structure extraction problems.

Event Argument Extraction Language Modelling +2

Paper
Code

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

1 code implementation • 25 May 2022 • Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

We utilize this ontology to further introduce GENEVA, a diverse generalizability benchmarking dataset comprising four test suites, aimed at evaluating models' ability to handle limited data and unseen event type generalization.

Benchmarking Event Argument Extraction +1

Paper
Code

A Comparative Survey of Deep Active Learning

1 code implementation • 25 Mar 2022 • Xueying Zhan, Qingzhong Wang, Kuan-Hao Huang, Haoyi Xiong, Dejing Dou, Antoni B. Chan

In this work, We construct a DAL toolkit, DeepAL+, by re-implementing 19 highly-cited DAL methods.

Active Learning

153

Paper
Code

Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction

1 code implementation • ACL 2022 • Kuan-Hao Huang, I-Hung Hsu, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

We present a study on leveraging multilingual pre-trained generative language models for zero-shot cross-lingual event argument extraction (EAE).

Event Argument Extraction Text Generation +1

Paper
Code

DeepAL: Deep Active Learning in Python

2 code implementations • 30 Nov 2021 • Kuan-Hao Huang

We present DeepAL, a Python library that implements several common strategies for active learning, with a particular emphasis on deep active learning.

Active Learning

761

Paper
Code

DEGREE: A Data-Efficient Generation-Based Event Extraction Model

2 code implementations • NAACL 2022 • I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng

Given a passage and a manually designed prompt, DEGREE learns to summarize the events mentioned in the passage into a natural sentence that follows a predefined pattern.

Event Extraction Sentence +2

Paper
Code

Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training

1 code implementation • EMNLP 2021 • Kuan-Hao Huang, Wasi Uddin Ahmad, Nanyun Peng, Kai-Wei Chang

Pre-trained multilingual language encoders, such as multilingual BERT and XLM-R, show great potential for zero-shot cross-lingual transfer.

Sentence text-classification +4

Paper
Code

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models

1 code implementation • NAACL 2021 • James Y. Huang, Kuan-Hao Huang, Kai-Wei Chang

In this work, we present ParaBART, a semantic sentence embedding model that learns to disentangle semantics and syntax in sentence embeddings obtained by pre-trained language models.

Paper
Code

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

1 code implementation • EACL 2021 • Kuan-Hao Huang, Kai-Wei Chang

We also demonstrate that the performance of SynPG is competitive or even better than supervised models when the unannotated data is large.

Data Augmentation Disentanglement +2

Paper
Code

Generating Sports News from Live Commentary: A Chinese Dataset for Sports Game Summarization

1 code implementation • Asian Chapter of the Association for Computational Linguistics 2020 • Kuan-Hao Huang, Chen Li, Kai-Wei Chang

To deeply study this task, we present SportsSum, a Chinese sports game summarization dataset which contains 5, 428 soccer games of live commentaries and the corresponding news articles.

Paper
Code

Examining Gender Bias in Languages with Grammatical Gender

1 code implementation • IJCNLP 2019 • Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang

Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora.

Translation Word Embeddings +2

Paper
Code

JECL: Joint Embedding and Cluster Learning for Image-Text Pairs

no code implementations • 4 Jan 2019 • Sean T. Yang, Kuan-Hao Huang, Bill Howe

We propose JECL, a method for clustering image-caption pairs by training parallel encoders with regularized clustering and alignment objectives, simultaneously learning both representations and cluster assignments.

Clustering

Paper
Add Code

Dynamic Principal Projection for Cost-Sensitive Online Multi-Label Classification

no code implementations • 14 Nov 2017 • Hong-Min Chu, Kuan-Hao Huang, Hsuan-Tien Lin

The foundation of CS-DPP is an online LSDR framework derived from a leading LSDR algorithm.

Classification General Classification +1

Paper
Add Code

Cost-Sensitive Reference Pair Encoding for Multi-Label Learning

1 code implementation • 29 Nov 2016 • Yao-Yuan Yang, Kuan-Hao Huang, Chih-Wei Chang, Hsuan-Tien Lin

Label space expansion for multi-label classification (MLC) is a methodology that encodes the original label vectors to higher dimensional codes before training and decodes the predicted codes back to the label vectors during testing.

Active Learning Multi-Label Classification +1

Paper
Code

Cost-Sensitive Label Embedding for Multi-Label Classification

2 code implementations • 30 Mar 2016 • Kuan-Hao Huang, Hsuan-Tien Lin

Furthermore, extensive experimental results demonstrate that CLEMS is significantly better than a wide spectrum of existing LE algorithms and state-of-the-art cost-sensitive algorithms across different cost functions.

Classification General Classification +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.