Search Results for author: Kuan-Hao Huang

Found 26 papers, 20 papers with code

Text-Based Reasoning About Vector Graphics

no code implementations9 Apr 2024 Zhenhailong Wang, Joy Hsu, Xingyao Wang, Kuan-Hao Huang, Manling Li, Jiajun Wu, Heng Ji

By casting an image to a text-based representation, we can leverage the power of language models to learn alignment from SVG to visual primitives and generalize to unseen question-answering tasks.

Descriptive Language Modelling +2

Event Detection from Social Media for Epidemic Prediction

1 code implementation2 Apr 2024 Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, Kai-Wei Chang

In our work, we pioneer exploiting Event Detection (ED) for better preparedness and early warnings of any upcoming epidemic by developing a framework to extract and analyze epidemic-related events from social media posts.

Event Detection

TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction

1 code implementation16 Nov 2023 Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji

In this work, we identify and address evaluation challenges, including inconsistency due to varying data assumptions or preprocessing steps, the insufficiency of current evaluation frameworks that may introduce dataset or data split bias, and the low reproducibility of some previous approaches.

Benchmarking Event Extraction

Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer

no code implementations19 Sep 2023 Fei Wang, Kuan-Hao Huang, Kai-Wei Chang, Muhao Chen

In this paper, we propose a simple yet effective method, SALT, to improve the zero-shot cross-lingual transfer of the multilingual pretrained language models without the help of such external data.

Multilingual NLP Zero-Shot Cross-Lingual Transfer

Contextual Label Projection for Cross-Lingual Structured Prediction

1 code implementation16 Sep 2023 Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

Label projection, which involves obtaining translated labels and texts jointly, is essential for leveraging machine translation to facilitate cross-lingual transfer in structured prediction tasks.

Event Argument Extraction Machine Translation +6

AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model

1 code implementation26 May 2023 I-Hung Hsu, Zhiyu Xie, Kuan-Hao Huang, Prem Natarajan, Nanyun Peng

However, existing generation-based EAE models mostly focus on problem re-formulation and prompt design, without incorporating additional information that has been shown to be effective for classification-based models, such as the abstract meaning representation (AMR) of the input passages.

Event Argument Extraction

PIP: Parse-Instructed Prefix for Syntactically Controlled Paraphrase Generation

1 code implementation26 May 2023 Yixin Wan, Kuan-Hao Huang, Kai-Wei Chang

Existing fine-tuning methods for this task are costly as all the parameters of the model need to be updated during the training process.

Paraphrase Generation

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis

1 code implementation23 May 2023 Oscar Chew, Hsuan-Tien Lin, Kai-Wei Chang, Kuan-Hao Huang

Recent research has revealed that machine learning models have a tendency to leverage spurious correlations that exist in the training set but may not hold true in general circumstances.

text-classification Text Classification

Learning Easily Updated General Purpose Text Representations with Adaptable Task-Specific Prefixes

no code implementations22 May 2023 Kuan-Hao Huang, Liang Tan, Rui Hou, Sinong Wang, Amjad Almahairi, Ruty Rinott

Fine-tuning a large pre-trained language model for each downstream task causes computational burdens in the inference time due to several times of forward passes.

Language Modelling

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

no code implementations2 Nov 2022 Kuan-Hao Huang, Varun Iyer, Anoop Kumar, Sriram Venkatapathy, Kai-Wei Chang, Aram Galstyan

In this paper, we demonstrate that leveraging Abstract Meaning Representations (AMR) can greatly improve the performance of unsupervised syntactically controlled paraphrase generation.

Data Augmentation Paraphrase Generation +1

TAGPRIME: A Unified Framework for Relational Structure Extraction

1 code implementation25 May 2022 I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng

In this work, we propose to take a unified view of all these tasks and introduce TAGPRIME to address relational structure extraction problems.

Event Argument Extraction Language Modelling +2

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles

1 code implementation25 May 2022 Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

We utilize this ontology to further introduce GENEVA, a diverse generalizability benchmarking dataset comprising four test suites, aimed at evaluating models' ability to handle limited data and unseen event type generalization.

Benchmarking Event Argument Extraction +1

A Comparative Survey of Deep Active Learning

1 code implementation25 Mar 2022 Xueying Zhan, Qingzhong Wang, Kuan-Hao Huang, Haoyi Xiong, Dejing Dou, Antoni B. Chan

In this work, We construct a DAL toolkit, DeepAL+, by re-implementing 19 highly-cited DAL methods.

Active Learning

DeepAL: Deep Active Learning in Python

2 code implementations30 Nov 2021 Kuan-Hao Huang

We present DeepAL, a Python library that implements several common strategies for active learning, with a particular emphasis on deep active learning.

Active Learning

DEGREE: A Data-Efficient Generation-Based Event Extraction Model

2 code implementations NAACL 2022 I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng

Given a passage and a manually designed prompt, DEGREE learns to summarize the events mentioned in the passage into a natural sentence that follows a predefined pattern.

Event Extraction Sentence +2

Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training

1 code implementation EMNLP 2021 Kuan-Hao Huang, Wasi Uddin Ahmad, Nanyun Peng, Kai-Wei Chang

Pre-trained multilingual language encoders, such as multilingual BERT and XLM-R, show great potential for zero-shot cross-lingual transfer.

Sentence text-classification +4

Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models

1 code implementation NAACL 2021 James Y. Huang, Kuan-Hao Huang, Kai-Wei Chang

In this work, we present ParaBART, a semantic sentence embedding model that learns to disentangle semantics and syntax in sentence embeddings obtained by pre-trained language models.

Semantic Similarity Semantic Textual Similarity +3

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

1 code implementation EACL 2021 Kuan-Hao Huang, Kai-Wei Chang

We also demonstrate that the performance of SynPG is competitive or even better than supervised models when the unannotated data is large.

Data Augmentation Disentanglement +2

Generating Sports News from Live Commentary: A Chinese Dataset for Sports Game Summarization

1 code implementation Asian Chapter of the Association for Computational Linguistics 2020 Kuan-Hao Huang, Chen Li, Kai-Wei Chang

To deeply study this task, we present SportsSum, a Chinese sports game summarization dataset which contains 5, 428 soccer games of live commentaries and the corresponding news articles.

JECL: Joint Embedding and Cluster Learning for Image-Text Pairs

no code implementations4 Jan 2019 Sean T. Yang, Kuan-Hao Huang, Bill Howe

We propose JECL, a method for clustering image-caption pairs by training parallel encoders with regularized clustering and alignment objectives, simultaneously learning both representations and cluster assignments.

Clustering

Cost-Sensitive Reference Pair Encoding for Multi-Label Learning

1 code implementation29 Nov 2016 Yao-Yuan Yang, Kuan-Hao Huang, Chih-Wei Chang, Hsuan-Tien Lin

Label space expansion for multi-label classification (MLC) is a methodology that encodes the original label vectors to higher dimensional codes before training and decodes the predicted codes back to the label vectors during testing.

Active Learning Multi-Label Classification +1

Cost-Sensitive Label Embedding for Multi-Label Classification

2 code implementations30 Mar 2016 Kuan-Hao Huang, Hsuan-Tien Lin

Furthermore, extensive experimental results demonstrate that CLEMS is significantly better than a wide spectrum of existing LE algorithms and state-of-the-art cost-sensitive algorithms across different cost functions.

Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.