Search Results for author: Richard Tzong-Han Tsai

Found 24 papers, 1 papers with code

BRCC and SentiBahasaRojak: The First Bahasa Rojak Corpus for Pretraining and Sentiment Analysis Dataset

no code implementations • COLING 2022 • Nanda Putri Romadhona, Sin-En Lu, Bo-Han Lu, Richard Tzong-Han Tsai

Finally, to test the effectiveness of the Mixed XLM model pre-trained on BRCC for social media scenarios where code-mixing is found frequently, we compile a new Bahasa Rojak sentiment analysis dataset, SentiBahasaRojak, with a Kappa value of 0. 77.

Data Augmentation Sentiment Analysis +1

Paper
Add Code

Verdict Inference with Claim and Retrieved Elements Using RoBERTa

no code implementations • EMNLP (FEVER) 2021 • In-Zu Gi, Ting-Yu Fang, Richard Tzong-Han Tsai

Automatic fact verification has attracted recent research attention as the increasing dissemination of disinformation on social media platforms.

Fact Verification Retrieval

Paper
Add Code

Enhancing Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems

no code implementations • 18 Mar 2024 • Bo-Han Lu, Yi-Hsuan Lin, En-Shiun Annie Lee, Richard Tzong-Han Tsai

We employ a pre-trained LLaMA2-7B model specialized in Traditional Mandarin Chinese to leverage the orthographic similarities between Taiwanese Hokkien Han and Traditional Mandarin Chinese.

Machine Translation Translation

Paper
Add Code

SMUTF: Schema Matching Using Generative Tags and Hybrid Features

no code implementations • 22 Jan 2024 • Yu Zhang, Mei Di, Haozheng Luo, Chenwei Xu, Richard Tzong-Han Tsai

Recognizing the lack of extensive, publicly available datasets for SM, we have created and open-sourced the HDXSM dataset from the public humanitarian data.

Feature Engineering Humanitarian

Paper
Add Code

Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages

no code implementations • 7 Oct 2023 • Shih-Cheng Huang, Pin-Zu Li, Yu-Chi Hsu, Kuang-Ming Chen, Yu Tung Lin, Shih-Kai Hsiao, Richard Tzong-Han Tsai, Hung-Yi Lee

By simply adding the chat vector to a continual pre-trained model's weights, we can endow the model with chat capabilities in new languages without the need for further training.

Instruction Following

Paper
Add Code

Large Language Models on the Chessboard: A Study on ChatGPT's Formal Language Comprehension and Complex Reasoning Skills

no code implementations • 29 Aug 2023 • Mu-Tien Kuo, Chih-Chung Hsueh, Richard Tzong-Han Tsai

This paper probes the performance of ChatGPT, a sophisticated language model by OpenAI in tackling such complex reasoning tasks, using chess as a case study.

Decision Making Language Modelling

Paper
Add Code

Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien

1 code implementation • 21 Jan 2023 • Sin-En Lu, Bo-Han Lu, Chao-Yi Lu, Richard Tzong-Han Tsai

In natural language processing (NLP), code-mixing (CM) is a challenging task, especially when the mixed languages include dialects.

Language Modelling Transfer Learning +1

Paper
Code

EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning

no code implementations • 16 Jun 2022 • Li-Chin Chen, Po-Hsun Chen, Richard Tzong-Han Tsai, Yu Tsao

Further, the addition of noisy speech signals is observed to improve quality and intelligibility.

Speech Enhancement

Paper
Add Code

Revised JNLPBA Corpus: A Revised Version of Biomedical NER Corpus for Relation Extraction Task

no code implementations • 29 Jan 2019 • Ming-Siang Huang, Po-Ting Lai, Richard Tzong-Han Tsai, Wen-Lian Hsu

Moreover, the cross-validation test is carried out which we train the NER systems on JNLPBA/Revised JNLPBA corpora and access the performance in both protein-protein interaction extraction (PPIE) and biomedical event extraction (BEE) corpora to confirm that the newly refined Revised JNLPBA is a competent NER corpus in biomedical relation application.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

Cross-language Article Linking Using Cross-Encyclopedia Entity Embedding

no code implementations • NAACL 2018 • Chun-Kai Wu, Richard Tzong-Han Tsai

Cross-language article linking (CLAL) is the task of finding corresponding article pairs of different languages across encyclopedias.

Information Retrieval

Paper
Add Code

Question Retrieval with Distributed Representations and Participant Reputation in Community Question Answering

no code implementations • ROCLINGIJCLCLP 2017 • Sam Weng, Chun-Kai Wu, Yu-Chun Wang, Richard Tzong-Han Tsai

Community Question Answering Retrieval

Paper
Add Code

Encyclolink: A Cross-Encyclopedia,Cross-language Article-Linking System and Web-based Search Interface

no code implementations • IJCNLP 2017 • Yu-Chun Wang, Ka Ming Wong, Chun-Kai Wu, Chao-Lin Pan, Richard Tzong-Han Tsai

Cross-language article linking (CLAL) is the task of finding corresponding article pairs across encyclopedias of different languages.

Information Retrieval

Paper
Add Code

Enhancing Drug-Drug Interaction Classification with Corpus-level Feature and Classifier Ensemble

no code implementations • WS 2017 • Jing Cyun Tu, Po-Ting Lai, Richard Tzong-Han Tsai

In this paper, we presented a study that focuses on the DDI classification.

Classification Feature Engineering +3

Paper
Add Code

A Telecom-Domain Online Customer Service Assistant Based on Question Answering with Word Embedding and Intent Classification

no code implementations • IJCNLP 2017 • Jui-Yang Wang, Min-Feng Kuo, Jen-Chieh Han, Chao-Chuang Shih, Chun-Hsun Chen, Po-Ching Lee, Richard Tzong-Han Tsai

In the paper, we propose an information retrieval based (IR-based) Question Answering (QA) system to assist online customer service staffs respond users in the telecom domain.

General Classification Information Retrieval +4

Paper
Add Code

Textual Analysis for Studying Chinese Historical Documents and Literary Novels

no code implementations • 11 Oct 2015 • Chao-Lin Liu, Guan-Tao Jin, Hongsu Wang, Qing-Feng Liu, Wen-Huei Cheng, Wei-Yun Chiu, Richard Tzong-Han Tsai, Yu-Chun Wang

To showcase the potentials and challenges of computer-assisted analysis of Chinese literatures, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (JTTW), which was published in the 16th century, (2) Which was the most powerful monster in JTTW, and (3) Which major role smiled the most in the Dream of the Red Chamber, which was published in the 18th century.