Search Results for author: Richard Tzong-Han Tsai

Found 24 papers, 1 papers with code

BRCC and SentiBahasaRojak: The First Bahasa Rojak Corpus for Pretraining and Sentiment Analysis Dataset

no code implementations COLING 2022 Nanda Putri Romadhona, Sin-En Lu, Bo-Han Lu, Richard Tzong-Han Tsai

Finally, to test the effectiveness of the Mixed XLM model pre-trained on BRCC for social media scenarios where code-mixing is found frequently, we compile a new Bahasa Rojak sentiment analysis dataset, SentiBahasaRojak, with a Kappa value of 0. 77.

Data Augmentation Sentiment Analysis +1

Verdict Inference with Claim and Retrieved Elements Using RoBERTa

no code implementations EMNLP (FEVER) 2021 In-Zu Gi, Ting-Yu Fang, Richard Tzong-Han Tsai

Automatic fact verification has attracted recent research attention as the increasing dissemination of disinformation on social media platforms.

Fact Verification Retrieval

Enhancing Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems

no code implementations18 Mar 2024 Bo-Han Lu, Yi-Hsuan Lin, En-Shiun Annie Lee, Richard Tzong-Han Tsai

We employ a pre-trained LLaMA2-7B model specialized in Traditional Mandarin Chinese to leverage the orthographic similarities between Taiwanese Hokkien Han and Traditional Mandarin Chinese.

Machine Translation Translation

SMUTF: Schema Matching Using Generative Tags and Hybrid Features

no code implementations22 Jan 2024 Yu Zhang, Mei Di, Haozheng Luo, Chenwei Xu, Richard Tzong-Han Tsai

Recognizing the lack of extensive, publicly available datasets for SM, we have created and open-sourced the HDXSM dataset from the public humanitarian data.

Feature Engineering Humanitarian

Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages

no code implementations7 Oct 2023 Shih-Cheng Huang, Pin-Zu Li, Yu-Chi Hsu, Kuang-Ming Chen, Yu Tung Lin, Shih-Kai Hsiao, Richard Tzong-Han Tsai, Hung-Yi Lee

By simply adding the chat vector to a continual pre-trained model's weights, we can endow the model with chat capabilities in new languages without the need for further training.

Instruction Following

Large Language Models on the Chessboard: A Study on ChatGPT's Formal Language Comprehension and Complex Reasoning Skills

no code implementations29 Aug 2023 Mu-Tien Kuo, Chih-Chung Hsueh, Richard Tzong-Han Tsai

This paper probes the performance of ChatGPT, a sophisticated language model by OpenAI in tackling such complex reasoning tasks, using chess as a case study.

Decision Making Language Modelling

Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien

1 code implementation21 Jan 2023 Sin-En Lu, Bo-Han Lu, Chao-Yi Lu, Richard Tzong-Han Tsai

In natural language processing (NLP), code-mixing (CM) is a challenging task, especially when the mixed languages include dialects.

Language Modelling Transfer Learning +1

Revised JNLPBA Corpus: A Revised Version of Biomedical NER Corpus for Relation Extraction Task

no code implementations29 Jan 2019 Ming-Siang Huang, Po-Ting Lai, Richard Tzong-Han Tsai, Wen-Lian Hsu

Moreover, the cross-validation test is carried out which we train the NER systems on JNLPBA/Revised JNLPBA corpora and access the performance in both protein-protein interaction extraction (PPIE) and biomedical event extraction (BEE) corpora to confirm that the newly refined Revised JNLPBA is a competent NER corpus in biomedical relation application.

named-entity-recognition Named Entity Recognition +3

Cross-language Article Linking Using Cross-Encyclopedia Entity Embedding

no code implementations NAACL 2018 Chun-Kai Wu, Richard Tzong-Han Tsai

Cross-language article linking (CLAL) is the task of finding corresponding article pairs of different languages across encyclopedias.

Information Retrieval

Textual Analysis for Studying Chinese Historical Documents and Literary Novels

no code implementations11 Oct 2015 Chao-Lin Liu, Guan-Tao Jin, Hongsu Wang, Qing-Feng Liu, Wen-Huei Cheng, Wei-Yun Chiu, Richard Tzong-Han Tsai, Yu-Chun Wang

To showcase the potentials and challenges of computer-assisted analysis of Chinese literatures, we explored some interesting yet non-trivial questions about two of the Four Great Classical Novels of China: (1) Which monsters attempted to consume the Buddhist monk Xuanzang in the Journey to the West (JTTW), which was published in the 16th century, (2) Which was the most powerful monster in JTTW, and (3) Which major role smiled the most in the Dream of the Red Chamber, which was published in the 18th century.

Cannot find the paper you are looking for? You can Submit a new open access paper.