Search Results for author: Shu-Kai Hsieh

Found 43 papers, 2 papers with code

ClassifierGuesser: A Context-based Classifier Prediction System for Chinese Language Learners

1 code implementation IJCNLP 2017 Nicole Peinelt, Maria Liakata, Shu-Kai Hsieh

Classifiers are function words that are used to express quantities in Chinese and are especially difficult for language learners.

BIG-bench Machine Learning

CogALex-V Shared Task: LOPE

no code implementations WS 2016 Kanan Luce, Jiaxing Yu, Shu-Kai Hsieh

Automatic discovery of semantically-related words is one of the most important NLP tasks, and has great impact on the theoretical psycholinguistic modeling of the mental lexicon.

Word Embeddings

Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics

no code implementations LREC 2014 Shu-Kai Hsieh

This paper aims to examine and evaluate the current development of using Web-as-Corpus (WaC) paradigm in Chinese corpus linguistics.

Chinese Word Segmentation

A realistic and robust model for Chinese word segmentation

no code implementations21 May 2019 Chu-Ren Huang, Ting-Shuo Yo, Petr Simon, Shu-Kai Hsieh

Both experiments support the claim that the WBD model is a realistic model for Chinese word segmentation as it can be easily adapted for new variants with the robust result.

Chinese Word Segmentation Segmentation

Eigencharacter: An Embedding of Chinese Character Orthography

no code implementations WS 2019 Yu-Hsiang Tseng, Shu-Kai Hsieh

Chinese characters are unique in its logographic nature, which inherently encodes world knowledge through thousands of years evolution.

World Knowledge

Do You Believe It Happened? Assessing Chinese Readers' Veridicality Judgments

no code implementations LREC 2020 Yu-Yun Chang, Shu-Kai Hsieh

A corpus annotated with Chinese readers{'} veridicality judgments is released as the Chinese PragBank for further analysis.

What confuses BERT? Linguistic Evaluation of Sentiment Analysis on Telecom Customer Opinion

no code implementations ROCLING 2021 Cing-Fang Shih, Yu-Hsiang Tseng, Ching-Wen Yang, Pin-Er Chen, Hsin-Yu Chou, Lian-Hui Tan, Tzu-Ju Lin, Chun-Wei Wang, Shu-Kai Hsieh

To investigate the factors underlying the correctness of the model’s predictions, we conduct a series of analyses, including qualitative error analysis and quantitative analysis of linguistic features with logistic regressions.

Sentence Sentiment Analysis

Keyword-centered Collocating Topic Analysis

no code implementations ROCLING 2021 Yu-Lin Chang, Yongfu Liao, Po-Ya Angela Wang, Mao-Chang Ku, Shu-Kai Hsieh

The rapid flow of information and the abundance of text data on the Internet have brought about the urgent demand for the construction of monitoring resources and techniques used for various purposes.

Word Embeddings

Augmenting Chinese WordNet semantic relations with contextualized embeddings

no code implementations GWC 2019 Yu-Hsiang Tseng, Shu-Kai Hsieh

Constructing semantic relations in WordNet has been a labour-intensive task, especially in a dynamic and fast-changing language environment.

Sinitic Wordnet: Laying the Groundwork with Chinese Varieties Written in Traditional Characters

no code implementations GWC 2018 Chih-Yao Lee, Shu-Kai Hsieh

The present work seeks to make the logographic nature of Chinese script a relevant research ground in wordnet studies.

CxLM: A Construction and Context-aware Language Model

no code implementations LREC 2022 Yu-Hsiang Tseng, Cing-Fang Shih, Pin-Er Chen, Hsin-Yu Chou, Mao-Chang Ku, Shu-Kai Hsieh

Next, an experiment is conducted on the dataset to examine to what extent a pretrained masked language model is aware of the constructions.

Language Modelling

Character Jacobian: Modeling Chinese Character Meanings with Deep Learning Model

no code implementations COLING 2022 Yu-Hsiang Tseng, Shu-Kai Hsieh

The Notch model first learns the non-linear relations between the constituents and words, and the character Jacobians further describes the character’s role in each word.

Analyzing discourse functions with acoustic features and phone embeddings: non-lexical items in Taiwan Mandarin

no code implementations ROCLING 2022 Pin-Er Chen, Yu-Hsiang Tseng, Chi-Wei Wang, Fang-Chi Yeh, Shu-Kai Hsieh

In this paper, we investigate the discourse functions of non-lexical items through their acoustic properties and the phone embeddings extracted from a deep learning model.

Exploring Affordance and Situated Meaning in Image Captions: A Multimodal Analysis

no code implementations24 May 2023 Pin-Er Chen, Po-Ya Angela Wang, Hsin-Yu Chou, Yu-Hsiang Tseng, Shu-Kai Hsieh

This paper explores the grounding issue regarding multimodal semantic representation from a computational cognitive-linguistic view.

Image Captioning Natural Language Understanding

Vec2Gloss: definition modeling leveraging contextualized vectors with Wordnet gloss

no code implementations29 May 2023 Yu-Hsiang Tseng, Mao-Chang Ku, Wei-Ling Chen, Yu-Lin Chang, Shu-Kai Hsieh

We propose a `Vec2Gloss' model, which produces the gloss from the target word's contextualized embeddings.

Lexical Retrieval Hypothesis in Multimodal Context

no code implementations28 May 2023 Po-Ya Angela Wang, Pin-Er Chen, Hsin-Yu Chou, Yu-Hsiang Tseng, Shu-Kai Hsieh

This study highlights the potential of the MultiMoco Corpus to provide an important resource for in-depth analysis and further research in multimodal communication studies.

Retrieval

Resolving Regular Polysemy in Named Entities

no code implementations18 Jan 2024 Shu-Kai Hsieh, Yu-Hsiang Tseng, Hsin-Yu Chou, Ching-Wen Yang, Yu-Yun Chang

Word sense disambiguation primarily addresses the lexical ambiguity of common words based on a predefined sense inventory.

Word Sense Disambiguation

Cannot find the paper you are looking for? You can Submit a new open access paper.