Search Results for author: Hai Hu

Found 17 papers, 9 papers with code

Building a Treebank for Chinese Literature for Translation Studies

no code implementations • TLT (ACL) 2020 • Hai Hu, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Sandra Kuebler, Chien-Jer Charles Lin

Translation

Paper
Add Code

SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully

1 code implementation • 11 Jan 2024 • Jushi Kai, Hai Hu, Zhouhan Lin

Therefore, we propose to ''highlight'' the factual information by selecting the tokens with the lowest probabilities and concatenating them to the original context, thus forcing the model to repeatedly read and hesitate on these tokens before generation.

Hallucination Text Generation

Paper
Code

MELA: Multilingual Evaluation of Linguistic Acceptability

no code implementations • 15 Nov 2023 • Ziyin Zhang, Yikang Liu, Weifang Huang, Junyu Mao, Rui Wang, Hai Hu

Recent benchmarks for Large Language Models (LLMs) have mostly focused on application-driven tasks such as complex reasoning and code generation, and this has led to a scarcity in purely linguistic evaluation of LLMs.

Code Generation Cross-Lingual Transfer +3

Paper
Add Code

Revisiting Acceptability Judgements

1 code implementation • 23 May 2023 • Hai Hu, Ziyin Zhang, Weifang Huang, Jackie Yan-Ki Lai, Aini Li, Yina Patterson, Jiahui Huang, Peng Zhang, Chien-Jer Charles Lin, Rui Wang

We introduce CoLAC - Corpus of Linguistic Acceptability in Chinese, the first large-scale acceptability dataset for a non-Indo-European language.

Cross-Lingual Transfer Linguistic Acceptability

Paper
Code

ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models

2 code implementations • 16 Apr 2023 • Yikang Liu, Ziyin Zhang, Wanyang Zhang, Shisen Yue, Xiaojing Zhao, Xinyuan Cheng, Yiwen Zhang, Hai Hu

To address these challenges in English language teaching, we first present ArguGPT, a balanced corpus of 4, 038 argumentative essays generated by 7 GPT models in response to essay prompts from three sources: (1) in-class or homework exercises, (2) TOEFL and (3) GRE writing tasks.

Sentence

Paper
Code

Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference

1 code implementation • Findings (ACL) 2021 • Hai Hu, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Ma, Yanting Li, Yixin Nie, Kyle Richardson

These results, however, come with important caveats: cross-lingual models often perform best when trained on a mixture of English and high-quality monolingual NLI data (OCNLI), and are often hindered by automatically translated resources (XNLI-zh).

Cross-Lingual Transfer Natural Language Inference +2

Paper
Code

OCNLI: Original Chinese Natural Language Inference

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Hai Hu, Kyle Richardson, Liang Xu, Lu Li, Sandra Kuebler, Lawrence S. Moss

In this paper, we present the first large-scale NLI dataset (consisting of ~56, 000 annotated sentence pairs) for Chinese called the Original Chinese Natural Language Inference dataset (OCNLI).

Natural Language Inference Sentence +1

135

Paper
Code

CLUE: A Chinese Language Understanding Evaluation Benchmark

3 code implementations • COLING 2020 • Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu, Zhe Zhao, Qipeng Zhao, Cong Yue, Xinrui Zhang, Zhengliang Yang, Kyle Richardson, Zhenzhong Lan

The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE allows new NLU models to be evaluated across a diverse set of tasks.

General Classification Machine Reading Comprehension +4

3,823

Paper
Code

MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity

1 code implementation • SCiL 2020 • Hai Hu, Qi Chen, Kyle Richardson, Atreyee Mukherjee, Lawrence S. Moss, Sandra Kuebler

We present a new logic-based inference engine for natural language inference (NLI) called MonaLog, which is based on natural logic and the monotonicity calculus.

Data Augmentation Natural Language Inference

Paper
Code

Probing Natural Language Inference Models through Semantic Fragments

3 code implementations • 16 Sep 2019 • Kyle Richardson, Hai Hu, Lawrence S. Moss, Ashish Sabharwal

Our experiments, using a library of 8 such semantic fragments, reveal two remarkable findings: (a) State-of-the-art models, including BERT, that are pre-trained on existing NLI benchmark datasets perform poorly on these new fragments, even though the phenomena probed here are central to the NLI task.

Natural Language Inference

Paper
Code

Ensemble Methods to Distinguish Mainland and Taiwan Chinese

no code implementations • WS 2019 • Hai Hu, Wen Li, He Zhou, Zuoyu Tian, Yiwen Zhang, Liang Zou

This paper describes the IUCL system at VarDial 2019 evaluation campaign for the task of discriminating between Mainland and Taiwan variation of mandarin Chinese.

Word Embeddings

Paper
Add Code

Natural Language Inference with Monotonicity

no code implementations • WS 2019 • Hai Hu, Qi Chen, Larry Moss

This paper describes a working system which performs natural language inference using polarity-marked parse trees.

Natural Language Inference

Paper
Add Code

Polarity Computations in Flexible Categorial Grammar

1 code implementation • SEMEVAL 2018 • Hai Hu, Larry Moss

This paper shows how to take parse trees in CCG and algorithmically find the polarities of all the constituents.

Paper
Code

Detecting Syntactic Features of Translated Chinese

no code implementations • WS 2018 • Hai Hu, Wen Li, Sandra Kübler

We present a machine learning approach to distinguish texts translated to Chinese (by humans) from texts originally written in Chinese, with a focus on a wide range of syntactic features.

Translation

Paper
Add Code

Path of Vowel Raising in Chengdu Dialect of Mandarin

no code implementations • 11 Mar 2018 • Hai Hu, Yiwen Zhang

He and Rao (2013) reported a raising phenomenon of /a/ in /Xan/ (X being a consonant or a vowel) in Chengdu dialect of Mandarin, i. e. /a/ is realized as [epsilon] for young speakers but [ae] for older speakers, but they offered no acoustic analysis.

Paper
Add Code

Is China Entering WTO or shijie maoyi zuzhi--a Corpus Study of English Acronyms in Chinese Newspapers

no code implementations • 18 Nov 2017 • Hai Hu

This is one of the first studies that quantitatively examine the usage of English acronyms (e. g. WTO) in Chinese texts.

Translation

Paper
Add Code

Non-Deterministic Segmentation for Chinese Lattice Parsing

no code implementations • RANLP 2017 • Hai Hu, Daniel Dakota, S K{\"u}bler, ra

Parsing Chinese critically depends on correct word segmentation for the parser since incorrect segmentation inevitably causes incorrect parses.

Morphological Analysis Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.