no code implementations • LTEDI (ACL) 2022 • Xiaotian Lin, Yingwen Fu, Ziyu Yang, Nankai Lin, Shengyi Jiang
In this paper, we report the solution of the team BERT 4EVER for the LT-EDI-2022 shared task 2: Homophobia/Transphobia Detection in social media comments at ACL 2022, which aims to classify YouTube comments into one of the following categories: no, moderate, or severe depression.
no code implementations • LT4HALA (LREC) 2022 • Hailin Zhang, Ziyu Yang, Yingwen Fu, Ruoyao Ding
In addition, we apply a series of training strategies based on the provided ancient Chinese pre-trained model to improve model performance.
Chinese Word Segmentation • Cultural Vocal Bursts Intensity Prediction
no code implementations • Findings (ACL) 2022 • Yingwen Fu, Wenjie Ou, Zhou Yu, Yue Lin
Unsupervised constrained text generation aims to generate text under a given set of constraints without any supervised data.
no code implementations • 2 Feb 2023 • Xiaotian Lin, Nankai Lin, Yingwen Fu, Ziyu Yang, Shengyi Jiang
In this paper, we propose a novel self-training selection framework with two selectors that select high-quality samples from data augmentation.
no code implementations • 19 Dec 2022 • Yingwen Fu, Wenjie Ou, Zhou Yu, Yue Lin
Conversational text-to-SQL is designed to translate multi-turn natural language questions into their corresponding SQL queries.
no code implementations • 6 Apr 2022 • Yingwen Fu, Jinyi Chen, Nankai Lin, Xixuan Huang, Xinying Qiu, Shengyi Jiang
The Yunshan Cup 2020 track focused on creating a framework for evaluating different methods of part-of-speech (POS) tagging.
no code implementations • 2 Apr 2022 • Yingwen Fu, Nankai Lin, Ziyu Yang, Shengyi Jiang
In this paper, we describe our novel dual-contrastive framework ConCNER for cross-lingual NER under the scenario of limited source-language labeled data.
1 code implementation • 2 Apr 2022 • Nankai Lin, Yingwen Fu, Xiaotian Lin, Aimin Yang, Shengyi Jiang
In the distillation XABSA task, we further explore the comparative effectiveness of different data (source dataset, translated dataset, and code-switched dataset).
Aspect-Based Sentiment Analysis (ABSA)
no code implementations • LREC 2022 • Nankai Lin, Yingwen Fu, Chuwei Chen, Ziyu Yang, Shengyi Jiang
In this work, we construct a text classification dataset to alleviate the resource-scarce situation of the Lao language.
no code implementations • 3 Sep 2021 • Yingwen Fu, Nankai Lin, Zhihe Yang, Shengyi Jiang
In this work, we propose a dataset construction framework, which is based on labeled datasets of homologous languages and iterative optimization, to build a Malay NER dataset (MYNER) comprising 28,991 sentences (over 384 thousand tokens).