1 code implementation • COLING 2022 • Baoxin Wang, Xingyi Duan, Dayong Wu, Wanxiang Che, Zhigang Chen, Guoping Hu
The Chinese text correction (CTC) focuses on detecting and correcting Chinese spelling errors and grammatical errors.
1 code implementation • 16 Aug 2024 • Xuanliang Zhang, Dingzirui Wang, Longxu Dou, Baoxin Wang, Dayong Wu, Qingfu Zhu, Wanxiang Che
Most existing methods employ a fixed tabular format to represent the table, which could limit the performance.
no code implementations • 13 Aug 2024 • Dayong Wu, Jiaqi Li, Baoxin Wang, Honghong Zhao, Siyuan Xue, Yanjie Yang, Zhijun Chang, Rui Zhang, Li Qian, Bo wang, Shijin Wang, Zhixiong Zhang, Guoping Hu
Large language models (LLMs) have shown remarkable achievements across various language tasks. To enhance the performance of LLMs in scientific literature services, we developed the scientific literature LLM (SciLit-LLM) through pre-training and supervised fine-tuning on scientific literature, building upon the iFLYTEK Spark LLM.
1 code implementation • 25 Jun 2024 • YiXuan Wang, Baoxin Wang, Yijun Liu, Qingfu Zhu, Dayong Wu, Wanxiang Che
Nowadays, data augmentation through synthetic data has been widely used in the field of Grammatical Error Correction (GEC) to alleviate the problem of data scarcity.
1 code implementation • 26 Mar 2024 • YiXuan Wang, Baoxin Wang, Yijun Liu, Dayong Wu, Wanxiang Che
In this light, we propose the LM-Combiner, a rewriting model that can directly modify the over-correction of GEC system outputs without a model ensemble.
no code implementations • 9 May 2023 • Bo Sun, Baoxin Wang, YiXuan Wang, Wanxiang Che, Dayong Wu, Shijin Wang, Ting Liu
Our experiments show that powerful pre-trained models perform poorly on this corpus.
1 code implementation • 11 Aug 2022 • Honghong Zhao, Baoxin Wang, Dayong Wu, Wanxiang Che, Zhigang Chen, Shijin Wang
In this paper, we present an overview of the CTC 2021, a Chinese text correction task for native speakers.
no code implementations • 15 Apr 2022 • Bo Sun, Baoxin Wang, Wanxiang Che, Dayong Wu, Zhigang Chen, Ting Liu
These errors have been studied extensively and are relatively simple for humans.
no code implementations • COLING 2022 • Ziqing Yang, Zihang Xu, Yiming Cui, Baoxin Wang, Min Lin, Dayong Wu, Zhigang Chen
It covers Standard Chinese, Yue Chinese, and six other ethnic minority languages.
no code implementations • 10 Feb 2022 • Baoxin Wang, Qingye Meng, Ziyue Wang, Honghong Zhao, Dayong Wu, Wanxiang Che, Shijin Wang, Zhigang Chen, Cong Liu
Knowledge graph embedding (KGE) models learn the representation of entities and relations in knowledge graphs.
Ranked #4 on Link Property Prediction on ogbl-wikikg2
no code implementations • 1 Oct 2020 • Shaolei Wang, Baoxin Wang, Jiefu Gong, Zhongyuan Wang, Xiao Hu, Xingyi Duan, Zizhuo Shen, Gang Yue, Ruiji Fu, Dayong Wu, Wanxiang Che, Shijin Wang, Guoping Hu, Ting Liu
Grammatical error diagnosis is an important task in natural language processing.
no code implementations • 19 Dec 2019 • Xingyi Duan, Baoxin Wang, Ziyue Wang, Wentao Ma, Yiming Cui, Dayong Wu, Shijin Wang, Ting Liu, Tianxiang Huo, Zhen Hu, Heng Wang, Zhiyuan Liu
We present a Chinese judicial reading comprehension (CJRC) dataset which contains approximately 10K documents and almost 50K questions with answers.
no code implementations • IJCNLP 2019 • Ziyue Wang, Baoxin Wang, Xingyi Duan, Dayong Wu, Shijin Wang, Guoping Hu, Ting Liu
To our knowledge, IFlyLegal is the first Chinese legal system that employs up-to-date NLP techniques and caters for needs of different user groups, such as lawyers, judges, procurators, and clients.
no code implementations • ACL 2018 • Baoxin Wang
Recurrent neural network (RNN) has achieved remarkable performance in text categorization.
Ranked #2 on Text Classification on Yahoo! Answers