no code implementations • EMNLP (IWSLT) 2019 • Mei Tu, Wei Liu, Lijie Wang, Xiao Chen, Xue Wen
We propose layer-tied self-attention for end-to-end speech translation.
no code implementations • EMNLP 2020 • Lijie Wang, Ao Zhang, Kun Wu, Ke Sun, Zhenghua Li, Hua Wu, Min Zhang, Haifeng Wang
This paper describes in detail the construction process and data statistics of DuSQL.
no code implementations • 2 Jun 2024 • Liang Zhao, Tianwen Wei, Liang Zeng, Cheng Cheng, Liu Yang, Peng Cheng, Lijie Wang, Chenxia Li, Xuejie Wu, Bo Zhu, Yimeng Gan, Rui Hu, Shuicheng Yan, Han Fang, Yahui Zhou
We introduce LongSkywork, a long-context Large Language Model (LLM) capable of processing up to 200, 000 tokens.
1 code implementation • 22 Dec 2023 • Qi Xu, Lijie Wang, Jing Wang, Lin Cheng, Song Chen, Yi Kang
In recent years, analog circuits have received extensive attention and are widely used in many emerging applications.
no code implementations • 1 Nov 2023 • Xiaoyue Wang, Xin Liu, Lijie Wang, Yaoxiang Wang, Jinsong Su, Hua Wu
Then, we pair each sample with a bias indicator representing its bias degree, and use these extended samples to train a sample generator.
1 code implementation • 30 Oct 2023 • Tianwen Wei, Liang Zhao, Lichang Zhang, Bo Zhu, Lijie Wang, Haihua Yang, Biye Li, Cheng Cheng, Weiwei Lü, Rui Hu, Chenxia Li, Liu Yang, Xilin Luo, Xuejie Wu, Lunan Liu, Wenjun Cheng, Peng Cheng, Jianhao Zhang, XiaoYu Zhang, Lei Lin, Xiaokun Wang, Yutuan Ma, Chuanhai Dong, Yanqi Sun, Yifu Chen, Yongyi Peng, Xiaojuan Liang, Shuicheng Yan, Han Fang, Yahui Zhou
In this technical report, we present Skywork-13B, a family of large language models (LLMs) trained on a corpus of over 3. 2 trillion tokens drawn from both English and Chinese texts.
1 code implementation • 25 Oct 2023 • Liu Yang, Haihua Yang, Wenjun Cheng, Lei Lin, Chenxia Li, Yifu Chen, Lunan Liu, Jianfei Pan, Tianwen Wei, Biye Li, Liang Zhao, Lijie Wang, Bo Zhu, Guoliang Li, Xuejie Wu, Xilin Luo, Rui Hu
Large language models (LLMs) have shown great potential to solve varieties of natural language processing (NLP) tasks, including mathematical reasoning.
1 code implementation • 2 Jun 2023 • Xiaoyue Wang, Lijie Wang, Xin Liu, Suhang Wu, Jinsong Su, Hua Wu
In this way, the top-layer sentence representation will be trained to ignore the common biased features encoded by the low-layer sentence representation and focus on task-relevant unbiased features.
no code implementations • 26 Aug 2022 • Saihao Huang, Lijie Wang, Zhenghua Li, Zeyang Liu, Chenhui Dou, Fukang Yan, Xinyan Xiao, Hua Wu, Min Zhang
As the first session-level Chinese dataset, CHASE contains two separate parts, i. e., 2, 003 sessions manually constructed from scratch (CHASE-C), and 3, 456 sessions translated from English SParC (CHASE-T).
no code implementations • 28 Jul 2022 • Yaozong Shen, Lijie Wang, Ying Chen, Xinyan Xiao, Jing Liu, Hua Wu
To fill in the gap, we propose a novel evaluation benchmark providing with both English and Chinese annotated data.
no code implementations • 23 May 2022 • Lijie Wang, Yaozong Shen, Shuyuan Peng, Shuai Zhang, Xinyan Xiao, Hao liu, Hongxuan Tang, Ying Chen, Hua Wu, Haifeng Wang
Based on this benchmark, we conduct experiments on three typical models with three saliency methods, and unveil their strengths and weakness in terms of interpretability.
no code implementations • 26 Apr 2022 • Kun Wu, Lijie Wang, Zhenghua Li, Xinyan Xiao
Grammar-based parsers have achieved high performance in the cross-domain text-to-SQL parsing task, but suffer from low decoding efficiency due to the much larger number of actions for grammar selection than that of tokens in SQL queries.
no code implementations • 30 Aug 2021 • Lijie Wang, Hao liu, Shuyuan Peng, Hongxuan Tang, Xinyan Xiao, Ying Chen, Hua Wu, Haifeng Wang
Therefore, in order to systematically evaluate the factors for building trustworthy systems, we propose a novel and well-annotated sentiment analysis dataset to evaluate robustness and interpretability.
1 code implementation • EMNLP 2021 • Kun Wu, Lijie Wang, Zhenghua Li, Ao Zhang, Xinyan Xiao, Hua Wu, Min Zhang, Haifeng Wang
For better distribution matching, we require that at least 80% of SQL patterns in the training data are covered by generated queries.
2 code implementations • 2 Sep 2020 • Shuai Zhang, Lijie Wang, Ke Sun, Xinyan Xiao
DDParser is extended on the graph-based biaffine parser to accommodate to the characteristics of Chinese dataset.
no code implementations • 5 Jul 2020 • Lijie Wang, Xueting Wang, Toshihiko Yamasaki
The spread of social networking services has created an increasing demand for selecting, editing, and generating impressive images.
no code implementations • WS 2020 • Yimeng Zhuang, Yuan Zhang, Lijie Wang
This paper describes the LIT Team{'}s submission to the IWSLT2020 open domain translation task, focusing primarily on Japanese-to-Chinese translation direction.