no code implementations • 17 Nov 2023 • Ruohong Zhang, Luyu Gao, Chen Zheng, Zhen Fan, Guokun Lai, Zheng Zhang, Fangzhou Ai, Yiming Yang, Hongxia Yang
This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources, and the adaptive training of a chatbot with domain-specific inquiries.
no code implementations • 18 Sep 2020 • Guokun Lai, Zihang Dai, Yiming Yang
In contrast, there is a large-scale of parallel corpus created by humans on the Internet.
3 code implementations • NeurIPS 2020 • Zihang Dai, Guokun Lai, Yiming Yang, Quoc V. Le
With the success of language pretraining, it is highly desirable to develop more efficient architectures of good scalability that can exploit the abundant unlabeled data at a lower cost.
Ranked #6 on Reading Comprehension on RACE
1 code implementation • 24 Apr 2020 • Ruohong Zhang, Yu Hao, Donghan Yu, Wei-Cheng Chang, Guokun Lai, Yiming Yang
Keywords: Multivariate Time Series, Change-point Detection, Graph Neural Networks
1 code implementation • 16 Sep 2019 • Guokun Lai, Barlas Oguz, Yiming Yang, Veselin Stoyanov
We consider the setting of semi-supervised cross-lingual understanding, where labeled data is available in a source language (English), but only unlabeled data is available in the target language.
1 code implementation • NeurIPS 2019 • Zihang Dai, Guokun Lai, Yiming Yang, Shinjae Yoo
With latent variables, stochastic recurrent models have achieved state-of-the-art performance in modeling sound-wave sequence.
1 code implementation • 15 Jun 2018 • Guokun Lai, Bohan Li, Guoqing Zheng, Yiming Yang
In this paper, we combine the ideas from both stochastic latent variables and dilated convolutions, and propose a new architecture to model sequential data, termed as Stochastic WaveNet, where stochastic latent variables are injected into the WaveNet structure.
no code implementations • ICLR 2018 • Qizhe Xie, Guokun Lai, Zihang Dai, Eduard Hovy
Cloze test is widely adopted in language exams to evaluate students' language proficiency.
no code implementations • ICLR 2018 • Guokun Lai, Hanxiao Liu, Yiming Yang
Convolution Neural Network (CNN) has gained tremendous success in computer vision tasks with its outstanding ability to capture the local latent features.
no code implementations • EMNLP 2018 • Qizhe Xie, Guokun Lai, Zihang Dai, Eduard Hovy
Cloze tests are widely adopted in language exams to evaluate students' language proficiency.
no code implementations • 31 Oct 2017 • Guokun Lai, Hanxiao Liu, Yiming Yang
Convolution Neural Network (CNN) has gained tremendous success in computer vision tasks with its outstanding ability to capture the local latent features.
2 code implementations • EMNLP 2017 • Guokun Lai, Qizhe Xie, Hanxiao Liu, Yiming Yang, Eduard Hovy
We present RACE, a new dataset for benchmark evaluation of methods in the reading comprehension task.
19 code implementations • 21 Mar 2017 • Guokun Lai, Wei-Cheng Chang, Yiming Yang, Hanxiao Liu
Multivariate time series forecasting is an important machine learning problem across many domains, including predictions of solar plant energy output, electricity consumption, and traffic jam situation.
Ranked #1 on Univariate Time Series Forecasting on Solar-Power