no code implementations • 27 Aug 2019 • Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Sungja Choi, Inchul Hwang, Jihie Kim
Training chatbots using the reinforcement learning paradigm is challenging due to high-dimensional states, infinite action spaces and the difficulty in specifying the reward function.
no code implementations • 27 Aug 2019 • Heriberto Cuayáhuitl, Donghyeon Lee, Seonghan Ryu, Yongjin Cho, Sungja Choi, Satish Indurthi, Seunghak Yu, Hyungtak Choi, Inchul Hwang, Jihie Kim
Experimental results using chitchat data reveal that (1) near human-like dialogue policies can be induced, (2) generalisation to unseen data is a difficult problem, and (3) training an ensemble of chatbot agents is essential for improved performance over using a single agent.
no code implementations • 2 Dec 2018 • Heriberto Cuayáhuitl, Seonghan Ryu, Donghyeon Lee, Jihie Kim
The amount of dialogue history to include in a conversational agent is often underestimated and/or set in an empirical and thus possibly naive way.
no code implementations • EMNLP 2018 • Seonghan Ryu, Sangjun Koo, Hwanjo Yu, Gary Geunbae Lee
The main goal of this paper is to develop out-of-domain (OOD) detection for dialog systems.
Generative Adversarial Network Out of Distribution (OOD) Detection +1
2 code implementations • 27 Jul 2018 • Seonghan Ryu, Seokhwan Kim, Junhwi Choi, Hwanjo Yu, Gary Geunbae Lee
Then we used domain-category analysis as an auxiliary task to train neural sentence embedding for OOD sentence detection.