Search Results for author: Yihong Zhao

Found 2 papers, 1 papers with code

Off-policy Learning for Multiple Loggers

no code implementations23 Jul 2019 Li He, Long Xia, Wei Zeng, Zhi-Ming Ma, Yihong Zhao, Dawei Yin

To make full use of such historical data, learning policies from multiple loggers becomes necessary.

counterfactual

Explicit State Tracking with Semi-Supervision for Neural Dialogue Generation

2 code implementations31 Aug 2018 Xisen Jin, Wenqiang Lei, Zhaochun Ren, Hongshen Chen, Shangsong Liang, Yihong Zhao, Dawei Yin

However, the \emph{expensive nature of state labeling} and the \emph{weak interpretability} make the dialogue state tracking a challenging problem for both task-oriented and non-task-oriented dialogue generation: For generating responses in task-oriented dialogues, state tracking is usually learned from manually annotated corpora, where the human annotation is expensive for training; for generating responses in non-task-oriented dialogues, most of existing work neglects the explicit state tracking due to the unlimited number of dialogue states.

Dialogue Generation Dialogue State Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.