Search Results for author: Yihong Zhao

Found 2 papers, 1 papers with code

Off-policy Learning for Multiple Loggers

no code implementations23 Jul 2019 Li He, Long Xia, Wei Zeng, Zhi-Ming Ma, Yihong Zhao, Dawei Yin

To make full use of such historical data, learning policies from multiple loggers becomes necessary.

counterfactual

Explicit State Tracking with Semi-Supervision for Neural Dialogue Generation

2 code implementations31 Aug 2018 Xisen Jin, Wenqiang Lei, Zhaochun Ren, Hongshen Chen, Shangsong Liang, Yihong Zhao, Dawei Yin

However, the \emph{expensive nature of state labeling} and the \emph{weak interpretability} make the dialogue state tracking a challenging problem for both task-oriented and non-task-oriented dialogue generation: For generating responses in task-oriented dialogues, state tracking is usually learned from manually annotated corpora, where the human annotation is expensive for training; for generating responses in non-task-oriented dialogues, most of existing work neglects the explicit state tracking due to the unlimited number of dialogue states.

Decoder Dialogue Generation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.