Search Results for author: Patrick Lehnen

Found 4 papers, 0 papers with code

Feedback Attribution for Counterfactual Bandit Learning in Multi-Domain Spoken Language Understanding

no code implementations EMNLP 2021 Tobias Falke, Patrick Lehnen

With counterfactual bandit learning, models can be trained based on positive and negative feedback received for historical predictions, with no labeled data needed.

Multi-agent Reinforcement Learning reinforcement-learning +1

Leveraging User Paraphrasing Behavior In Dialog Systems To Automatically Collect Annotations For Long-Tail Utterances

no code implementations COLING 2020 Tobias Falke, Markus Boese, Daniil Sorokin, Caglar Tirkaz, Patrick Lehnen

In large-scale commercial dialog systems, users express the same request in a wide variety of alternative ways with a long tail of less frequent alternatives.

Cannot find the paper you are looking for? You can Submit a new open access paper.