Spoken Dialogue Systems

19 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Spoken Dialogue Systems models and implementations
4 papers
204

Most implemented papers

Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

andy194673/nlg-scvae WS 2018

Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling.

A dataset for resolving referring expressions in spoken dialogue via contextual query rewrites (CQR)

alexa/alexa-dataset-contextual-query-rewrite 28 Mar 2019

In this paper, we describe our methodology for creating the query reformulation extension to the dialog corpus, and present an initial set of experiments to establish a baseline for the CQR task.

Hierarchical Multi-Task Natural Language Understanding for Cross-domain Conversational AI: HERMIT NLU

RasaHQ/rasa WS 2019

We present a new neural architecture for wide-coverage Natural Language Understanding in Spoken Dialogue Systems.

Modeling ASR Ambiguity for Dialogue State Tracking Using Word Confusion Networks

kolk/MODELING-ASR-AMBIGUITY-FOR-NEURAL-DIALOGUE-STATE-TRACKING-USING-WORD-CONFUSION-NETWORKS 3 Feb 2020

Spoken dialogue systems typically use a list of top-N ASR hypotheses for inferring the semantic meaning and tracking the state of the dialogue.

"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations

alexa/alexa-with-dstc10-track2-dataset 28 Sep 2021

Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets.

EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification

PolyAI-LDN/evi-paper Findings (NAACL) 2022

Knowledge-based authentication is crucial for task-oriented spoken dialogue systems that offer personalised and privacy-focused services.

When can I Speak? Predicting initiation points for spoken dialogue agents

siyan-sylvia-li/icarus_final SIGDIAL (ACL) 2022

Current spoken dialogue systems initiate their turns after a long period of silence (700-1000ms), which leads to little real-time feedback, sluggish responses, and an overall stilted conversational flow.

OLISIA: a Cascade System for Spoken Dialogue State Tracking

orange-opensource/olisia-dstc11 20 Apr 2023

Though Dialogue State Tracking (DST) is a core component of spoken dialogue systems, recent work on this task mostly deals with chat corpora, disregarding the discrepancies between spoken and written language. In this paper, we propose OLISIA, a cascade system which integrates an Automatic Speech Recognition (ASR) model and a DST model.

Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model

XinyuZhou2000/Spoken_Dialogue 20 Sep 2023

This paper explores the potential of constructing an AI spoken dialogue system that "thinks how to respond" and "thinks how to speak" simultaneously, which more closely aligns with the human speech production process compared to the current cascade pipeline of independent chatbot and Text-to-Speech (TTS) modules.