Search Results for author: Arash Eshghi

Found 32 papers, 9 papers with code

Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments

no code implementations • SIGDIAL (ACL) 2022 • Alessandro Suglia, Bhathiya Hemanthage, Malvina Nikandrou, George Pantazopoulos, Amit Parekh, Arash Eshghi, Claudio Greco, Ioannis Konstas, Oliver Lemon, Verena Rieser

We demonstrate EMMA, an embodied multimodal agent which has been developed for the Alexa Prize SimBot challenge.

Conditional Text Generation

Paper
Add Code

Dialogue Act and Slot Recognition in Italian Complex Dialogues

no code implementations • EURALI (LREC) 2022 • Irene Sucameli, Michele De Quattro, Arash Eshghi, Alessandro Suglia, Maria Simi

Since the advent of Transformer-based, pretrained language models (LM) such as BERT, Natural Language Understanding (NLU) components in the form of Dialogue Act Recognition (DAR) and Slot Recognition (SR) for dialogue systems have become both more accurate and easier to create for specific application domains.

Natural Language Understanding

Paper
Add Code

Combine to Describe: Evaluating Compositional Generalization in Image Captioning

no code implementations • ACL 2022 • George Pantazopoulos, Alessandro Suglia, Arash Eshghi

Compositionality – the ability to combine simpler concepts to understand & generate arbitrarily more complex conceptual structures – has long been thought to be the cornerstone of human language capacity.

Image Captioning

Paper
Add Code

Incremental Graph-Based Semantics and Reasoning for Conversational AI

no code implementations • ReInAct 2021 • Angus Addlesee, Arash Eshghi

The next generation of conversational AI systems need to: (1) process language incrementally, token-by-token to be more responsive and enable handling of conversational phenomena such as pauses, restarts and self-corrections; (2) reason incrementally allowing meaning to be established beyond what is said; (3) be transparent and controllable, allowing designers as well as the system itself to easily establish reasons for particular behaviour and tailor to particular user groups, or domains.

Paper
Add Code

Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers

1 code implementation • 21 Apr 2024 • Georgios Pantazopoulos, Alessandro Suglia, Oliver Lemon, Arash Eshghi

In this paper, we use \textit{diagnostic classifiers} to measure the extent to which the visual prompt produced by the resampler encodes spatial information.

Paper
Code

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

no code implementations • 7 Nov 2023 • Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon, Alessandro Suglia

Interactive and embodied tasks pose at least two fundamental challenges to existing Vision & Language (VL) models, including 1) grounding language in trajectories of actions and observations, and 2) referential disambiguation.

Text Generation

Paper
Add Code

Learning to generate and corr- uh I mean repair language in real-time

1 code implementation • 22 Aug 2023 • Arash Eshghi, Arash Ashrafzadeh

We further do a zero-shot evaluation of the ability of the same model to generate self-repairs when the generation goal changes mid-utterance.

Paper
Code

No that's not what I meant: Handling Third Position Repair in Conversational Question Answering

1 code implementation • 31 Jul 2023 • Vevake Balaraman, Arash Eshghi, Ioannis Konstas, Ioannis Papaioannou

We demonstrate the usefulness of the data by training and evaluating strong baseline models for executing TPRs.

Conversational Question Answering Position

Paper
Code

'What are you referring to?' Evaluating the Ability of Multi-Modal Dialogue Models to Process Clarificational Exchanges

1 code implementation • 28 Jul 2023 • Javier Chiyah-Garcia, Alessandro Suglia, Arash Eshghi, Helen Hastie

Referential ambiguities arise in dialogue when a referring expression does not uniquely identify the intended referent for the addressee.

Referring Expression

Paper
Code

The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering

no code implementations • 25 May 2023 • Sabrina Chiesurin, Dimitris Dimakopoulos, Marco Antonio Sobrevilla Cabezudo, Arash Eshghi, Ioannis Papaioannou, Verena Rieser, Ioannis Konstas

Large language models are known to produce output which sounds fluent and convincing, but is also often wrong, e. g. "unfaithful" with respect to a rationale as retrieved from a knowledge base.

Conversational Question Answering Open-Domain Question Answering

Paper
Add Code

Exploring Multi-Modal Representations for Ambiguity Detection & Coreference Resolution in the SIMMC 2.0 Challenge

2 code implementations • 25 Feb 2022 • Javier Chiyah-Garcia, Alessandro Suglia, José Lopes, Arash Eshghi, Helen Hastie

Anaphoric expressions, such as pronouns and referential descriptions, are situated with respect to the linguistic context of prior turns, as well as, the immediate visual environment.

coreference-resolution

Paper
Code

A Study of Automatic Metrics for the Evaluation of Natural Language Explanations

1 code implementation • EACL 2021 • Miruna Clinciu, Arash Eshghi, Helen Hastie

As transparency becomes key for robotics and AI, it will be necessary to evaluate the methods through which transparency is provided, including automatically generated natural language (NL) explanations.

nlg evaluation Text Generation

Paper
Code

A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI

2 code implementations • COLING 2020 • Angus Addlesee, Yanchao Yu, Arash Eshghi

Automatic Speech Recognition (ASR) systems are increasingly powerful and more accurate, but also more numerous with several options existing currently as a service (e. g. Google, IBM, and Microsoft).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Code

Data-Efficient Goal-Oriented Conversation with Dialogue Knowledge Transfer Networks

no code implementations • IJCNLP 2019 • Igor Shalyminov, Sungjin Lee, Arash Eshghi, Oliver Lemon

Our main dataset is the Stanford Multi-Domain dialogue corpus.

Dialogue Generation Few-Shot Learning +2

Paper
Add Code

Current Challenges in Spoken Dialogue Systems and Why They Are Critical for Those Living with Dementia

no code implementations • 14 Sep 2019 • Angus Addlesee, Arash Eshghi, Ioannis Konstas

Dialogue technologies such as Amazon's Alexa have the potential to transform the healthcare industry.

speech-recognition Speech Recognition +1

Paper
Add Code

Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach

no code implementations • WS 2019 • Igor Shalyminov, Sungjin Lee, Arash Eshghi, Oliver Lemon

Learning with minimal data is one of the key challenges in the development of practical, production-ready goal-oriented dialogue systems.

Dialogue Generation Goal-Oriented Dialogue Systems +2

Paper
Add Code

Benchmarking Natural Language Understanding Services for building Conversational Agents

8 code implementations • 13 Mar 2019 • Xingkun Liu, Arash Eshghi, Pawel Swietojanski, Verena Rieser

We have recently seen the emergence of several publicly available Natural Language Understanding (NLU) toolkits, which map user utterances to structured, but more abstract, Dialogue Act (DA) or Intent specifications, while making this process accessible to the lay developer.

Benchmarking General Classification +3

17,946

Paper
Code

Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems

no code implementations • 8 Oct 2018 • Igor Shalyminov, Arash Eshghi, Oliver Lemon

To test the model's generalisation potential, we evaluate the same model on the bAbI+ dataset, without any additional training.

Multi-Task Learning

Paper
Add Code

Learning how to learn: an adaptive dialogue agent for incrementally learning visually grounded word meanings

no code implementations • WS 2017 • Yanchao Yu, Arash Eshghi, Oliver Lemon

We present an optimised multi-modal dialogue agent for interactive learning of visually grounded word meanings from a human tutor, trained on real human-human tutoring data.

Reinforcement Learning (RL)

Paper
Add Code

The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings

no code implementations • WS 2017 • Yanchao Yu, Arash Eshghi, Gregory Mills, Oliver Joseph Lemon

We motivate and describe a new freely available human-human dialogue dataset for interactive learning of visually grounded word meanings through ostensive definition by a tutor to a learner.

Attribute Sentence

Paper
Add Code

Training an adaptive dialogue policy for interactive learning of visually grounded word meanings

no code implementations • WS 2016 • Yanchao Yu, Arash Eshghi, Oliver Lemon

We present a multi-modal dialogue system for interactive learning of perceptually grounded word meanings from a human tutor.

Semantic Parsing

Paper
Add Code

Bootstrapping incremental dialogue systems from minimal data: the generalisation power of dialogue grammars

no code implementations • EMNLP 2017 • Arash Eshghi, Igor Shalyminov, Oliver Lemon

Our experiments show that our model can process 74% of the Facebook AI bAbI dataset even when trained on only 0. 13% of the data (5 dialogues).

Dialogue Management Management +3

Paper
Add Code

Challenging Neural Dialogue Models with Natural Data: Memory Networks Fail on Incremental Phenomena

1 code implementation • 22 Sep 2017 • Igor Shalyminov, Arash Eshghi, Oliver Lemon

Results show that the semantic accuracy of the MemN2N model drops drastically; and that although it is in principle able to learn to process the constructions in bAbI+, it needs an impractical amount of training data to do so.

Retrieval Sentence

Paper
Code

VOILA: An Optimised Dialogue System for Interactively Learning Visually-Grounded Word Meanings (Demonstration System)

no code implementations • WS 2017 • Yanchao Yu, Arash Eshghi, Oliver Lemon

We present VOILA: an optimised, multi-modal dialogue agent for interactive learning of visually grounded word meanings from a human user.

Active Learning

Paper
Add Code

Feedback relevance spaces: The organisation of increments in conversation

no code implementations • WS 2017 • Christine Howes, Arash Eshghi

Paper
Add Code

Bootstrapping incremental dialogue systems: using linguistic knowledge to learn from minimal data

no code implementations • 1 Dec 2016 • Dimitrios Kalatzis, Arash Eshghi, Oliver Lemon

We present a method for inducing new dialogue systems from very small amounts of unannotated dialogue data, showing how word-level exploration using Reinforcement Learning (RL), combined with an incremental and semantic grammar - Dynamic Syntax (DS) - allows systems to discover, generate, and understand many new dialogue variants.

Dialogue Management Management +2