Search Results for author: Anthony Sicilia

Found 20 papers, 10 papers with code

Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems

no code implementations28 Jan 2025 Mert İnan, Anthony Sicilia, Suvodip Dey, Vardhan Dongre, Tejas Srinivasan, Jesse Thomason, Gökhan Tür, Dilek Hakkani-Tür, Malihe Alikhani

While theories of discourse and cognitive science have long recognized the value of unhurried pacing, recent dialogue research tends to minimize friction in conversational systems.

Decision Making Friction

Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI

no code implementations10 Jan 2025 Yuya Asano, Sabit Hassan, Paras Sharma, Anthony Sicilia, Katherine Atwell, Diane Litman, Malihe Alikhani

Evaluated in home improvement and cooking domains with real-world users, our method improves recall and F1 of correction by 34% and 16%, respectively, while maintaining precision and false positive rate.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

An Active Learning Framework for Inclusive Generation by Large Language Models

no code implementations17 Oct 2024 Sabit Hassan, Anthony Sicilia, Malihe Alikhani

We address this challenge with a novel clustering-based active learning framework, enhanced with knowledge distillation.

Active Learning Clustering +3

Accounting for Sycophancy in Language Model Uncertainty Estimation

no code implementations17 Oct 2024 Anthony Sicilia, Mert Inan, Malihe Alikhani

From these results, we argue that externalizing both model and user uncertainty can help to mitigate the impacts of sycophancy bias.

Language Modeling Language Modelling +2

Generating Signed Language Instructions in Large-Scale Dialogue Systems

no code implementations17 Oct 2024 Mert İnan, Katherine Atwell, Anthony Sicilia, Lorna Quandt, Malihe Alikhani

We introduce a goal-oriented conversational AI system enhanced with American Sign Language (ASL) instructions, presenting the first implementation of such a system on a worldwide multimodal conversational AI platform.

Retrieval Text Generation +1

Eliciting Uncertainty in Chain-of-Thought to Mitigate Bias against Forecasting Harmful User Behaviors

no code implementations17 Oct 2024 Anthony Sicilia, Malihe Alikhani

For instance, it can be applied in social media moderation to predict harmful user behaviors before they occur, allowing for preventative interventions.

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

no code implementations14 Oct 2024 Sabit Hassan, Anthony Sicilia, Malihe Alikhani

Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing systems.

Active Learning

Evaluating Theory of (an uncertain) Mind: Predicting the Uncertain Beliefs of Others in Conversation Forecasting

no code implementations23 Sep 2024 Anthony Sicilia, Malihe Alikhani

Typically, when evaluating Theory of Mind, we consider the beliefs of others to be binary: held or not held.

Learning to Generate Equitable Text in Dialogue from Biased Training Data

1 code implementation10 Jul 2023 Anthony Sicilia, Malihe Alikhani

Absence of equitable and inclusive principles can hinder the formation of common ground, which in turn negatively impacts the overall performance of the system.

Decision Making Fairness +1

HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations

1 code implementation23 May 2023 Anthony Sicilia, Jennifer C. Gates, Malihe Alikhani

While demographic factors like age and gender change the way people talk, and in particular, the way people talk to machines, there is little investigation into how large pre-trained language models (LMs) can adapt to these changes.

Memorization

LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue

3 code implementations14 Oct 2022 Anthony Sicilia, Malihe Alikhani

From this insight, we propose a new algorithm, and empirically, we demonstrate our proposal improves both task-success and human-likeness of the generated text.

Diversity Model Selection +1

Modeling Non-Cooperative Dialogue: Theoretical and Empirical Insights

1 code implementation15 Jul 2022 Anthony Sicilia, Tristan Maidment, Pat Healy, Malihe Alikhani

We use the tools of learning theory to develop a theoretical model for identifying non-cooperative interlocutors and apply this theory to analyze different communication strategies.

Learning Theory

PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners

1 code implementation12 Jul 2022 Anthony Sicilia, Katherine Atwell, Malihe Alikhani, Seong Jae Hwang

Multiclass neural networks are a common tool in modern unsupervised domain adaptation, yet an appropriate theoretical description for their non-uniform sample complexity is lacking in the adaptation literature.

Unsupervised Domain Adaptation

Test-time Fourier Style Calibration for Domain Generalization

1 code implementation13 May 2022 Xingchen Zhao, Chang Liu, Anthony Sicilia, Seong Jae Hwang, Yun Fu

Thus, it is still possible that those methods can overfit to source domains and perform poorly on target domains.

Domain Generalization

The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser Error

4 code implementations Findings (ACL) 2022 Katherine Atwell, Anthony Sicilia, Seong Jae Hwang, Malihe Alikhani

Our results not only motivate our proposal and help us to understand its limitations, but also provide insight on the properties of discourse models and datasets which improve performance in domain adaptation.

Discourse Parsing Domain Adaptation +1

PAC Bayesian Performance Guarantees for Deep (Stochastic) Networks in Medical Imaging

1 code implementation12 Apr 2021 Anthony Sicilia, Xingchen Zhao, Anastasia Sosnovskikh, Seong Jae Hwang

Application of deep neural networks to medical imaging tasks has in some sense become commonplace.

Cannot find the paper you are looking for? You can Submit a new open access paper.