Search Results for author: Monica Sunkara

Found 19 papers, 2 papers with code

Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development

no code implementations22 May 2025 Ming Shen, Raphael Shu, Anurag Pratik, James Gung, Yubin Ge, Monica Sunkara, Yi Zhang

Overall, we demonstrate the effectiveness of our optimization method for role-based multi-agent systems tackling software development tasks evaluated on diverse evaluation dimensions, and we investigate the impact of diverse optimization settings on group behaviors of the multi-agent systems to provide practical insights for future development.

MemInsight: Autonomous Memory Augmentation for LLM Agents

no code implementations27 Mar 2025 Rana Salama, Jason Cai, Michelle Yuan, Anna Currey, Monica Sunkara, Yi Zhang, Yassine Benajiba

In this work, we propose an autonomous memory augmentation approach, MemInsight, to enhance semantic data representation and retrieval mechanisms.

Conversational Recommendation Language Modeling +5

TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues

no code implementations3 Feb 2025 Yubin Ge, Salvatore Romeo, Jason Cai, Raphael Shu, Monica Sunkara, Yassine Benajiba, Yi Zhang

Temporal reasoning in multi-session dialogues presents a significant challenge which has been under-studied in previous temporal reasoning benchmarks.

Memorization Timeline Summarization

Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications

1 code implementation6 Dec 2024 Raphael Shu, Nilaksh Das, Michelle Yuan, Monica Sunkara, Yi Zhang

For coordination capabilities, we demonstrate the effectiveness of inter-agent communication and payload referencing mechanisms, achieving end-to-end goal success rates of 90%.

RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration

no code implementations11 Nov 2024 Young-Min Cho, Raphael Shu, Nilaksh Das, Tamer Alkhouli, Yi-An Lai, Jason Cai, Monica Sunkara, Yi Zhang, Dan Roth

Through controlled experiments, we analyze how different voting rules affect decision quality and efficiency in a multi-round collaboration.

Decision Making

Inference time LLM alignment in single and multidomain preference spectrum

no code implementations24 Oct 2024 Sadat Shahriar, Zheng Qi, Nikolaos Pappas, Srikanth Doss, Monica Sunkara, Kishaloy Halder, Manuel Mager, Yassine Benajiba

Aligning Large Language Models (LLM) to address subjectivity and nuanced preference levels requires adequate flexibility and control, which can be a resource-intensive and time-consuming procedure.

Model Editing Prompt Engineering

CERET: Cost-Effective Extrinsic Refinement for Text Generation

1 code implementation8 Jun 2024 Jason Cai, Hang Su, Monica Sunkara, Igor Shalyminov, Saab Mansour

Large Language Models (LLMs) are powerful models for generation tasks, but they may not generate good quality outputs in their first attempt.

Abstractive Text Summarization Question Answering +1

SpeechVerse: A Large-scale Generalizable Audio Language Model

no code implementations14 May 2024 Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sravan Bodapati, Sundararajan Srinivasan, Kyu J Han, Katrin Kirchhoff

The models are instruction finetuned using continuous latent representations extracted from the speech foundation model to achieve optimal zero-shot performance on a diverse range of speech processing tasks using natural language instructions.

Automatic Speech Recognition Benchmarking +6

Masked Audio Text Encoders are Effective Multi-Modal Rescorers

no code implementations11 May 2023 Jinglun Cai, Monica Sunkara, Xilai Li, Anshu Bhatia, Xiao Pan, Sravan Bodapati

Masked Language Models (MLMs) have proven to be effective for second-pass rescoring in Automatic Speech Recognition (ASR) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation

no code implementations5 May 2023 Nilaksh Das, Monica Sunkara, Sravan Bodapati, Jinglun Cai, Devang Kulshreshtha, Jeff Farris, Katrin Kirchhoff

Internal language model estimation (ILME) has been proposed to mitigate this bias for autoregressive models such as attention-based encoder-decoder and RNN-T.

Decoder Domain Adaptation +2

Adaptation Approaches for Nearest Neighbor Language Models

no code implementations15 Nov 2022 Rishabh Bhardwaj, George Polovets, Monica Sunkara

Semi-parametric Nearest Neighbor Language Models ($k$NN-LMs) have produced impressive gains over purely parametric LMs, by leveraging large-scale neighborhood retrieval over external memory datastores.

Retrieval

Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting

no code implementations18 Oct 2022 Saket Dingliwal, Monica Sunkara, Sravan Bodapati, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff

End-to-end speech recognition models trained using joint Connectionist Temporal Classification (CTC)-Attention loss have gained popularity recently.

Decoder speech-recognition +1

Remember the context! ASR slot error correction through memorization

no code implementations10 Sep 2021 Dhanush Bekal, Ashish Shenoy, Monica Sunkara, Sravan Bodapati, Katrin Kirchhoff

Accurate recognition of slot values such as domain specific words or named entities by automatic speech recognition (ASR) systems forms the core of the Goal-oriented Dialogue Systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Adapting Long Context NLM for ASR Rescoring in Conversational Agents

no code implementations21 Apr 2021 Ashish Shenoy, Sravan Bodapati, Monica Sunkara, Srikanth Ronanki, Katrin Kirchhoff

Neural Language Models (NLM), when trained and evaluated with context spanning multiple utterances, have been shown to consistently outperform both conventional n-gram language models and NLMs that use limited context.

intent-classification Intent Classification +2

Neural Inverse Text Normalization

no code implementations12 Feb 2021 Monica Sunkara, Chaitanya Shivade, Sravan Bodapati, Katrin Kirchhoff

We propose an efficient and robust neural solution for ITN leveraging transformer based seq2seq models and FST-based text normalization techniques for data preparation.

Text Normalization

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

no code implementations3 Aug 2020 Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff

Experiments conducted on the Fisher corpus show that our proposed approach achieves ~6-9% and ~3-4% absolute improvement (F1 score) over the baseline BLSTM model on reference transcripts and ASR outputs respectively.

Data Augmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.