Search Results for author: Luis Lastras

Found 14 papers, 2 papers with code

Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities

no code implementations13 May 2025 George Saon, Avihu Dekel, Alexander Brooks, Tohru Nagano, Abraham Daniels, Aharon Satt, Ashish Mittal, Brian Kingsbury, David Haws, Edmilson Morais, Gakuto Kurata, Hagai Aronowitz, Ibrahim Ibrahim, Jeff Kuo, Kate Soule, Luis Lastras, Masayuki Suzuki, Ron Hoory, Samuel Thomas, Sashi Novitasari, Takashi Fukuda, Vishal Sunder, Xiaodong Cui, Zvi Kons

The speech-specific components are: a conformer acoustic encoder using block attention and self-conditioning trained with connectionist temporal classification, a windowed query-transformer speech modality adapter used to do temporal downsampling of the acoustic embeddings and map them to the LLM text embedding space, and LoRA adapters to further fine-tune the text LLM.

automatic-speech-translation Benchmarking

Putting It All into Context: Simplifying Agents with LCLMs

no code implementations12 May 2025 Mingjian Jiang, Yangjun Ruan, Luis Lastras, Pavan Kapanipathi, Tatsunori Hashimoto

Recent advances in language model (LM) agents have demonstrated significant potential for automating complex real-world tasks.

All Language Modeling +1

Activated LoRA: Fine-tuned LLMs for Intrinsics

1 code implementation16 Apr 2025 Kristjan Greenewald, Luis Lastras, Thomas Parnell, Vraj Shah, Lucian Popa, Giulio Zizzo, Chulaka Gunasekara, Ambrish Rawat, David Cox

This change crucially allows aLoRA to accept the base model's KV cache of the input string, meaning that aLoRA can be instantly activated whenever needed in a chain without recomputing the cache.

Granite Embedding Models

no code implementations27 Feb 2025 Parul Awasthy, Aashka Trivedi, Yulong Li, Mihaela Bornea, David Cox, Abraham Daniels, Martin Franz, Gabe Goodhart, Bhavani Iyer, Vishwajeet Kumar, Luis Lastras, Scott McCarley, Rudra Murthy, Vignesh P, Sara Rosenthal, Salim Roukos, Jaydeep Sen, Sukriti Sharma, Avirup Sil, Kate Soule, Arafat Sultan, Radu Florian

We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities.

Information Retrieval Knowledge Distillation +1

A Non-autoregressive Model for Joint STT and TTS

no code implementations15 Jan 2025 Vishal Sunder, Brian Kingsbury, George Saon, Samuel Thomas, Slava Shechtman, Hagai Aronowitz, Eric Fosler-Lussier, Luis Lastras

In this paper, we take a step towards jointly modeling automatic speech recognition (STT) and speech synthesis (TTS) in a fully non-autoregressive way.

Automatic Speech Recognition speech-recognition +2

Formally Specifying the High-Level Behavior of LLM-Based Agents

no code implementations12 Oct 2023 Maxwell Crouse, Ibrahim Abdelaziz, Ramon Astudillo, Kinjal Basu, Soham Dan, Sadhana Kumaravel, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Luis Lastras

We demonstrate how the proposed framework can be used to implement recent LLM-based agents (e. g., ReACT), and show how the flexibility of our approach can be leveraged to define a new agent with more complex behavior, the Plan-Act-Summarize-Solve (PASS) agent.

Question Answering

End-to-End Spoken Language Understanding Without Full Transcripts

no code implementations30 Sep 2020 Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

For our speech-to-entities experiments on the ATIS corpus, both the CTC and attention models showed impressive ability to skip non-entity words: there was little degradation when trained on just entities versus full transcripts.

Decoder slot-filling +4

Implicit Discourse Relation Classification: We Need to Talk about Evaluation

no code implementations ACL 2020 Najoung Kim, Song Feng, Chulaka Gunasekara, Luis Lastras

Implicit relation classification on Penn Discourse TreeBank (PDTB) 2. 0 is a common benchmark task for evaluating the understanding of discourse relations.

Classification General Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.