Search Results for author: Alan Dao

Found 1 papers, 1 papers with code

Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

1 code implementation20 Oct 2024 Alan Dao, Dinh Bach Vu, Huy Hoang Ha

Large Language Models (LLMs) have revolutionized natural language processing, but their application to speech-based tasks remains challenging due to the complexities of integrating audio and text modalities.

Question Answering speech-recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.