no code implementations • 12 Jun 2024 • Iwen E. Kang, Christophe Van Gysel, Man-Hung Siu
Voice assistants increasingly use on-device Automatic Speech Recognition (ASR) to ensure speed and privacy.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 10 Jun 2024 • Sonal Sannigrahi, Thiago Fraga-Silva, Youssef Oualil, Christophe Van Gysel
In this paper, we provide a preliminary exploration of the use of Large Language Models (LLMs) to generate synthetic queries that are complementary to template-based methods.
no code implementations • 2 Nov 2023 • Youyuan Zhang, Sashank Gondala, Thiago Fraga-Silva, Christophe Van Gysel
On-device Virtual Assistants (VAs) powered by Automatic Speech Recognition (ASR) require effective knowledge integration for the challenging entity-rich query recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 25 Apr 2023 • Christophe Van Gysel
Virtual assistants are becoming increasingly important speech-driven Information Retrieval platforms that assist users with various tasks.
no code implementations • 29 Jun 2022 • Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin
Virtual assistants make use of automatic speech recognition (ASR) to help users answer entity-centric queries.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 21 Jun 2021 • Mandana Saebi, Ernest Pusateri, Aaksha Meghawat, Christophe Van Gysel
High-quality automatic speech recognition (ASR) is essential for virtual assistants (VAs) to work well.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 14 Feb 2021 • Sashank Gondala, Lyan Verwimp, Ernest Pusateri, Manos Tsagkias, Christophe Van Gysel
We customize entropy pruning by allowing for a keep list of infrequent n-grams that require a more relaxed pruning threshold, and propose three methods to construct the keep list.
no code implementations • 26 May 2020 • Christophe Van Gysel, Manos Tsagkias, Ernest Pusateri, Ilya Oparin
We focus on improving the effectiveness of a Virtual Assistant (VA) in recognizing emerging entities in spoken queries.
no code implementations • 26 Aug 2019 • Ernest Pusateri, Christophe Van Gysel, Rami Botros, Sameer Badaskar, Mirko Hannemann, Youssef Oualil, Ilya Oparin
In this work, we uncover a theoretical connection between two language model interpolation techniques, count merging and Bayesian interpolation.
no code implementations • 16 Nov 2017 • Christophe Van Gysel
Search engines rely heavily on term-based approaches that represent queries and documents as bags of words.
no code implementations • 17 Oct 2017 • Christophe Van Gysel, Bhaskar Mitra, Matteo Venanzi, Roy Rosemarin, Grzegorz Kukla, Piotr Grudzien, Nicola Cancedda
Email responses often contain items-such as a file or a hyperlink to an external document-that are attached to or included inline in the body of the message.
4 code implementations • 9 Aug 2017 • Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas
We propose the Neural Vector Space Model (NVSM), a method that learns representations of documents in an unsupervised manner for news article retrieval.
1 code implementation • 25 Jul 2017 • Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas
We discover how clusterings of experts correspond to committees in organizations, the ability of expert representations to encode the co-author graph, and the degree to which they encode academic rank.
no code implementations • 13 Jul 2017 • Tom Kenter, Alexey Borisov, Christophe Van Gysel, Mostafa Dehghani, Maarten de Rijke, Bhaskar Mitra
Machine learning plays a role in many aspects of modern IR systems, and deep learning is applied in all of them.
1 code implementation • 12 Jun 2017 • Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas
Unsupervised learning of low-dimensional, semantic representations of words and entities has recently gained attention.
1 code implementation • 3 Jan 2017 • Christophe Van Gysel, Evangelos Kanoulas, Maarten de Rijke
We introduce pyndri, a Python interface to the Indri search engine.
2 code implementations • 25 Aug 2016 • Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas
We introduce a novel latent vector space model that jointly learns the latent representations of words, e-commerce products and a mapping between the two without the need for explicit annotations.
1 code implementation • 23 Aug 2016 • Christophe Van Gysel, Evangelos Kanoulas, Maarten de Rijke
Lexical query modeling has been the leading paradigm for session search.
1 code implementation • 23 Aug 2016 • Christophe Van Gysel, Maarten de Rijke, Marcel Worring
We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches.