Search Results for author: Benjamin Piwowarski

Found 31 papers, 13 papers with code

Choisir le bon co-équipier pour la génération coopérative de texte (Choosing The Right Teammate For Cooperative Text Generation)

no code implementations JEP/TALN/RECITAL 2022 Antoine Chaffin, Vincent Claveau, Ewa Kijak, Sylvain Lamprier, Benjamin Piwowarski, Thomas Scialom, Jacopo Staiano

Nous évaluons leurs avantages et inconvénients, en explorant leur précision respective sur des tâches de classification, ainsi que leur impact sur la génération coopérative et leur coût de calcul, dans le cadre d’une stratégie de décodage état de l’art, basée sur une recherche arborescente de Monte-Carlo (MCTS).

Text Generation

Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders

no code implementations27 Jun 2024 Mathias Vast, Basile Van Cooten, Laure Soulier, Benjamin Piwowarski

With the recent addition of Retrieval-Augmented Generation (RAG), the scope and importance of Information Retrieval (IR) has expanded.

Information Retrieval RAG +1

What Makes Multimodal In-Context Learning Work?

1 code implementation24 Apr 2024 Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski

Large Language Models have demonstrated remarkable performance across various tasks, exhibiting the capacity to swiftly acquire new skills, such as through In-Context Learning (ICL) with minimal demonstration examples.

In-Context Learning

Training Table Question Answering via SQL Query Decomposition

no code implementations19 Feb 2024 Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier

Table Question-Answering involves both understanding the natural language query and grounding it in the context of the input table to extract the relevant information.

Question Answering Semantic Parsing

Simple Domain Adaptation for Sparse Retrievers

no code implementations21 Jan 2024 Mathias Vast, Yuxuan Zong, Basile Van Cooten, Benjamin Piwowarski, Laure Soulier

In Information Retrieval, and more generally in Natural Language Processing, adapting models to specific domains is conducted through fine-tuning.

Domain Adaptation Information Retrieval +1

Query Performance Prediction for Neural IR: Are We There Yet?

1 code implementation20 Feb 2023 Guglielmo Faggioli, Thibault Formal, Stefano Marchesin, Stéphane Clinchant, Nicola Ferro, Benjamin Piwowarski

On top of that, in lexical-oriented scenarios, QPPs fail to predict performance for neural IR systems on those queries where they differ from traditional approaches the most.

Passage Retrieval Retrieval

LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization

2 code implementations26 Jan 2023 Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano

Text Summarization is a popular task and an active area of research for the Natural Language Processing community.

Text Summarization

CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval

1 code implementation11 Jan 2023 Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, Laure Soulier

Conversational search is a difficult task as it aims at retrieving documents based not only on the current user query but also on the full conversation history.

Conversational Search Information Retrieval +2

From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective

1 code implementation10 May 2022 Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant

Neural retrievers based on dense representations combined with Approximate Nearest Neighbors search have recently received a lot of attention, owing their success to distillation and/or better sampling of examples for training -- while still relying on the same backbone architecture.

Language Modelling Representation Learning

Which Discriminator for Cooperative Text Generation?

1 code implementation25 Apr 2022 Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

Language models generate texts by successively predicting probability distributions for next tokens given past ones.

Language Modelling Text Generation

Generative Cooperative Networks for Natural Language Generation

no code implementations28 Jan 2022 Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation.

Image Generation Text Generation

Match Your Words! A Study of Lexical Matching in Neural Information Retrieval

no code implementations10 Dec 2021 Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant

Neural Information Retrieval models hold the promise to replace lexical matching models, e. g. BM25, in modern search engines.

Information Retrieval Retrieval

SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval

1 code implementation21 Sep 2021 Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant

Meanwhile, there has been a growing interest in learning \emph{sparse} representations for documents and queries, that could inherit from the desirable properties of bag-of-words models such as the exact matching of terms and the efficiency of inverted indexes.

Information Retrieval Retrieval +1

Skim-Attention: Learning to Focus via Document Layout

1 code implementation Findings (EMNLP) 2021 Laura Nguyen, Thomas Scialom, Jacopo Staiano, Benjamin Piwowarski

Motivated by human reading strategies, this paper presents Skim-Attention, a new attention mechanism that takes advantage of the structure of the document and its layout.

document understanding Language Modelling

SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking

1 code implementation12 Jul 2021 Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant

In neural Information Retrieval, ongoing research is directed towards improving the first retriever in ranking pipelines.

Information Retrieval Open-Domain Question Answering +1

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

no code implementations NeurIPS 2021 Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods.

Question Generation Question-Generation +1

Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation

2 code implementations EMNLP 2021 Clément Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten, Patrick Gallinari

QuestEval is a reference-less metric used in text-to-text tasks, that compares the generated summaries directly to the source text, by automatically asking and answering questions.

Data-to-Text Generation Question Generation +1

QuestEval: Summarization Asks for Fact-based Evaluation

1 code implementation EMNLP 2021 Thomas Scialom, Paul-Alexis Dray, Patrick Gallinari, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang

Summarization evaluation remains an open research problem: current metrics such as ROUGE are known to be limited and to correlate poorly with human judgments.

Question Answering

A White Box Analysis of ColBERT

no code implementations17 Dec 2020 Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant

Transformer-based models are nowadays state-of-the-art in ad-hoc Information Retrieval, but their behavior is far from being understood.

Ad-Hoc Information Retrieval Information Retrieval +1

Transductive Zero-Shot Learning using Cross-Modal CycleGAN

no code implementations13 Nov 2020 Patrick Bordes, Eloi Zablocki, Benjamin Piwowarski, Patrick Gallinari

We show the efficiency of our Cross-Modal CycleGAN model (CM-GAN) on the ImageNet T-ZSL task where we obtain state-of-the-art results.

Sentence Zero-Shot Learning

Discriminative Adversarial Search for Abstractive Summarization

1 code implementation ICML 2020 Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

We introduce a novel approach for sequence decoding, Discriminative Adversarial Search (DAS), which has the desirable properties of alleviating the effects of exposure bias without requiring external metrics.

Abstractive Text Summarization Domain Adaptation

Incorporating Visual Semantics into Sentence Representations within a Grounded Space

no code implementations IJCNLP 2019 Patrick Bordes, Eloi Zablocki, Laure Soulier, Benjamin Piwowarski, Patrick Gallinari

To overcome this limitation, we propose to transfer visual information to textual representations by learning an intermediate representation space: the grounded space.

Sentence

Answers Unite! Unsupervised Metrics for Reinforced Summarization Models

2 code implementations IJCNLP 2019 Thomas Scialom, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

Abstractive summarization approaches based on Reinforcement Learning (RL) have recently been proposed to overcome classical likelihood maximization.

Abstractive Text Summarization Question Answering +2

Self-Attention Architectures for Answer-Agnostic Neural Question Generation

no code implementations ACL 2019 Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano

Neural architectures based on self-attention, such as Transformers, recently attracted interest from the research community, and obtained significant improvements over the state of the art in several tasks.

Question Generation Question-Generation +1

Context-Aware Zero-Shot Learning for Object Recognition

no code implementations24 Apr 2019 Eloi Zablocki, Patrick Bordes, Benjamin Piwowarski, Laure Soulier, Patrick Gallinari

Zero-Shot Learning (ZSL) aims at classifying unlabeled objects by leveraging auxiliary knowledge, such as semantic representations.

Object Object Recognition +1

Learning Multi-Modal Word Representation Grounded in Visual Context

no code implementations9 Nov 2017 Éloi Zablocki, Benjamin Piwowarski, Laure Soulier, Patrick Gallinari

Representing the semantics of words is a long-standing problem for the natural language processing community.

Word Embeddings

Efficient Document Indexing Using Pivot Tree

no code implementations21 May 2016 Gaurav Singh, Benjamin Piwowarski

We present a novel method for efficiently searching top-k neighbors for documents represented in high dimensional space of terms based on the cosine similarity.

Retrieval

Parameterized Neural Network Language Models for Information Retrieval

no code implementations6 Oct 2015 Benjamin Piwowarski, Sylvain Lamprier, Nicolas Despres

Although they present good abilities to cope with both term dependencies and vocabulary mismatch problems, thanks to the distributed representation of words they are based upon, such models could not be used readily in IR, where the estimation of one language model per document (or query) is required.

Information Retrieval Language Modelling +1

Cannot find the paper you are looking for? You can Submit a new open access paper.