no code implementations • 22 Apr 2024 • Thibault Formal, Stéphane Clinchant, Hervé Déjean, Carlos Lassance
The late interaction paradigm introduced with ColBERT stands out in the neural Information Retrieval space, offering a compelling effectiveness-efficiency trade-off across many benchmarks.
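As an illustrative sketch (not the paper's implementation), the MaxSim operator at the core of ColBERT-style late interaction scores a document by matching each query token embedding against its best document token embedding and summing:

```python
import numpy as np

def maxsim_score(query_embs, doc_embs):
    """Late-interaction relevance: for each query token embedding, take
    the maximum similarity over all document token embeddings, then sum
    over query tokens (the ColBERT MaxSim operator)."""
    # similarity matrix: (num_query_tokens, num_doc_tokens)
    sims = query_embs @ doc_embs.T
    return sims.max(axis=1).sum()

# toy example: 2 query tokens and 3 document tokens in 4 dimensions
q = np.array([[1.0, 0.0, 0.0, 0.0],
              [0.0, 1.0, 0.0, 0.0]])
d = np.array([[0.9, 0.1, 0.0, 0.0],
              [0.0, 0.8, 0.2, 0.0],
              [0.1, 0.1, 0.9, 0.0]])
score = maxsim_score(q, d)  # 0.9 + 0.8 = 1.7
```

Because documents are encoded into per-token embeddings offline, only the cheap MaxSim aggregation happens at query time, which is the source of the effectiveness-efficiency trade-off mentioned above.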
no code implementations • 20 Apr 2024 • Carlos Lassance, Hervé Déjean, Stéphane Clinchant, Nicola Tonellotto
Learned sparse models such as SPLADE have successfully shown how to incorporate the benefits of state-of-the-art neural information retrieval models into the classical inverted index data structure.
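A minimal sketch of how learned term weights plug into a classical inverted index (the weights here are hard-coded for illustration; in a SPLADE-style model they come from a neural network):

```python
from collections import defaultdict

# Hypothetical learned term weights; a real model predicts these.
doc_term_weights = {
    "d1": {"neural": 1.2, "retrieval": 0.8},
    "d2": {"inverted": 1.5, "index": 1.1, "retrieval": 0.3},
}

# Build the inverted index: term -> list of (doc_id, weight) postings.
index = defaultdict(list)
for doc_id, terms in doc_term_weights.items():
    for term, w in terms.items():
        index[term].append((doc_id, w))

def score(query_weights):
    """Dot product between sparse query and document vectors, computed
    by traversing only the postings lists of the query terms."""
    scores = defaultdict(float)
    for term, qw in query_weights.items():
        for doc_id, dw in index.get(term, []):
            scores[doc_id] += qw * dw
    return dict(scores)

results = score({"retrieval": 1.0, "index": 0.5})
# d1: 1.0*0.8 = 0.8 ; d2: 1.0*0.3 + 0.5*1.1 = 0.85
```

The point of the data structure is that scoring touches only postings for terms the query actually activates, so sparser representations mean faster retrieval.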
no code implementations • 11 Mar 2024 • Carlos Lassance, Hervé Déjean, Thibault Formal, Stéphane Clinchant
A companion to the release of the latest version of the SPLADE library.
no code implementations • 30 Nov 2023 • Haonan Chen, Carlos Lassance, Jimmy Lin
The bi-encoder architecture provides a framework for understanding machine-learned retrieval models based on dense and sparse vector representations.
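The defining property of a bi-encoder is that queries and documents are encoded independently, so document vectors can be precomputed offline. A toy sketch with a stand-in bag-of-words encoder (a real bi-encoder uses a neural network; the hashing scheme here is purely illustrative):

```python
import numpy as np

vocab = {}

def encode(text, dim=16):
    """Stand-in encoder: maps tokens to fixed slots of an L2-normalized
    vector. Only the independence of query/doc encoding matters here."""
    vec = np.zeros(dim)
    for tok in text.lower().split():
        vec[vocab.setdefault(tok, len(vocab)) % dim] += 1.0
    n = np.linalg.norm(vec)
    return vec / n if n > 0 else vec

# Documents are encoded once, offline, independently of any query.
docs = ["sparse retrieval with inverted indexes",
        "dense retrieval with nearest neighbour search"]
doc_matrix = np.stack([encode(d) for d in docs])

def search(query):
    # Relevance is the inner product between query and document vectors.
    return doc_matrix @ encode(query)

scores = search("dense retrieval")  # second document matches best
```

Swapping the vector type (dense real-valued vs. sparse vocabulary-sized) changes the index structure but not the scoring framework, which is the unifying view the snippet above refers to.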
2 code implementations • 13 Jun 2023 • Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin
BEIR is a benchmark dataset for zero-shot evaluation of information retrieval models across 18 different domain/task combinations.
1 code implementation • 5 Jun 2023 • Hervé Déjean, Stéphane Clinchant, Carlos Lassance, Simon Lupart, Thibault Formal
We compare both dense and sparse approaches under various finetuning protocols and middle training on different collections (MS MARCO, Wikipedia or Tripclick).
no code implementations • 25 Apr 2023 • Carlos Lassance, Simon Lupart, Hervé Déjean, Stéphane Clinchant, Nicola Tonellotto
Sparse neural retrievers, such as DeepImpact, uniCOIL and SPLADE, have been introduced recently as an efficient and effective way to perform retrieval with inverted indexes.
no code implementations • 25 Apr 2023 • Carlos Lassance, Stéphane Clinchant
This paper therefore reports on the importance of this issue, so that researchers are made aware of the problem and can report their results appropriately.
2 code implementations • 4 Apr 2023 • Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin
This paper presents the AToMiC (Authoring Tools for Multimedia Content) dataset, designed to advance research in image/text cross-modal retrieval.
no code implementations • 3 Apr 2023 • Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang
The advent of multilingual language models has generated a resurgence of interest in cross-lingual information retrieval (CLIR), which is the task of searching documents in one language with queries from another.
1 code implementation • 23 Mar 2023 • Vaishali Pal, Carlos Lassance, Hervé Déjean, Stéphane Clinchant
While previous studies have only experimented with dense retrievers or in a cross-lingual retrieval scenario, in this paper we aim to complete the picture on the use of adapters in IR.
no code implementations • 10 Mar 2023 • Carlos Lassance, Stéphane Clinchant
This paper describes our participation in the 2022 TREC NeuCLIR challenge.
no code implementations • 28 Feb 2023 • Carlos Lassance
This paper describes our participation in the 2023 WSDM CUP - MIRACL challenge.
no code implementations • 24 Feb 2023 • Carlos Lassance, Stéphane Clinchant
This paper describes our participation in the 2022 TREC Deep Learning challenge.
no code implementations • 25 Jan 2023 • Carlos Lassance, Hervé Déjean, Stéphane Clinchant
In this paper, we study the impact of the pretraining collection on the final IR effectiveness.
1 code implementation • 8 Jul 2022 • Carlos Lassance, Stéphane Clinchant
SPLADE efficiency can be controlled via a regularization factor, but controlling this regularization alone has been shown to be insufficient to reach the desired efficiency.
1 code implementation • 10 May 2022 • Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant
Neural retrievers based on dense representations combined with Approximate Nearest Neighbors search have recently received a lot of attention, owing their success to distillation and/or better sampling of training examples, while still relying on the same backbone architecture.
no code implementations • 14 Apr 2022 • Carlos Lassance, Thibault Formal, Stéphane Clinchant
Second, CCSA can be used as a binary quantization method and we propose to combine it with the recent graph based ANN techniques.
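A hedged sketch of the binary-quantization idea: in the paper CCSA learns the representation, whereas here a simple sign function stands in for it, and Hamming distance on the resulting bits serves as the cheap proxy distance:

```python
import numpy as np

def binarize(vectors):
    """Sign-based binary quantization: keep one bit per dimension.
    An illustrative stand-in for a learned binary embedding."""
    return (np.asarray(vectors) > 0).astype(np.uint8)

def hamming(a, b):
    # Number of differing bits; a cheap proxy for the original distance,
    # and the distance typically used inside graph-based ANN search.
    return int(np.count_nonzero(a != b))

x = np.array([0.5, -1.2, 0.3, -0.1])
y = np.array([0.4, -0.9, -0.2, 0.2])
d = hamming(binarize(x), binarize(y))  # bits differ in dims 2 and 3
```

Binary codes shrink the index (one bit per dimension instead of a 32-bit float) and Hamming distance reduces to XOR plus popcount, which is why combining quantization with graph-based ANN is attractive.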
no code implementations • 13 Dec 2021 • Carlos Lassance, Maroua Maachou, Joohee Park, Stéphane Clinchant
Our experiments show that ColBERT indexes can be pruned up to 30% on the MS MARCO passage collection without a significant drop in performance.
1 code implementation • 18 Oct 2021 • Yannis Kalantidis, Carlos Lassance, Jon Almazan, Diane Larlus
Dimensionality reduction methods are unsupervised approaches which learn low-dimensional spaces where some properties of the initial space, typically the notion of "neighborhood", are preserved.
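One simple way to make "preserving the notion of neighborhood" concrete is to measure how many k-nearest-neighbor relations survive the projection. A small sketch (the metric and the random linear projection are illustrative choices, not the paper's method):

```python
import numpy as np

def knn_indices(points, k):
    """Indices of the k nearest neighbours of each point (excluding itself)."""
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)
    return np.argsort(dists, axis=1)[:, :k]

def neighborhood_preservation(high, low, k=2):
    """Fraction of k-NN relations shared between the original space and
    the low-dimensional space; 1.0 means neighbourhoods are fully kept."""
    nn_high = knn_indices(high, k)
    nn_low = knn_indices(low, k)
    overlap = [len(set(a) & set(b)) for a, b in zip(nn_high, nn_low)]
    return sum(overlap) / (k * len(high))

rng = np.random.default_rng(0)
X = rng.normal(size=(20, 10))
P = rng.normal(size=(10, 3)) / np.sqrt(10)  # random linear projection
score = neighborhood_preservation(X, X @ P)  # value between 0 and 1
```

A dimensionality-reduction method can then be seen as maximizing such a preservation criterion (or a differentiable surrogate of it) while shrinking the number of dimensions.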
no code implementations • 8 Oct 2021 • Carlos Lassance, Myriam Bontonou, Mounia Hamidouche, Bastien Pasdeloup, Lucas Drumetz, Vincent Gripon
This chapter is composed of four main parts: tools for visualizing intermediate layers in a DNN, denoising data representations, optimizing graph objective functions and regularizing the learning process.
1 code implementation • 21 Sep 2021 • Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant
Meanwhile, there has been a growing interest in learning sparse representations for documents and queries, that could inherit from the desirable properties of bag-of-words models such as the exact matching of terms and the efficiency of inverted indexes.
no code implementations • 12 Jan 2021 • Mounia Hamidouche, Carlos Lassance, Yuqing Hu, Lucas Drumetz, Bastien Pasdeloup, Vincent Gripon
In machine learning, classifiers are typically susceptible to noise in the training data.
no code implementations • 14 Dec 2020 • Carlos Lassance
In recent years, Deep Learning methods have achieved state-of-the-art performance in a vast range of machine learning tasks, including image classification and multilingual automatic text translation.
no code implementations • 2 Dec 2020 • Vincent Gripon, Carlos Lassance, Ghouthi Boukli Hacene
Learning deep representations to solve complex machine learning tasks has become the prominent trend in the past few years.
1 code implementation • 25 Nov 2020 • Carlos Lassance, Louis Béthune, Myriam Bontonou, Mounia Hamidouche, Vincent Gripon
Measuring the generalization performance of a Deep Neural Network (DNN) without relying on a validation set is a difficult task.
no code implementations • 14 Nov 2020 • Carlos Lassance, Vincent Gripon, Antonio Ortega
However, when processing a batch of inputs concurrently, the corresponding set of intermediate representations exhibits relations (what we call a geometry) on which desired properties can be sought.
1 code implementation • 16 Jul 2020 • Carlos Lassance, Vincent Gripon, Gonzalo Mateos
Graphs are nowadays ubiquitous in the fields of signal processing and machine learning.
1 code implementation • 8 Nov 2019 • Carlos Lassance, Myriam Bontonou, Ghouthi Boukli Hacene, Vincent Gripon, Jian Tang, Antonio Ortega
Specifically, we introduce a graph-based RKD method, in which graphs are used to capture the geometry of latent spaces.
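An illustrative sketch of the graph-based relational idea (not the paper's exact formulation): instead of matching the teacher's features directly, the student matches the pairwise-similarity graph the teacher induces over a batch:

```python
import numpy as np

def pairwise_similarity_graph(features):
    """Adjacency of the batch graph: cosine similarity between every
    pair of examples' latent representations."""
    normed = features / np.linalg.norm(features, axis=1, keepdims=True)
    return normed @ normed.T

def graph_rkd_loss(teacher_feats, student_feats):
    """Relational KD objective: penalize discrepancies between the
    teacher's and the student's batch graphs. Note the two latent
    spaces may have different dimensions."""
    gt = pairwise_similarity_graph(teacher_feats)
    gs = pairwise_similarity_graph(student_feats)
    return float(np.mean((gt - gs) ** 2))

rng = np.random.default_rng(0)
teacher = rng.normal(size=(4, 16))   # teacher latent space (16-d)
student = rng.normal(size=(4, 8))    # student latent space (8-d)
loss = graph_rkd_loss(teacher, student)
```

Because only relations between examples are compared, the student's latent space need not have the same dimensionality as the teacher's, which is the main appeal of relational over feature-matching distillation.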
no code implementations • 7 Nov 2019 • Carlos Lassance, Yasir Latif, Ravi Garg, Vincent Gripon, Ian Reid
One solution to this problem is to learn a deep neural network to infer the pose of a query image after learning on a dataset of images with known poses.
no code implementations • 11 Sep 2019 • Carlos Lassance, Vincent Gripon, Jian Tang, Antonio Ortega
Deep Networks have been shown to provide state-of-the-art performance in many machine learning challenges.
no code implementations • 19 Aug 2019 • Myriam Bontonou, Carlos Lassance, Vincent Gripon, Nicolas Farrugia
Predicting the future of Graph-supported Time Series (GTS) is a key challenge in many domains, such as climate monitoring, finance or neuroimaging.
1 code implementation • 29 May 2019 • Ghouthi Boukli Hacene, Carlos Lassance, Vincent Gripon, Matthieu Courbariaux, Yoshua Bengio
In many application domains such as computer vision, Convolutional Layers (CLs) are key to the accuracy of deep learning methods.
no code implementations • 1 May 2019 • Myriam Bontonou, Carlos Lassance, Ghouthi Boukli Hacene, Vincent Gripon, Jian Tang, Antonio Ortega
We introduce a novel loss function for training deep learning architectures to perform classification.
no code implementations • 1 May 2019 • Myriam Bontonou, Carlos Lassance, Jean-Charles Vialatte, Vincent Gripon
Convolutional Neural Networks are very efficient at processing signals defined on a discrete Euclidean space (such as images).