Search Results for author: Benjamin Clavié

Found 10 papers, 1 papers with code

Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling

no code implementations23 Sep 2024 Benjamin Clavié, Antoine Chaffin, Griffin Adams

This method can reduce the space & memory footprint of ColBERT indexes by 50% with virtually no retrieval performance degradation.

Retrieval

rerankers: A Lightweight Python Library to Unify Ranking Methods

1 code implementation30 Aug 2024 Benjamin Clavié

This paper presents rerankers, a Python library which provides an easy-to-use interface to the most commonly used re-ranking approaches.

Re-Ranking Retrieval

JaColBERTv2.5: Optimising Multi-Vector Retrievers to Create State-of-the-Art Japanese Retrievers with Constrained Resources

no code implementations30 Jul 2024 Benjamin Clavié

Neural Information Retrieval has advanced rapidly in high-resource languages, but progress in lower-resource ones such as Japanese has been hindered by data scarcity, among other challenges.

Information Retrieval Retrieval

Towards Better Monolingual Japanese Retrievers with Multi-Vector Models

no code implementations26 Dec 2023 Benjamin Clavié

As language-specific training data tends to be sparsely available compared to English, document retrieval in many languages has been largely relying on multilingual models.

Retrieval

Large Language Models as Batteries-Included Zero-Shot ESCO Skills Matchers

no code implementations7 Jul 2023 Benjamin Clavié, Guillaume Soulié

We generate synthetic training data for the entirety of ESCO skills and train a classifier to extract skill mentions from job posts.

Re-Ranking

Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification

no code implementations13 Mar 2023 Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell

Furthermore, we observe that the wording of the prompt is a critical factor in eliciting the appropriate "reasoning" in the model, and that seemingly minor aspects of the prompt significantly affect the model's performance.

Job classification Prompt Engineering +3

Deep Embeddings of Contextual Assessment Data for Improving Performance Prediction

no code implementations13 Jul 2020 Benjamin Clavié, K. Gal

We introduce DeepPerfEmb, or DPE, a new deep-learning model that captures dense representations of students’ online behaviour and meta-data about students and educational content.

EduBERT: Pretrained Deep Language Models for Learning Analytics

no code implementations2 Dec 2019 Benjamin Clavié, Kobi Gal

The use of large pretrained neural networks to create contextualized word embeddings has drastically improved performance on several natural language processing (NLP) tasks.

text-classification Text Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.