Search Results for author: Rémi Lebret

Found 18 papers, 5 papers with code

Word Embeddings through Hellinger PCA

no code implementations • 19 Dec 2013 • Rémi Lebret, Ronan Collobert

Word embeddings resulting from neural language models have been shown to be successful for a large variety of NLP tasks.

NER Word Embeddings

Rehabilitation of Count-based Models for Word Vector Representations

no code implementations • 16 Dec 2014 • Rémi Lebret, Ronan Collobert

We present a systematic study of the use of the Hellinger distance to extract semantic representations from the word co-occurrence statistics of large text corpora.

Dimensionality Reduction Word Embeddings +1

N-gram-Based Low-Dimensional Representation for Document Classification

no code implementations • 19 Dec 2014 • Rémi Lebret, Ronan Collobert

The number of features is therefore dramatically reduced and documents can be represented as a bag of semantic concepts.

Classification Clustering +4

Phrase-based Image Captioning

no code implementations • 12 Feb 2015 • Rémi Lebret, Pedro O. Pinheiro, Ronan Collobert

Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing.

Descriptive Image Captioning +1

"The Sum of Its Parts": Joint Learning of Word and Phrase Representations with Autoencoders

no code implementations • 18 Jun 2015 • Rémi Lebret, Ronan Collobert

We evaluate the quality of the word representations on several classical word evaluation tasks, and we introduce a novel task to evaluate the quality of the phrase representations.

Taxonomy Induction using Hypernym Subsequences

no code implementations • 25 Apr 2017 • Amit Gupta, Rémi Lebret, Hamza Harkous, Karl Aberer

We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms.

280 Birds with One Stone: Inducing Multilingual Taxonomies from Wikipedia using Character-level Classification

no code implementations • 25 Apr 2017 • Amit Gupta, Rémi Lebret, Hamza Harkous, Karl Aberer

We propose a simple, yet effective, approach towards inducing multilingual taxonomies from Wikipedia.

General Classification

Polisis: Automated Analysis and Presentation of Privacy Policies Using Deep Learning

2 code implementations • 7 Feb 2018 • Hamza Harkous, Kassem Fawaz, Rémi Lebret, Florian Schaub, Kang G. Shin, Karl Aberer

Companies, users, researchers, and regulators still lack usable and scalable tools to cope with the breadth and depth of privacy policies.

Language Modelling Question Answering

Weakly Supervised Active Learning with Cluster Annotation

no code implementations • 31 Dec 2018 • Fábio Perez, Rémi Lebret, Karl Aberer

In this work, we introduce a novel framework that employs cluster annotation to boost active learning by reducing the number of human interactions required to train deep neural networks.

Active Learning

Upgrading the Newsroom: An Automated Image Selection System for News Articles

no code implementations • 23 Apr 2020 • Fangyu Liu, Rémi Lebret, Didier Orel, Philippe Sordet, Karl Aberer

The system fuses multiple textual sources extracted from news articles and accepts multilingual inputs.

Image Retrieval Retrieval +2

Spoken dialect identification in Twitter using a multi-filter architecture

no code implementations • 5 Jun 2020 • Mohammadreza Banaei, Rémi Lebret, Karl Aberer

This paper presents our approach for SwissText & KONVENS 2020 shared task 2, which is a multi-stage neural model for Swiss German (GSW) identification on Twitter.

Dialect Identification Task 2

Direction is what you need: Improving Word Embedding Compression in Large Language Models

1 code implementation • ACL (RepL4NLP) 2021 • Klaudia Bałazy, Mohammadreza Banaei, Rémi Lebret, Jacek Tabor, Karl Aberer

The adoption of Transformer-based models in natural language processing (NLP) has led to great success using a massive number of parameters.

Language Modelling

Legal Transformer Models May Not Always Help

no code implementations • 14 Sep 2021 • Saibo Geng, Rémi Lebret, Karl Aberer

This work investigates the value of domain adaptive pre-training and language adapters in legal NLP tasks.

AdaGrid: Adaptive Grid Search for Link Prediction Training Objective

1 code implementation • 30 Mar 2022 • Tim Poštuvan, Jiaxuan You, Mohammadreza Banaei, Rémi Lebret, Jure Leskovec

To mitigate these limitations, we propose Adaptive Grid Search (AdaGrid), which dynamically adjusts the edge message ratio during training.

BIG-bench Machine Learning Link Prediction

An Efficient Active Learning Pipeline for Legal Text Classification

no code implementations • 15 Nov 2022 • Sepideh Mamooler, Rémi Lebret, Stéphane Massonnet, Karl Aberer

However, most AL strategies require a set of labeled samples to start with, which is expensive to acquire.

Active Learning Knowledge Distillation +2

Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models

1 code implementation • 8 Feb 2023 • Mohammadreza Banaei, Klaudia Bałazy, Artur Kasymov, Rémi Lebret, Jacek Tabor, Karl Aberer

Recent transformer language models achieve outstanding results in many natural language processing (NLP) tasks.

Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages

1 code implementation • 29 Jun 2023 • Yasmine Karoui, Rémi Lebret, Negar Foroutan, Karl Aberer

Our evaluation across three distinct tasks (image-text retrieval, visual entailment, and natural language visual reasoning) demonstrates that this approach outperforms the state-of-the-art multilingual vision-language models without requiring large parallel corpora.

Machine Translation Retrieval +3

Multilingual Text Summarization on Financial Documents

no code implementations • FNP (LREC) 2022 • Negar Foroutan, Angelika Romanou, Stéphane Massonnet, Rémi Lebret, Karl Aberer

The language models were fine-tuned on a financial document collection of three languages (English, Spanish, and Greek) and aim to identify the beginning of the summary narrative part of the document.

Abstractive Text Summarization
