Search Results for author: Amir Hazem

Found 29 papers, 3 papers with code

Cross-lingual and Cross-domain Transfer Learning for Automatic Term Extraction from Low Resource Data

no code implementations • LREC 2022 • Amir Hazem, Merieme Bouhandi, Florian Boudin, Beatrice Daille

Automatic Term Extraction (ATE) is a key component for domain knowledge understanding and an important basis for further natural language processing applications.

Term Extraction Transfer Learning

Paper
Add Code

Data Selection for Bilingual Lexicon Induction from Specialized Comparable Corpora

no code implementations • COLING 2020 • Martin Laville, Amir Hazem, Emmanuel Morin, Phillippe Langlais

In this paper, we contrast several data selection techniques to improve bilingual lexicon induction from specialized comparable corpora.

Bilingual Lexicon Induction Data Augmentation +1

Paper
Add Code

Hierarchical Text Segmentation for Medieval Manuscripts

1 code implementation • COLING 2020 • Amir Hazem, Beatrice Daille, Dominique Stutzmann, Christopher Kermorvant, Louis Chevalier

In this paper, we address the segmentation of books of hours, Latin devotional manuscripts of the late Middle Ages, that exhibit challenging issues: a complex hierarchical entangled structure, variable content, noisy transcriptions with no sentence markers, and strong correlations between sections for which topical information is no longer sufficient to draw segmentation boundaries.

Hierarchical Text Segmentation Segmentation +2

Paper
Code

Towards Automatic Thesaurus Construction and Enrichment.

no code implementations • LREC 2020 • Amir Hazem, Beatrice Daille, Lanza Claudia

Thesaurus construction with minimum human efforts often relies on automatic methods to discover terms and their relations.

Paper
Add Code

TermEval 2020: TALN-LS2N System for Automatic Term Extraction

no code implementations • LREC 2020 • Amir Hazem, Bouh, M{\'e}rieme i, Florian Boudin, Beatrice Daille

Automatic terminology extraction is a notoriously difficult task aiming to ease effort demanded to manually identify terms in domain-specific corpora by automatically providing a ranked list of candidate terms.

Term Extraction

Paper
Add Code

TALN/LS2N Participation at the BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora

no code implementations • LREC 2020 • Martin Laville, Amir Hazem, Emmanuel Morin

This paper describes the TALN/LS2N system participation at the Building and Using Comparable Corpora (BUCC) shared task.

Paper
Add Code

Meta-Embedding Sentence Representation for Textual Similarity

no code implementations • RANLP 2019 • Amir Hazem, Hern, Nicolas ez

In this paper, we propose a systematic study of the impact of the main word embedding models on sentence representation.

Question Similarity Sentence +2

Paper
Add Code

Tweaks and Tricks for Word Embedding Disruptions

no code implementations • RANLP 2019 • Amir Hazem, Hern, Nicolas ez

In this paper, we introduce the concept of disruption which we define as a side effect of the training process of embedding models.

Descriptive Sentence +3

Paper
Add Code

Towards Automatic Variant Analysis of Ancient Devotional Texts

no code implementations • WS 2019 • Amir Hazem, B{\'e}atrice Daille, Dominique Stutzmann, Jacob Currie, Christine Jacquin

Based on the manual observation of 772 Obsecro Te copies which show more than 21, 000 variants, we show that the proposed methodology is helpful for an automatic study of variants and may serve as basis to analyze and to depict useful information from devotional texts.

Paper
Add Code

R\'eutilisation de Textes dans les Manuscrits Anciens (Text Reuse in Ancient Manuscripts)

no code implementations • JEPTALNRECITAL 2019 • Amir Hazem, B{\'e}atrice Daille, Dominique Stutzmann, Jacob Currie, Christine Jacquin

Nous nous int{\'e}ressons dans cet article {\`a} la probl{\'e}matique de r{\'e}utilisation de textes dans les livres liturgiques du Moyen {\^A}ge.

Paper
Add Code

Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora

no code implementations • COLING 2018 • Amir Hazem, Emmanuel Morin

For that purpose, we propose the first systematic evaluation of different word embedding models for bilingual terminology extraction from specialized comparable corpora.

Information Retrieval Machine Translation

Paper
Add Code

Word Embedding Approach for Synonym Extraction of Multi-Word Terms

1 code implementation • LREC 2018 • Amir Hazem, B{\'e}atrice Daille

Information Retrieval Machine Translation +3

Paper
Code

PyRATA, Python Rule-based feAture sTructure Analysis

1 code implementation • JEPTALNRECITAL 2018 • Hern, Nicolas ez, Amir Hazem

Feature Engineering

Paper
Code

A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks

no code implementations • LREC 2018 • Amir Hazem, Basma El Amal Boussaha, Hern, Nicolas ez

Community Question Answering Natural Language Inference +2

Paper
Add Code

MappSent at IJCNLP-2017 Task 5: A Textual Similarity Approach Applied to Multi-choice Question Answering in Examinations

no code implementations • IJCNLP 2017 • Amir Hazem

In this paper we present MappSent, a textual similarity approach that we applied to the multi-choice question answering in exams shared task.

Information Retrieval Question Answering +1

Paper
Add Code

Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora

no code implementations • IJCNLP 2017 • Amir Hazem, Emmanuel Morin

Bilingual lexicon extraction from comparable corpora is constrained by the small amount of available data when dealing with specialized domains.

Word Embeddings

Paper
Add Code

MappSent: a Textual Mapping Approach for Question-to-Question Similarity

no code implementations • RANLP 2017 • Amir Hazem, Basma El Amel Boussaha, Hern, Nicolas ez

Since the advent of word embedding methods, the representation of longer pieces of texts such as sentences and paragraphs is gaining more and more interest, especially for textual similarity tasks.

Community Question Answering Question Similarity +4

Paper
Add Code

Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora

no code implementations • COLING 2016 • Amir Hazem, Emmanuel Morin

Comparable corpora are the main alternative to the use of parallel corpora to extract bilingual lexicons.

Machine Translation Topic Models +2

Paper
Add Code

Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models

no code implementations • LREC 2016 • Amir Hazem, Emmanuel Morin

There is a rich flora of word space models that have proven their efficiency in many different applications including information retrieval (Dumais, 1988), word sense disambiguation (Schutze, 1992), various semantic knowledge tests (Lund et al., 1995; Karlgren, 2001), and text categorization (Sahlgren, 2005).

Information Retrieval Retrieval +2

Paper
Add Code

Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis

no code implementations • LREC 2016 • Amir Hazem, B{\'e}atrice Daille

We also show that the adapted approach significantly improve bilingual lexicon extraction from comparable corpora compared to the approach at the word level.

Translation

Paper
Add Code

LINA: Identifying Comparable Documents from Wikipedia

no code implementations • WS 2015 • Emmanuel Morin, Amir Hazem, Florian Boudin, Elizaveta Loginova-Clouet

Machine Translation

Paper
Add Code

Continuous Adaptation to User Feedback for Statistical Machine Translation

no code implementations • HLT 2015 • Holger Schwenk, Loïc Barrault, Amir Hazem, Frédéric Blain, Fethi Bougares

Domain Adaptation Machine Translation +2

Paper
Add Code

Looking at Unbalanced Specialized Comparable Corpora for Bilingual Lexicon Extraction

no code implementations • ACL 2014 • Emmanuel Morin, Amir Hazem

Paper
Add Code

Semi-compositional Method for Synonym Extraction of Multi-Word Terms

no code implementations • LREC 2014 • B{\'e}atrice Daille, Amir Hazem

Automatic synonyms and semantically related word extraction is a challenging task, useful in many NLP applications such as question answering, search query expansion, text summarization, etc.

Question Answering Text Summarization

Paper
Add Code

Word Co-occurrence Counts Prediction for Bilingual Terminology Extraction from Comparable Corpora

no code implementations • IJCNLP 2013 • Amir Hazem, Emmanuel Morin

Paper
Add Code

A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora

no code implementations • WS 2013 • Amir Hazem, Emmanuel Morin

Information Retrieval Language Modelling +3

Paper
Add Code

Bilingual Lexicon Extraction from Comparable Corpora by Combining Contextual Representations (Extraction de lexiques bilingues \`a partir de corpus comparables par combinaison de repr\'esentations contextuelles) [in French]

no code implementations • JEPTALNRECITAL 2013 • Amir Hazem, Emmanuel Morin

Paper
Add Code

Participation du LINA \`a DEFT2012 (LINA at DEFT2012) [in French]

no code implementations • JEPTALNRECITAL 2012 • Florian Boudin, Amir Hazem, Hern, Nicolas ez, Prajol Shrestha

Keyword Extraction

Paper
Add Code

Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora

no code implementations • LREC 2012 • Amir Hazem, Emmanuel Morin

One of the main resources used for the task of bilingual lexicon extraction from comparable corpora is : the bilingual dictionary, which is considered as a bridge between two languages.

Information Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.