1 code implementation • WS 2018 • Alex Rush, er
A major goal of open-source NLP is to quickly and accurately reproduce the results of new work, in a manner that the community can easily use and modify.
1 code implementation • LREC 2020 • Ossama Obeid, Nasser Zalmout, Salam Khalifa, Dima Taji, Mai Oudah, Bashar Alhafni, Go Inoue, Fadhl Eryani, Alex Erdmann, er, Nizar Habash
We present CAMeL Tools, a collection of open-source tools for Arabic natural language processing in Python.
1 code implementation • ACL 2019 • Artem Chernodub, Oleksiy Oliynyk, Philipp Heidenreich, Alex Bondarenko, Matthias Hagen, Chris Biemann, Alex Panchenko, er
We present TARGER, an open source neural argument mining framework for tagging arguments in free input texts and for keyword-based retrieval of arguments from an argument-tagged web-scale corpus.
1 code implementation • ACL 2019 • Alex Koller, er, Stephan Oepen, Weiwei Sun
This tutorial is on representing and processing sentence meaning in the form of labeled directed graphs.
1 code implementation • LREC 2016 • Siim Orasmaa, Timo Petmanson, Alex Tkachenko, er, Sven Laur, Heiki-Jaan Kaalep
Although there are many tools for natural language processing tasks in Estonian, these tools are very loosely interoperable, and it is not easy to build practical applications on top of them.
1 code implementation • EMNLP 2018 • Luke Melas-Kyriazi, Alex Rush, er, George Han
Image paragraph captioning models aim to produce detailed descriptions of a source image.
1 code implementation • WS 2018 • Thiago Castro Ferreira, Diego Moussallem, Emiel Krahmer, S Wubben, er
This paper describes the enrichment of WebNLG corpus (Gardent et al., 2017a, b), with the aim to further extend its usefulness as a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.
1 code implementation • EMNLP 2018 • Navonil Majumder, Soujanya Poria, Alex Gelbukh, er, Md. Shad Akhtar, Erik Cambria, Asif Ekbal
Sentiment analysis has immense implications in e-commerce through user feedback mining.
2 code implementations • NAACL 2019 • Alex Erdmann, er, David Joseph Wrisley, Benjamin Allen, Christopher Brown, Sophie Cohen-Bod{\'e}n{\`e}s, Micha Elsner, Yukun Feng, Brian Joseph, B{\'e}atrice Joyeux-Prunel, Marie-Catherine de Marneffe
Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in semantic annotation of specialized corpora.
1 code implementation • COLING 2018 • Juan Diego Rodriguez, Adam Caldwell, Alex Liu, er
Our results empirically demonstrate when each of the published approaches tends to do well.
Entity Extraction using GAN Named Entity Recognition (NER) +2
1 code implementation • ACL 2019 • Viktor Hangya, Alex Fraser, er
Mining parallel sentences from comparable corpora is important.
1 code implementation • NAACL 2018 • Fabienne Braune, Viktor Hangya, Tobias Eder, Alex Fraser, er
Bilingual word embeddings are useful for bilingual lexicon induction, the task of mining translations of given words.
1 code implementation • WS 2018 • Egon Stemle, Alex Onysko, er
This article describes the system that participated in the shared task on metaphor detection on the Vrije University Amsterdam Metaphor Corpus (VUA).
1 code implementation • WS 2018 • Chris van der Lee, Emiel Krahmer, S Wubben, er
The current study investigated novel techniques and methods for trainable approaches to data-to-text generation.
1 code implementation • WS 2019 • Dmitry Puzyrev, Artem Shelmanov, Alex Panchenko, er, Ekaterina Artemova
This paper presents the first gold-standard resource for Russian annotated with compositionality information of noun compounds.
1 code implementation • ACL 2019 • Sergey Golovanov, Rauf Kurbanov, Sergey Nikolenko, Kyryl Truskovskyi, Alex Tselousov, er, Thomas Wolf
Large-scale pretrained language models define state of the art in natural language processing, achieving outstanding performance on a variety of tasks.
1 code implementation • COLING 2018 • Florian Kunneman, S Wubben, er, Antal Van den Bosch, Emiel Krahmer
In the second evaluation, the gold-standard pros and cons were assessed along with the system output.
1 code implementation • ACL 2018 • Viktor Hangya, Fabienne Braune, Alex Fraser, er, Hinrich Sch{\"u}tze
Bilingual tasks, such as bilingual lexicon induction and cross-lingual classification, are crucial for overcoming data sparsity in the target language.
1 code implementation • WS 2018 • Thiago Castro Ferreira, S Wubben, er, Emiel Krahmer
This study describes the approach developed by the Tilburg University team to the shallow task of the Multilingual Surface Realization Shared Task 2018 (SR18).
1 code implementation • LREC 2016 • Alice Frain, S Wubben, er
We test the viability of our data on the task of classification of satire.
no code implementations • CL 2017 • Hassan Sajjad, Helmut Schmid, Alex Fraser, er, Hinrich Sch{\"u}tze
After training, the unlabeled data is disambiguated based on the posterior probabilities of the two sub-models.
no code implementations • ACL 2018 • Hannah Rohde, Alex Johnson, er, Nathan Schneider, Bonnie Webber
Theories of discourse coherence posit relations between discourse segments as a key feature of coherent text.
no code implementations • ACL 2018 • Alex Erdmann, er, Nasser Zalmout, Nizar Habash
Arabic dialects lack large corpora and are noisy, being linguistically disparate with no standardized spelling.
no code implementations • ACL 2018 • Mikhail Burtsev, Alex Seliverstov, er, Rafael Airapetyan, Mikhail Arkhipov, Dilyara Baymurzina, Nickolay Bushkov, Olga Gureenkova, Taras Khakhulin, Yuri Kuratov, Denis Kuznetsov, Alexey Litinsky, Varvara Logacheva, Alexey Lymar, Valentin Malykh, Maxim Petrov, Vadim Polulyakh, Leonid Pugachev, Alexey Sorokin, Maria Vikhreva, Marat Zaynutdinov
It supports modular as well as end-to-end approaches to implementation of conversational agents.
no code implementations • ACL 2017 • Mart{\'\i}n Villalba, Christoph Teichmann, Alex Koller, er
The referring expressions (REs) produced by a natural language generation (NLG) system can be misunderstood by the hearer, even when they are semantically correct.
no code implementations • ACL 2016 • Seid Muhie Yimam, Heiner Ulrich, von L, Tatiana esberger, Marcel Rosenbach, Michaela Regneri, Alex Panchenko, er, Franziska Lehmann, Uli Fahrer, Chris Biemann, Kathrin Ballweg
no code implementations • EACL 2017 • Alex Panchenko, er, Eugen Ruppert, Stefano Faralli, Simone Paolo Ponzetto, Chris Biemann
On the example of word sense induction and disambiguation (WSID), we show that it is possible to develop an interpretable model that matches the state-of-the-art models in accuracy.
no code implementations • EACL 2017 • Stefano Faralli, Alex Panchenko, er, Chris Biemann, Simone Paolo Ponzetto
In this paper, we present ContrastMedium, an algorithm that transforms noisy semantic networks into full-fledged, clean taxonomies.
no code implementations • EACL 2017 • Thiago Castro Ferreira, Emiel Krahmer, S Wubben, er
The model relies on the REGnames corpus, a dataset with 53, 102 proper name references to 1, 000 people in different discourse contexts.
no code implementations • EACL 2017 • Matthias Huck, Ale{\v{s}} Tamchyna, Ond{\v{r}}ej Bojar, Alex Fraser, er
Translating into morphologically rich languages is difficult.
no code implementations • EACL 2017 • Marion Weller-Di Marco, Alex Fraser, er, Sabine Schulte im Walde
Many errors in phrase-based SMT can be attributed to problems on three linguistic levels: morphological complexity in the target language, structural differences and lexical choice.
no code implementations • EACL 2017 • Tolga Uslu, Wahed Hemati, Alex Mehler, er, Daniel Baumartz
R is a very powerful framework for statistical modeling.
no code implementations • EACL 2017 • Johannes Gontrum, Jonas Groschwitz, Alex Koller, er, Christoph Teichmann
We present Alto, a rapid prototyping tool for new grammar formalisms.
no code implementations • EACL 2017 • Verena Henrich, Alex Lang, er
Understanding the social media audience is becoming increasingly important for social media analysis.
no code implementations • EACL 2017 • Beto Boullosa, Richard Eckart de Castilho, Alex Geyken, er, Lothar Lemnitzer, Iryna Gurevych
This paper describes an application system aimed to help lexicographers in the extraction of example sentences for a given headword based on its different senses.
no code implementations • NAACL 2018 • Nasser Zalmout, Alex Erdmann, er, Nizar Habash
User-generated text tends to be noisy with many lexical and orthographic inconsistencies, making natural language processing (NLP) tasks more challenging.
no code implementations • NAACL 2016 • Thiago Castro Ferreira, Emiel Krahmer, S Wubben, er
no code implementations • SEMEVAL 2018 • Elena Mikhalkova, Yuri Karyakin, Alex Voronov, er, Dmitry Grigoriev, Artem Leoznov
The paper describes our search for a universal algorithm of detecting intentional lexical ambiguity in different forms of creative language.
no code implementations • SEMEVAL 2018 • Lena Hettinger, Alex Dallmann, er, Albin Zehe, Thomas Niebler, Andreas Hotho
In this paper we describe our system for SemEval-2018 Task 7 on classification of semantic relations in scientific literature for clean (subtask 1. 1) and noisy data (subtask 1. 2).
no code implementations • SEMEVAL 2018 • Alex Zhang, er, Marine Carpuat
We describe the University of Maryland{'}s submission to SemEval-018 Task 10, {``}Capturing Discriminative Attributes{''}: given word triples (w1, w2, d), the goal is to determine whether d is a discriminating attribute belonging to w1 but not w2.
no code implementations • SEMEVAL 2016 • Alex Panchenko, er, Stefano Faralli, Eugen Ruppert, Steffen Remus, Hubert Naets, C{\'e}drick Fairon, Simone Paolo Ponzetto, Chris Biemann
no code implementations • WS 2018 • Alex Rich, er, Pamela Osborn Popp, David Halpern, Anselm Rothe, Todd Gureckis
Psychological research on learning and memory has tended to emphasize small-scale laboratory studies.
no code implementations • WS 2018 • Segun Taofeek Aroyehun, Jason Angel, Daniel Alej P{\'e}rez Alvarez, ro, Alex Gelbukh, er
We describe the systems of NLP-CIC team that participated in the Complex Word Identification (CWI) 2018 shared task.
no code implementations • WS 2018 • Agnieszka Mykowiecka, Malgorzata Marciniak, Aleks Wawer, er
The paper addresses the classification of isolated Polish adjective-noun phrases according to their metaphoricity.
no code implementations • WS 2018 • Agnieszka Mykowiecka, Aleks Wawer, er, Malgorzata Marciniak
The paper addresses detection of figurative usage of words in English text.
no code implementations • WS 2018 • Filip Skurniak, Maria Janicka, Aleks Wawer, er
This paper describes multiple solutions designed and tested for the problem of word-level metaphor detection.
no code implementations • WS 2018 • Jean Senellart, Dakun Zhang, Bo wang, Guillaume Klein, Ramatch, Jean-Pierre irin, Josep Crego, Alex Rush, er
We present a system description of the OpenNMT Neural Machine Translation entry for the WNMT 2018 evaluation.
no code implementations • COLING 2018 • Segun Taofeek Aroyehun, Alex Gelbukh, er
On this task, we investigate the efficacy of deep neural network models of varying complexity.
no code implementations • WS 2018 • Hendrik Strobelt, Sebastian Gehrmann, Michael Behrisch, Adam Perer, Hanspeter Pfister, Alex Rush, er
Neural attention-based sequence-to-sequence models (seq2seq) (Sutskever et al., 2014; Bahdanau et al., 2014) have proven to be accurate and robust for many sequence prediction tasks.
no code implementations • WS 2018 • Alex Erdmann, er, Nizar Habash
Morphologically rich languages are challenging for natural language processing tasks due to data sparsity.
no code implementations • WS 2018 • Segun Taofeek Aroyehun, Alex Gelbukh, er
We describe our submissions to the Third Social Media Mining for Health Applications Shared Task.
no code implementations • WS 2018 • Dario Stojanovski, Alex Fraser, er
We show that NMT models taking advantage of context oracle signals can achieve considerable gains in BLEU, of up to 7. 02 BLEU for coreference and 1. 89 BLEU for coherence on subtitles translation.
no code implementations • WS 2018 • Alex Molchanov, er
This paper describes the PROMT submissions for the WMT 2018 Shared News Translation Task.
no code implementations • WS 2018 • Ngoc-Quan Pham, Jan Niehues, Alex Waibel, er
We present our experiments in the scope of the news translation task in WMT 2018, in directions: English→German.
no code implementations • WS 2018 • Dario Stojanovski, Viktor Hangya, Matthias Huck, Alex Fraser, er
We describe LMU Munich{'}s unsupervised machine translation systems for English↔German translation.
no code implementations • WS 2018 • Matthias Huck, Dario Stojanovski, Viktor Hangya, Alex Fraser, er
The systems were used for our participation in the WMT18 biomedical translation task and in the shared task on machine translation of news.
no code implementations • WS 2018 • Viktor Hangya, Alex Fraser, er
In this paper we describe LMU Munich{'}s submission for the \textit{WMT 2018 Parallel Corpus Filtering} shared task which addresses the problem of cleaning noisy parallel corpora.
no code implementations • WS 2018 • Alex Shvets, er, Simon Mille, Leo Wanner
An increasing amount of research tackles the challenge of text generation from abstract ontological or semantic structures, which are in their very nature potentially large connected graphs.
no code implementations • WS 2018 • Henry Elder, Sebastian Gehrmann, Alex O{'}Connor, er, Qun Liu
In natural language generation (NLG), the task is to generate utterances from a more abstract input, such as structured data.
no code implementations • WS 2017 • Alex Calderwood, er, Elizabeth A. Pruett, Raymond Ptucha, Christopher Homan, Cecilia Ovesdotter Alm
Interpersonal violence (IPV) is a prominent sociological problem that affects people of all demographic backgrounds.
no code implementations • WS 2017 • Alex Panchenko, er, Stefano Faralli, Simone Paolo Ponzetto, Chris Biemann
We introduce a new method for unsupervised knowledge-based word sense disambiguation (WSD) based on a resource that links two types of sense-aware lexical networks: one is induced from a corpus using distributional semantics, the other is manually constructed.
no code implementations • WS 2017 • Aleks Wawer, er, Agnieszka Mykowiecka
This paper compares two approaches to word sense disambiguation using word embeddings trained on unambiguous synonyms.
no code implementations • WS 2017 • Thomas Alex Trost, er, Dietrich Klakow
Word embeddings are high-dimensional vector representations of words and are thus difficult to interpret.
no code implementations • WS 2017 • Thiago Castro Ferreira, Iacer Calixto, S Wubben, er, Emiel Krahmer
In this paper, we study AMR-to-text generation, framing it as a translation task and comparing two different MT approaches (Phrase-based and Neural MT).
no code implementations • WS 2017 • Chris van der Lee, Emiel Krahmer, S Wubben, er
We present PASS, a data-to-text system that generates Dutch soccer reports from match statistics.
no code implementations • WS 2017 • Alex Koller, er, Nikos Engonopoulos
Integrating surface realization and the generation of referring expressions into a single algorithm can improve the quality of the generated sentences.
no code implementations • WS 2017 • Jeffrey Ling, Alex Rush, er
Sequence-to-sequence models with attention have been successful for a variety of NLP problems, but their speed does not scale well for tasks with long source sequences such as document summarization.
Ranked #25 on Document Summarization on CNN / Daily Mail
no code implementations • WS 2017 • Alex Prange, er, Margarita Chikobava, Peter Poller, Michael Barz, Daniel Sonntag
We present a multimodal dialogue system that allows doctors to interact with a medical decision support system in virtual reality (VR).
no code implementations • WS 2017 • Christoph Teichmann, Alex Koller, er, Jonas Groschwitz
We generalize coarse-to-fine parsing to grammar formalisms that are more expressive than PCFGs and/or describe languages of trees or graphs.
no code implementations • RANLP 2017 • Seid Muhie Yimam, Steffen Remus, Alex Panchenko, er, Andreas Holzinger, Chris Biemann
In this paper, we describe the concept of entity-centric information access for the biomedical domain.
no code implementations • WS 2016 • Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alex Fraser, er, Ale{\v{s}} Tamchyna, Ond{\v{r}}ej Bojar, Barry Haddow, Rico Sennrich, Fr{\'e}d{\'e}ric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alex Allauzen, re, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, Fran{\c{c}}ois Yvon, M{\=a}rcis Pinnis, Stella Frank
Ranked #12 on Machine Translation on WMT2016 English-Romanian
no code implementations • WS 2016 • Alex Erdmann, er, Christopher Brown, Brian Joseph, Mark Janse, Petra Ajaka, Micha Elsner, Marie-Catherine de Marneffe
Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and other forms of prose and poetry, the body of Latin texts is still relatively sparse compared to English.
no code implementations • WS 2016 • Sowmya Vajjala, Detmar Meurers, Alex Eitel, er, Katharina Scheiter
Computational approaches to readability assessment are generally built and evaluated using gold standard corpora labeled by publishers or teachers rather than being grounded in observations about human performance.
no code implementations • WS 2016 • Christian Bentz, Tatyana Ruzsics, Alex Koplenig, er, Tanja Samard{\v{z}}i{\'c}
Language complexity is an intriguing phenomenon argued to play an important role in both language learning and processing.
no code implementations • WS 2016 • Markus Kreuzthaler, Michel Oleynik, Alex Avian, er, Stefan Schulz
The disambiguation of period characters is therefore an important task for sentence and abbreviation detection.
no code implementations • WS 2016 • Thiago Castro Ferreira, S Wubben, er, Emiel Krahmer
no code implementations • COLING 2018 • Chris van der Lee, Bart Verduijn, Emiel Krahmer, S Wubben, er
We present an evaluation of PASS, a data-to-text system that generates Dutch soccer reports from match statistics which are automatically tailored towards fans of one club or the other.
no code implementations • COLING 2018 • Florian Dessloch, Thanh-Le Ha, Markus M{\"u}ller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, Elizabeth Salesky, Matthias Sperber, Sebastian St{\"u}ker, Thomas Zenkel, Alex Waibel, er
{\%} Combining these techniques, we are able to provide an adapted speech translation system for several European languages.
no code implementations • COLING 2018 • Daniel Baumartz, Tolga Uslu, Alex Mehler, er
In this paper we present LTV, a website and API that generates labeled topic classifications based on the Dewey Decimal Classification (DDC), an international standard for topic classification in libraries.
no code implementations • COLING 2016 • Yimai Fang, Haoyue Zhu, Ewa Muszy{\'n}ska, Alex Kuhnle, er, Simone Teufel
It is a further development of an existing summariser that has an incremental, proposition-based content selection process but lacks a natural language (NL) generator for the final output.
no code implementations • COLING 2016 • Wahed Hemati, Tolga Uslu, Alex Mehler, er
More and more disciplines require NLP tools for performing automatic text analyses on various levels of linguistic resolution.
no code implementations • COLING 2016 • Rudolf Schneider, Cordula Guder, Torsten Kilias, Alex L{\"o}ser, er, Jens Graupmann, Oleks Kozachuk, R
We present INDREX-MM, a main memory database system for interactively executing two interwoven tasks, declarative relation extraction from text and their exploitation with SQL.
no code implementations • COLING 2016 • Sebastian Arnold, Robert Dziuba, Alex L{\"o}ser, er
We introduce TASTY (Tag-as-you-type), a novel text editor for interactive entity linking as part of the writing process.
no code implementations • IJCNLP 2017 • Bill McDowell, Nathanael Chambers, Alex Ororbia II, er, David Reitter
Within this prediction reranking framework, we propose an alternative scoring function, showing an 8. 8{\%} relative gain over the original CAEVO.
no code implementations • IJCNLP 2017 • Somnath Banerjee, Partha Pakray, Riyanka Manna, Dipankar Das, Alex Gelbukh, er
In this paper, we describe a deep learning framework for analyzing the customer feedback as part of our participation in the shared task on Customer Feedback Analysis at the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017).
no code implementations • LREC 2018 • Nizar Habash, Fadhl Eryani, Salam Khalifa, Owen Rambow, Dana Abdulrahim, Alex Erdmann, er, Reem Faraj, Wajdi Zaghouani, Houda Bouamor, Nasser Zalmout, Sara Hassan, Faisal Al-Shargi, Sakhar Alkhereyf, Basma Abdulkareem, Esk, Ramy er, Mohammad Salameh, Hind Saddiki
no code implementations • RANLP 2017 • Aleks Wawer, er, Agnieszka Mykowiecka
In this paper we describe experiments with automated detection of metaphors in the Polish language.
no code implementations • RANLP 2017 • Alex Popov, er
This paper presents a neural network architecture for word sense disambiguation (WSD).
no code implementations • ACL 2014 • Miles Osborne, Sean Moran, Richard McCreadie, Alex Von Lunen, er, Martin Sykora, Elizabeth Cano, Neil Ireson, Craig Macdonald, Iadh Ounis, Yulan He, Tom Jackson, Fabio Ciravegna, Ann O{'}Brien
no code implementations • ACL 2012 • S Wubben, er, Antal van den Bosch, Emiel Krahmer
Ranked #3 on Text Simplification on ASSET
no code implementations • SEMEVAL 2012 • Antonio Fern{\'a}ndez, Yoan Guti{\'e}rrez, H{\'e}ctor D{\'a}vila, Alex Ch{\'a}vez, er, Andy Gonz{\'a}lez, Rainel Estrada, Yenier Casta{\~n}eda, Sonia V{\'a}zquez, Andr{\'e}s Montoyo, Rafael Mu{\~n}oz