Search Results for author: Jan Hajič

Found 34 papers, 2 papers with code

Prague Dependency Treebank - Consolidated 1.0

no code implementations • LREC 2020 • Jan Hajič, Eduard Bejček, Jaroslava Hlaváčová, Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková

We present a richly annotated and genre-diversified language resource, the Prague Dependency Treebank-Consolidated 1.0 (PDT-C 1.0), the purpose of which is - as has always been the case for the family of Prague Dependency Treebanks - to serve both as training data for various types of NLP tasks and as a resource for linguistically oriented research.

Translation

Parallel Dependency Treebank Annotated with Interlinked Verbal Synonym Classes and Roles

no code implementations • WS 2019 • Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič

LemmaTag: Jointly Tagging and Lemmatizing for Morphologically Rich Languages with BRNNs

1 code implementation • EMNLP 2018 • Daniel Kondratyuk, Tomáš Gavenčiak, Milan Straka, Jan Hajič

We present LemmaTag, a featureless neural network architecture that jointly generates part-of-speech tags and lemmas for sentences by using bidirectional RNNs with character-level and word-level embeddings.

Lemmatization Machine Translation +4

CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

no code implementations • CONLL 2018 • Daniel Zeman, Jan Hajič, Martin Popel, Martin Potthast, Milan Straka, Filip Ginter, Joakim Nivre, Slav Petrov

Every year, the Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets.

Dependency Parsing Morphological Analysis +1

Synonymy in Bilingual Context: The CzEngClass Lexicon

no code implementations • COLING 2018 • Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič

This paper describes CzEngClass, a bilingual lexical resource being built to investigate verbal synonymy in bilingual context and to relate semantic roles common to one synonym class to verb arguments (verb valency).

Word Sense Disambiguation

Creating a Verb Synonym Lexicon Based on a Parallel Corpus

no code implementations • LREC 2018 • Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič

SumeCzech: Large Czech News-Based Summarization Dataset

no code implementations • LREC 2018 • Milan Straka, Nikita Mediankin, Tom Kocmi, Zdeněk Žabokrtský, Vojtěch Hudeček, Jan Hajič

Document Summarization Machine Translation +1

Diacritics Restoration Using Neural Networks

1 code implementation • LREC 2018 • Jakub Náplava, Milan Straka, Pavel Straňák, Jan Hajič

Ranked #2 on Czech Text Diacritization on Multilingual Dataset for Training and Evaluating Diacritics Restoration Systems

Croatian Text Diacritization Czech Text Diacritization +10

Bridging the LAPPS Grid and CLARIN

no code implementations • LREC 2018 • Erhard Hinrichs, Nancy Ide, James Pustejovsky, Jan Hajič, Marie Hinrichs, Mohammad Fazleh Elahi, Keith Suderman, Marc Verhagen, Kyeongmin Rim, Pavel Straňák, Jozef Mišutka

Tools for Building an Interlinked Synonym Lexicon Network

no code implementations • LREC 2018 • Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

no code implementations • CONLL 2017 • Daniel Zeman, Martin Popel, Milan Straka, Jan Hajič, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gokirmak, Anna Nedoluzhko, Silvie Cinková, Jan Hajič jr., Jaroslava Hlaváčová, Václava Kettnerová, Zdeňka Urešová, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher D. Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria de Paiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonça, Tatiana Lando, Rattima Nitisaroj, Josie Li

The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets.

Dependency Parsing

Joint search in a bilingual valency lexicon and an annotated corpus

no code implementations • COLING 2016 • Eva Fučíková, Jan Hajič, Zdeňka Urešová

In this paper and the associated system demo, we present an advanced search system that allows users to perform a joint search over a (bilingual) valency lexicon and a correspondingly annotated linked parallel corpus.

Enriching a Valency Lexicon by Deverbative Nouns

no code implementations • WS 2016 • Eva Fučíková, Jan Hajič, Zdeňka Urešová

The motivation for the task is to extend a verbal valency (i.e., predicate-argument) lexicon by adding nouns that share the valency properties with the base verb, assuming their properties can be derived (even if not trivially) from the underlying verb by deterministic grammatical rules.

Inherently Pronominal Verbs in Czech: Description and Conversion Based on Treebank Annotation

no code implementations • WS 2016 • Zdeňka Urešová, Eduard Bejček, Jan Hajič

Universal Dependencies v1: A Multilingual Treebank Collection

no code implementations • LREC 2016 • Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman

Cross-linguistically consistent annotation is necessary for sound comparative evaluation and cross-lingual learning experiments.

QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages

no code implementations • LREC 2016 • Arantxa Otegi, Nora Aranberri, Antonio Branco, Jan Hajič, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva, Steven Neale

This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part-of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference.

Cross-Lingual Transfer Entity Disambiguation +9

Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities

no code implementations • LREC 2016 • Georg Rehm, Jan Hajič, Josef van Genabith, Andrejs Vasiljevs

META-NET is a European network of excellence, founded in 2010, that consists of 60 research centres in 34 European countries.

Towards Comparability of Linguistic Graph Banks for Semantic Parsing

no code implementations • LREC 2016 • Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Angelina Ivanova, Zdeňka Urešová

We announce a new language resource for research on semantic parsing, a large, carefully curated collection of semantic dependency graphs representing multiple linguistic traditions.

Dependency Parsing Semantic Dependency Parsing +1

UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing

no code implementations • LREC 2016 • Milan Straka, Jan Hajič, Jana Straková

Automatic natural language processing of large texts often presents recurring challenges in multiple languages: even for the most advanced tasks, the texts are first processed by basic processing steps - from tokenization to parsing.

Dependency Parsing Lemmatization +4

Using Parallel Texts and Lexicons for Verbal Word Sense Disambiguation

no code implementations • WS 2015 • Ondřej Dušek, Eva Fučíková, Jan Hajič, Martin Popel, Jana Šindlerová, Zdeňka Urešová

Machine Translation Word Sense Disambiguation

SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing

no code implementations • SEMEVAL 2015 • Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Zdeňka Urešová

Dependency Parsing Semantic Dependency Parsing +1

Bilingual English-Czech Valency Lexicon Linked to a Parallel Corpus

no code implementations • WS 2015 • Zdeňka Urešová, Ondřej Dušek, Eva Fučíková, Jan Hajič, Jana Šindlerová

Machine Translation Semantic Role Labeling

SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing

no code implementations • SEMEVAL 2014 • Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Dan Flickinger, Jan Hajič, Angelina Ivanova, Yi Zhang

Dependency Parsing Semantic Dependency Parsing

Comparing Czech and English AMRs

no code implementations • WS 2014 • Jan Hajič, Ondřej Bojar, Zdeňka Urešová

Machine Translation of Medical Texts in the Khresmoi Project

no code implementations • WS 2014 • Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, Daniel Zeman

Domain Adaptation Image Retrieval +3

Verbal Valency Frame Detection and Selection in Czech and English

no code implementations • WS 2014 • Ondřej Dušek, Jan Hajič, Zdeňka Urešová

Predicate Detection Word Sense Disambiguation

Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition

no code implementations • ACL 2014 • Jana Straková, Milan Straka, Jan Hajič

Lemmatization Morphological Analysis +6

Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain

no code implementations • LREC 2014 • Zdeňka Urešová, Jan Hajič, Pavel Pecina, Ondřej Dušek

This paper presents development and test sets for machine translation of search queries in cross-lingual information retrieval in the medical domain.

Cross-Lingual Information Retrieval Domain Adaptation +4

Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech

no code implementations • LREC 2014 • Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, Xiuhong Zhang

Abstract Meaning Representations (AMRs) are rooted, directional and labeled graphs that abstract away from morpho-syntactic idiosyncrasies such as word category (verbs and nouns), word order, and function words (determiners, some prepositions).

Machine Translation Semantic Parsing +2

CLARA: A New Generation of Researchers in Common Language Resources and Their Applications

no code implementations • LREC 2014 • Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadiņa, Bolette Pedersen, Costanza Navarretta, Núria Bel, Krister Lindén, Markéta Lopatková, Jan Hajič, Gisle Andersen, Przemyslaw Lenkiewicz

CLARA (Common Language Resources and Their Applications) is a Marie Curie Initial Training Network which ran from 2009 until 2014 with the aim of providing researcher training in crucial areas related to language resources and infrastructure.

The Strategic Impact of META-NET on the Regional, National and International Level

no code implementations • LREC 2014 • Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiş, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, Jolanta Zabarskaite

This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of funding for LT topics.

Machine Translation

Joint Morphological and Syntactic Analysis for Richly Inflected Languages

no code implementations • TACL 2013 • Bernd Bohnet, Joakim Nivre, Igor Boguslavsky, Richárd Farkas, Filip Ginter, Jan Hajič

Joint morphological and syntactic analysis has been proposed as a way of improving parsing accuracy for richly inflected languages.

Dependency Parsing Part-Of-Speech Tagging

Announcing Prague Czech-English Dependency Treebank 2.0

no code implementations • LREC 2012 • Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský

We introduce a substantial update of the Prague Czech-English Dependency Treebank, a parallel corpus manually annotated at the deep syntactic layer of linguistic representation.

Coreference Resolution Sentence

HamleDT: To Parse or Not to Parse?

no code implementations • LREC 2012 • Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič

We propose HamleDT - HArmonized Multi-LanguagE Dependency Treebank.

Dependency Parsing
