TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Data-to-Text Generation	Czech Restaurant NLG	tgen	BLEU score	21.96	# 2
Data-to-Text Generation	Czech Restaurant NLG	tgen	METEOR	23.32	# 2
Data-to-Text Generation	Czech Restaurant NLG	tgen	CIDER	2.18	# 2
Data-to-Text Generation	Czech Restaurant NLG	tgen	NIST	4.77	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-generation-for-czech-data-and-1/data-to-text-generation-on-czech-restaurant)](https://paperswithcode.com/sota/data-to-text-generation-on-czech-restaurant?p=neural-generation-for-czech-data-and-1)`

Neural Generation for Czech: Data and Baselines

WS 2019 · Ond{\v{r}}ej Du{\v{s}}ek, Filip Jur{\v{c}}{\'\i}{\v{c}}ek ·

We present the first dataset targeted at end-to-end NLG in Czech in the restaurant domain, along with several strong baseline models using the sequence-to-sequence approach. While non-English NLG is under-explored in general, Czech, as a morphologically rich language, makes the task even harder: Since Czech requires inflecting named entities, delexicalization or copy mechanisms do not work out-of-the-box and lexicalizing the generated outputs is non-trivial. In our experiments, we present two different approaches to this this problem: (1) using a neural language model to select the correct inflected form while lexicalizing, (2) a two-step generation setup: our sequence-to-sequence model generates an interleaved sequence of lemmas and morphological tags, which are then inflected by a morphological generator.

PDF Abstract