Search Results for author: Milan Gritta

Found 11 papers, 8 papers with code

Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

1 code implementation 26 Jul 2023 Songbo Hu, Han Zhou, Mete Hergul, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Ivan Vulić, Anna Korhonen

Creating high-quality annotated data for task-oriented dialog (ToD) is known to be notoriously difficult, and the challenges are amplified when the goal is to create equitable, culturally adapted, and large-scale ToD datasets for multiple languages.

Translation

PanGu-Coder: Program Synthesis with Function-Level Language Modeling

1 code implementation 22 Jul 2022 Fenia Christopoulou, Gerasimos Lampouras, Milan Gritta, Guchun Zhang, Yinpeng Guo, Zhongqi Li, Qi Zhang, Meng Xiao, Bo Shen, Lin Li, Hao Yu, Li Yan, Pingyi Zhou, Xin Wang, Yuchi Ma, Ignacio Iacobacci, Yasheng Wang, Guangtai Liang, Jiansheng Wei, Xin Jiang, Qianxiang Wang, Qun Liu

We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i.e., the synthesis of programming language solutions given a natural language problem description.

Code Generation Language Modelling +2

Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning

1 code implementation Findings (ACL) 2021 Benjamin Minixhofer, Milan Gritta, Ignacio Iacobacci

For small Natural Language Inference (NLI) datasets, language modelling is typically followed by pretraining on a large (labelled) NLI dataset before fine-tuning with each NLI subtask.

Language Modelling Natural Language Inference +1

Conversation Graph: Data Augmentation, Training and Evaluation for Non-Deterministic Dialogue Management

2 code implementations 29 Oct 2020 Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci

We propose the Conversation Graph (ConvGraph), a graph-based representation of dialogues that can be exploited for data augmentation, multi-reference training and evaluation of non-deterministic agents.

Data Augmentation Dialogue Management +3

A Comparison of Techniques for Sentiment Classification of Film Reviews

no code implementations 12 May 2019 Milan Gritta

We undertake the task of comparing lexicon-based sentiment classification of film reviews with machine learning approaches.

BIG-bench Machine Learning Classification +3

A Pragmatic Guide to Geoparsing Evaluation

1 code implementation 29 Oct 2018 Milan Gritta, Mohammad Taher Pilehvar, Nigel Collier

Empirical methods in geoparsing have thus far lacked a standard evaluation framework describing the task, metrics and data used to compare state-of-the-art systems.

Named Entity Recognition +2

Which Melbourne? Augmenting Geocoding with Maps

no code implementations ACL 2018 Milan Gritta, Mohammad Taher Pilehvar, Nigel Collier

The purpose of text geolocation is to associate geographic information contained in a document with a set (or sets) of coordinates, either implicitly by using linguistic features and/or explicitly by using geographic metadata combined with heuristics.
