no code implementations • Findings (ACL) 2022 • Giulio Zhou, Gerasimos Lampouras, Ignacio Iacobacci
Large pretrained models enable transfer learning to low-resource domains for language generation tasks.
no code implementations • 5 Nov 2024 • Antoine Grosnit, Alexandre Maraval, James Doran, Giuseppe Paolo, Albert Thomas, Refinath Shahul Hameed Nabeezath Beevi, Jonas Gonzalez, Khyati Khandelwal, Ignacio Iacobacci, Abdelhakim Benechehab, Hamza Cherkaoui, Youssef Attia El-Hili, Kun Shao, Jianye Hao, Jun Yao, Balazs Kegl, Haitham Bou-Ammar, Jun Wang
We introduce Agent K v1. 0, an end-to-end autonomous data science agent designed to automate, optimise, and generalise across diverse data science tasks.
no code implementations • 18 Jun 2024 • Leonidas Gee, Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci
Code Language Models have been trained to generate accurate solutions, typically with no regard for runtime.
1 code implementation • 15 May 2024 • Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci
To help accelerate the development of LMs as conversational assistants, we propose a novel automatic evaluation task: HumanRankEval (HRE).
no code implementations • CVPR 2024 • Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Fei Chen, Steven McDonagh, Gerasimos Lampouras, Ignacio Iacobacci, Sarah Parisot
To build MuLAn, we developed a training free pipeline which decomposes a monocular RGB image into a stack of RGBA layers comprising of background and isolated instances.
no code implementations • 9 Feb 2024 • Yvette Graham, Mohammed Rameez Qureshi, Haider Khalid, Gerasimos Lampouras, Ignacio Iacobacci, Qun Liu
SCI-CHAT follows previous workshops on open domain dialogue but in contrast the focus of the shared task is simulation of intelligent conversation as judged in a live human evaluation.
1 code implementation • 20 Oct 2023 • Philip John Gorinski, Matthieu Zimmer, Gerasimos Lampouras, Derrick Goh Xin Deik, Ignacio Iacobacci
The advent of large pre-trained language models in the domain of Code Synthesis has shown remarkable performance on various benchmarks, treating the problem of Code Generation in a fashion similar to Natural Language Generation, trained with a Language Modelling (LM) objective.
no code implementations • 19 Oct 2023 • Songbo Hu, Han Zhou, Moy Yuan, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Anna Korhonen, Ivan Vulić
Achieving robust language technologies that can perform well across the world's many languages is a central goal of multilingual NLP.
no code implementations • 15 Aug 2023 • Mojtaba Valizadeh, Philip John Gorinski, Ignacio Iacobacci, Martin Berger
We propose regular expression inference (REI) as a challenge for code/language modelling, and the wider machine learning community.
1 code implementation • 26 Jul 2023 • Songbo Hu, Han Zhou, Mete Hergul, Milan Gritta, Guchun Zhang, Ignacio Iacobacci, Ivan Vulić, Anna Korhonen
Creating high-quality annotated data for task-oriented dialog (ToD) is known to be notoriously difficult, and the challenges are amplified when the goal is to create equitable, culturally adapted, and large-scale ToD datasets for multiple languages.
1 code implementation • 10 Dec 2022 • Yue Feng, Gerasimos Lampouras, Ignacio Iacobacci
To alleviate the problem of structured databases' limited coverage, recent task-oriented dialogue systems incorporate external unstructured knowledge to guide the generation of system responses.
1 code implementation • 22 Oct 2022 • Fenia Christopoulou, Gerasimos Lampouras, Ignacio Iacobacci
Curriculum Learning (CL) is a technique of training models via ranking examples in a typically increasing difficulty trend with the aim of accelerating convergence and improving generalisability.
Natural Language Understanding Zero-Shot Cross-Lingual Transfer
no code implementations • 22 Oct 2022 • Chenxi Whitehouse, Fenia Christopoulou, Ignacio Iacobacci
We use Wikidata and English Wikipedia to construct an entity-centric CS corpus by switching entities to their counterparts in other languages.
no code implementations • 12 Oct 2022 • Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci
Multihop Question Answering is a complex Natural Language Processing task that requires multiple steps of reasoning to find the correct answer to a given question.
1 code implementation • 22 Jul 2022 • Fenia Christopoulou, Gerasimos Lampouras, Milan Gritta, Guchun Zhang, Yinpeng Guo, Zhongqi Li, Qi Zhang, Meng Xiao, Bo Shen, Lin Li, Hao Yu, Li Yan, Pingyi Zhou, Xin Wang, Yuchi Ma, Ignacio Iacobacci, Yasheng Wang, Guangtai Liang, Jiansheng Wei, Xin Jiang, Qianxiang Wang, Qun Liu
We present PanGu-Coder, a pretrained decoder-only language model adopting the PanGu-Alpha architecture for text-to-code generation, i. e. the synthesis of programming language solutions given a natural language problem description.
1 code implementation • 12 Apr 2022 • Han Zhou, Ignacio Iacobacci, Pasquale Minervini
Dialogue State Tracking (DST), a crucial component of task-oriented dialogue (ToD) systems, keeps track of all important information pertaining to dialogue history: filling slots with the most probable values throughout the conversation.
1 code implementation • Findings (ACL) 2022 • Milan Gritta, Ruoyu Hu, Ignacio Iacobacci
Task-oriented personal assistants enable people to interact with a host of devices and services using natural language.
Natural Language Understanding Zero-Shot Cross-Lingual Transfer
1 code implementation • Findings (ACL) 2021 • Benjamin Minixhofer, Milan Gritta, Ignacio Iacobacci
For small Natural Language Inference (NLI) datasets, language modelling is typically followed by pretraining on a large (labelled) NLI dataset before fine-tuning with each NLI subtask.
2 code implementations • Findings (ACL) 2021 • Milan Gritta, Ignacio Iacobacci
The introduction of pretrained cross-lingual language models brought decisive improvements to multilingual NLP tasks.
no code implementations • 13 Jan 2021 • Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci
Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability.
2 code implementations • 29 Oct 2020 • Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci
We propose the Conversation Graph (ConvGraph), a graph-based representation of dialogues that can be exploited for data augmentation, multi-reference training and evaluation of non-deterministic agents.
no code implementations • EMNLP 2020 • Ieva Staliūnaitė, Ignacio Iacobacci
The results show differences in ability to represent compositional and lexical information between RoBERTa, BERT and DistilBERT.
no code implementations • 17 Apr 2020 • Gabriel Gordon-Hall, Philip John Gorinski, Gerasimos Lampouras, Ignacio Iacobacci
We present our submission to the End-to-End Multi-Domain Dialog Challenge Track of the Eighth Dialog System Technology Challenge.
no code implementations • ACL 2019 • Ignacio Iacobacci, Roberto Navigli
While word embeddings are now a de facto standard representation of words in most NLP tasks, recently the attention has been shifting towards vector representations which capture the different meanings, i. e., senses, of words.
no code implementations • CONLL 2017 • Massimiliano Mancini, Jose Camacho-Collados, Ignacio Iacobacci, Roberto Navigli
Word embeddings are widely used in Natural Language Processing, mainly due to their success in capturing semantic information from massive corpora.
no code implementations • 2 Aug 2016 • José Camacho-Collados, Ignacio Iacobacci, Roberto Navigli, Mohammad Taher Pilehvar
Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days.