Teaching Small Language Models to Reason

no code implementations 16 Dec 2022 Lucie Charlotte Magister, Jonathan Mallinson, Jakub Adamek, Eric Malmi, Aliaksei Severyn

Chain of thought prompting successfully improves the reasoning capabilities of large language models, achieving state of the art results on a range of datasets.

Text Generation with Text-Editing Models

no code implementations NAACL (ACL) 2022 Eric Malmi, Yue Dong, Jonathan Mallinson, Aleksandr Chuklin, Jakub Adamek, Daniil Mirylenka, Felix Stahlberg, Sebastian Krause, Shankar Kumar, Aliaksei Severyn

Text-editing models have recently become a prominent alternative to seq2seq models for monolingual text-generation tasks such as grammatical error correction, simplification, and style transfer.

EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start

no code implementations 24 May 2022 Jonathan Mallinson, Jakub Adamek, Eric Malmi, Aliaksei Severyn

This is achieved by decomposing the generation process into three sub-tasks: (1) tagging to decide on the subset of input tokens to be preserved in the output, (2) re-ordering to define their order in the output text, and (3) insertion to infill the missing tokens that are not present in the input.
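
The three sub-tasks named above can be illustrated with a toy sketch. This is not the EdiT5 implementation: the function `apply_edits` and its arguments are hypothetical, and the tags, ordering, and inserted tokens are written by hand here, whereas in the paper they are predicted by the model.

```python
from typing import Dict, List


def apply_edits(
    source: List[str],
    keep: List[bool],
    order: List[int],
    insertions: Dict[int, List[str]],
) -> List[str]:
    """Apply hand-specified tagging, re-ordering, and insertion decisions.

    All arguments are illustrative assumptions, not the paper's data format.
    """
    # (1) Tagging: keep only the source tokens marked for preservation.
    kept = [tok for tok, k in zip(source, keep) if k]

    # (2) Re-ordering: `order` is a permutation of indices into `kept`.
    reordered = [kept[i] for i in order]

    # (3) Insertion: insertions[i] holds new tokens placed before position i
    # of the re-ordered sequence; key len(reordered) appends at the end.
    output: List[str] = []
    for i, tok in enumerate(reordered):
        output.extend(insertions.get(i, []))
        output.append(tok)
    output.extend(insertions.get(len(reordered), []))
    return output


if __name__ == "__main__":
    source = "Turing was born in 1912 . Turing died in 1954 .".split()
    # Drop the first full stop (index 5) and the repeated subject (index 6).
    keep = [i not in (5, 6) for i in range(len(source))]
    order = list(range(sum(keep)))  # kept tokens stay in their original order
    insertions = {5: ["and"]}       # insert "and" before "died"
    print(" ".join(apply_edits(source, keep, order, insertions)))
    # prints: Turing was born in 1912 and died in 1954 .
```

The example fuses two sentences by deleting two tokens and inserting one; most of the output is copied from the input rather than generated.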

Controlled Text Generation as Continuous Optimization with Multiple Constraints

1 code implementation NeurIPS 2021 Sachin Kumar, Eric Malmi, Aliaksei Severyn, Yulia Tsvetkov

As large-scale language model pretraining pushes the state-of-the-art in text generation, recent work has turned to controlling attributes of the text such models generate.

A Simple Recipe for Multilingual Grammatical Error Correction

1 code implementation ACL 2021 Sascha Rothe, Jonathan Mallinson, Eric Malmi, Sebastian Krause, Aliaksei Severyn

This paper presents a simple recipe to train state-of-the-art multilingual Grammatical Error Correction (GEC) models.

Unsupervised Text Style Transfer with Padded Masked Language Models

no code implementations EMNLP 2020 Eric Malmi, Aliaksei Severyn, Sascha Rothe

This allows us to identify the source tokens to delete to transform the source text to match the style of the target domain.

Rapformer: Conditional Rap Lyrics Generation with Denoising Autoencoders

no code implementations INLG (ACL) 2020 Nikola I. Nikolov, Eric Malmi, Curtis G. Northcutt, Loreto Parisi

The ability to combine symbols to generate language is a defining characteristic of human intelligence, particularly in the context of artistic story-telling through lyrics.

Felix: Flexible Text Editing Through Tagging and Insertion

2 code implementations Findings of the Association for Computational Linguistics 2020 Jonathan Mallinson, Aliaksei Severyn, Eric Malmi, Guillermo Garrido

We achieve this by decomposing the text-editing task into two sub-tasks: tagging to decide on the subset of input tokens and their order in the output text and insertion to in-fill the missing tokens in the output not present in the input.

DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion

2 code implementations NAACL 2019 Mor Geva, Eric Malmi, Idan Szpektor, Jonathan Berant

We author a set of rules for identifying a diverse set of discourse phenomena in raw text, and decomposing the text into two independent sentences.

Responsible team players wanted: an analysis of soft skill requirements in job advertisements

no code implementations 13 Oct 2018 Federica Calanca, Luiza Sayfullina, Lara Minkus, Claudia Wagner, Eric Malmi

Our work shows that soft skills can serve as partial predictors of the gender composition in job categories and that not all soft skills receive equal wage returns at the labour market.

Learning Representations for Soft Skill Matching

2 code implementations 20 Jul 2018 Luiza Sayfullina, Eric Malmi, Juho Kannala

The disambiguation is formulated as a binary text classification problem where the prediction is made for the potential soft skill based on the context where it occurs.

Domain Adaptation for Resume Classification Using Convolutional Neural Networks

no code implementations 18 Jul 2017 Luiza Sayfullina, Eric Malmi, Yiping Liao, Alex Jung

We propose a novel method for classifying resume data of job applicants into 27 different job categories using convolutional neural networks.

Automatic Prediction of Discourse Connectives

1 code implementation LREC 2018 Eric Malmi, Daniele Pighin, Sebastian Krause, Mikhail Kozhevnikov

We formulate the task of discourse connective prediction and release a dataset of 2.9M sentence pairs separated by discourse connectives for this task.

You Are What Apps You Use: Demographic Prediction Based on User's Apps

1 code implementation 29 Feb 2016 Eric Malmi, Ingmar Weber

Our work addresses this need by studying the predictability of user demographics based on the list of a user's apps, which is readily available to many app developers.

DopeLearning: A Computational Approach to Rap Lyrics Generation

1 code implementation 18 May 2015 Eric Malmi, Pyry Takala, Hannu Toivonen, Tapani Raiko, Aristides Gionis

First, we develop a prediction model to identify the next line of existing lyrics from a set of candidate next lines.

