Search Results for author: Veselin Stoyanov

Found 26 papers, 13 papers with code

Continual Few-Shot Learning for Text Classification

1 code implementation • EMNLP 2021 • Ramakanth Pasunuru, Veselin Stoyanov, Mohit Bansal

In this work, we propose a continual few-shot learning (CFL) task, in which a system is challenged with a difficult phenomenon and asked to learn to correct mistakes with only a few (10 to 15) training examples.

continual few-shot learning Few-Shot Learning +4

Paper
Code

Prompt-free and Efficient Few-shot Learning with Language Models

1 code implementation • ACL 2022 • Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Lambert Mathias, Marzieh Saeidi, Veselin Stoyanov, Majid Yazdani

Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score.

Few-Shot Learning

105

Paper
Code

Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model

no code implementations • 23 May 2023 • Zeyu Leo Liu, Tim Dettmers, Xi Victoria Lin, Veselin Stoyanov, Xian Li

Large and sparse feed-forward layers (S-FFN) such as Mixture-of-Experts (MoE) have proven effective in scaling up Transformers model size for \textit{pretraining} large language models.

Avg Language Modelling +1

Paper
Add Code

Improving In-Context Few-Shot Learning via Self-Supervised Training

no code implementations • NAACL 2022 • Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva

Self-supervised pretraining has made few-shot learning possible for many NLP tasks.

Few-Shot Learning

Paper
Add Code

PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models

1 code implementation • 3 Apr 2022 • Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Marzieh Saeidi, Lambert Mathias, Veselin Stoyanov, Majid Yazdani

Few-Shot Learning

105

Paper
Code

Few-shot Learning with Multilingual Language Models

2 code implementations • 20 Dec 2021 • Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

Large-scale generative language models such as GPT-3 are competitive few-shot learners.

Cross-Lingual Transfer Few-Shot Learning +5

29,181

Paper
Code

Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

1 code implementation • 26 Nov 2021 • Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

In this paper, we discuss approaches to detecting when models have beliefs about the world, and we improve on methods for updating model beliefs to be more truthful, with a focus on methods based on learned optimizers or hypernetworks.

Paper
Code

Multi-task Retrieval for Knowledge-Intensive Tasks

no code implementations • ACL 2021 • Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oğuz, Veselin Stoyanov, Gargi Ghosh

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking.

Fact Checking Open-Domain Question Answering +1

Paper
Add Code

Pretrained Language Models for Biomedical and Clinical Tasks: Understanding and Extending the State-of-the-Art

1 code implementation • 1 Nov 2020 • Patrick Lewis, Myle Ott, Jingfei Du, Veselin Stoyanov

A large array of pretrained models are available to the biomedical NLP (BioNLP) community.

151

Paper
Code

General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference

no code implementations • Findings of the Association for Computational Linguistics 2020 • Jingfei Du, Myle Ott, Haoran Li, Xing Zhou, Veselin Stoyanov

The resulting method offers a compelling solution for using large-scale pre-trained models at a fraction of the computational cost when multiple tasks are performed on the same text.

Knowledge Distillation Quantization

Paper
Add Code

Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model

no code implementations • ICLR 2020 • Wenhan Xiong, Jingfei Du, William Yang Wang, Veselin Stoyanov

Models trained with our new objective yield significant improvements on the fact completion task.

Entity Typing Language Modelling +4

Paper
Add Code

SemEval-2013 Task 2: Sentiment Analysis in Twitter

no code implementations • SEMEVAL 2013 • Preslav Nakov, Zornitsa Kozareva, Alan Ritter, Sara Rosenthal, Veselin Stoyanov, Theresa Wilson

To address this issue, we have proposed SemEval-2013 Task 2: Sentiment Analysis in Twitter, which included two subtasks: A, an expression-level subtask, and B, a message-level subtask.

Sentiment Analysis Task 2

Paper
Add Code

SemEval-2014 Task 9: Sentiment Analysis in Twitter

no code implementations • SEMEVAL 2014 • Sara Rosenthal, Preslav Nakov, Alan Ritter, Veselin Stoyanov

We describe the Sentiment Analysis in Twitter task, ran as part of SemEval-2014.

Sentiment Analysis

Paper
Add Code

SemEval-2015 Task 10: Sentiment Analysis in Twitter

no code implementations • SEMEVAL 2015 • Sara Rosenthal, Saif M. Mohammad, Preslav Nakov, Alan Ritter, Svetlana Kiritchenko, Veselin Stoyanov

In this paper, we describe the 2015 iteration of the SemEval shared task on Sentiment Analysis in Twitter.

Sentiment Analysis

Paper
Add Code

SemEval-2016 Task 4: Sentiment Analysis in Twitter

no code implementations • SEMEVAL 2016 • Preslav Nakov, Alan Ritter, Sara Rosenthal, Fabrizio Sebastiani, Veselin Stoyanov

The three new subtasks focus on two variants of the basic ``sentiment classification in Twitter'' task.

General Classification Sentiment Analysis +1

Paper
Add Code

Unsupervised Cross-lingual Representation Learning at Scale

26 code implementations • ACL 2020 • Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov

We also present a detailed empirical analysis of the key factors that are required to achieve these gains, including the trade-offs between (1) positive transfer and capacity dilution and (2) the performance of high and low resource languages at scale.

Cross-Lingual Transfer Multilingual NLP +2

124,353

Paper
Code

Emerging Cross-lingual Structure in Pretrained Language Models

no code implementations • ACL 2020 • Shijie Wu, Alexis Conneau, Haoran Li, Luke Zettlemoyer, Veselin Stoyanov

We study the problem of multilingual masked language modeling, i. e. the training of a single model on concatenated text from multiple languages, and present a detailed study of several factors that influence why these models are so effective for cross-lingual transfer.

Cross-Lingual Transfer Language Modelling +2

Paper
Add Code

Bridging the domain gap in cross-lingual document classification

1 code implementation • 16 Sep 2019 • Guokun Lai, Barlas Oguz, Yiming Yang, Veselin Stoyanov

We consider the setting of semi-supervised cross-lingual understanding, where labeled data is available in a source language (English), but only unlabeled data is available in the target language.

Ranked #1 on Cross-Domain Document Classification on Amazon cn (Dianping train)