Search Results for author: Stephen Rawls

Found 8 papers, 2 papers with code

Translation-Enhanced Multilingual Text-to-Image Generation

no code implementations • 30 May 2023 • Yaoyiran Li, Ching-Yun Chang, Stephen Rawls, Ivan Vulić, Anna Korhonen

Research on text-to-image generation (TTI) still predominantly focuses on the English language due to the lack of annotated image-caption data in other languages; in the long run, this might widen inequitable access to TTI technology.

Cross-lingual Text-to-Image Generation Crosslingual Text-to-Image Generation +6

Paper
Add Code

Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data

no code implementations • 4 Apr 2023 • Vladislav Lialin, Stephen Rawls, David Chan, Shalini Ghosh, Anna Rumshisky, Wael Hamza

Currently popular video-text data mining approach via automatic speech recognition (ASR) used in HowTo100M provides low-quality captions that often do not refer to the video content.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

1 code implementation • 2 Aug 2022 • Saleh Soltan, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan

In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various tasks.

Ranked #14 on Natural Language Inference on CommitmentBank

Causal Language Modeling Common Sense Reasoning +8

363

Paper
Code

Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

no code implementations • 15 Jun 2022 • Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, Jin Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak, Gokmen Oz, Enrico Palumbo, Charith Peris, Chandana Satya Prakash, Stephen Rawls, Andy Rosenbaum, Anjali Shenoy, Saleh Soltan, Mukund Harakere Sridhar, Liz Tan, Fabian Triefenbach, Pan Wei, Haiyang Yu, Shuai Zheng, Gokhan Tur, Prem Natarajan

We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9. 3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system.

Cross-Lingual Natural Language Inference intent-classification +5

Paper
Add Code

Don't Parse, Insert: Multilingual Semantic Parsing with Insertion Based Decoding

no code implementations • CONLL 2020 • Qile Zhu, Haidar Khan, Saleh Soltan, Stephen Rawls, Wael Hamza

For complex parsing tasks, the state-of-the-art method is based on autoregressive sequence to sequence models to generate the parse directly.

Natural Language Understanding Semantic Parsing +4

Paper
Add Code

Implicit Language Model in LSTM for OCR

1 code implementation • 23 May 2018 • Ekraam Sabir, Stephen Rawls, Prem Natarajan

Neural networks have become the technique of choice for OCR, but many aspects of how and why they deliver superior performance are still unknown.

Language Modelling Optical Character Recognition (OCR)

Paper
Code

Pose-Aware Face Recognition in the Wild

no code implementations • CVPR 2016 • Iacopo Masi, Stephen Rawls, Gerard Medioni, Prem Natarajan

We propose a method to push the frontiers of unconstrained face recognition in the wild, focusing on the problem of extreme pose variations.

Face Recognition

Paper
Add Code

Learning Document Image Binarization from Data

no code implementations • 4 May 2015 • Yue Wu, Stephen Rawls, Wael Abd-Almageed, Premkumar Natarajan

In this paper we present a fully trainable binarization solution for degraded document images.

Binarization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.