Search Results for author: Krishna Srinivasan

Found 11 papers, 4 papers with code

Ambiguity-Aware In-Context Learning with Large Language Models

no code implementations14 Sep 2023 Lingyu Gao, Aditi Chaudhary, Krishna Srinivasan, Kazuma Hashimoto, Karthik Raman, Michael Bendersky

In-context learning (ICL), i.e., showing LLMs only a few task-specific demonstrations, has led to downstream gains with no task-specific fine-tuning required.

In-Context Learning · Semantic Similarity · +3
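
As a quick illustration of the setting this paper studies, here is a minimal sketch of ICL prompting. The function name `build_icl_prompt` and the placeholder `call_llm` are illustrative, not from the paper:

```python
# Minimal sketch of in-context learning (ICL): the model sees a few
# task demonstrations in the prompt and no weights are updated.
# `call_llm` is a hypothetical stand-in for any text-completion endpoint.

def build_icl_prompt(demos, query):
    """Format k task-specific demonstrations followed by the test input."""
    lines = [f"Input: {x}\nLabel: {y}" for x, y in demos]
    lines.append(f"Input: {query}\nLabel:")
    return "\n\n".join(lines)

demos = [
    ("The movie was a delight.", "positive"),
    ("Utterly forgettable plot.", "negative"),
]
prompt = build_icl_prompt(demos, "A stunning, heartfelt film.")
# prediction = call_llm(prompt)  # hypothetical LLM call; no fine-tuning involved
print(prompt)
```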

Exploring the Viability of Synthetic Query Generation for Relevance Prediction

no code implementations19 May 2023 Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Kazuma Hashimoto, Mike Bendersky, Marc Najork

While our experiments demonstrate that these modifications help improve the performance of QGen techniques, we also find that QGen approaches struggle to capture the full nuance of the relevance label space, and as a result the generated queries are not faithful to the desired relevance label.

Information Retrieval · Question Answering · +2

QUILL: Query Intent with Large Language Models using Retrieval Augmentation and Multi-stage Distillation

no code implementations27 Oct 2022 Krishna Srinivasan, Karthik Raman, Anupam Samanta, Lingrui Liao, Luca Bertelli, Mike Bendersky

Thus, in this paper we make the following contributions: (1) We demonstrate that Retrieval Augmentation of queries provides LLMs with valuable additional context, enabling improved understanding.

Feature Engineering · Knowledge Distillation · +1
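
To make the retrieval-augmentation idea concrete, here is a toy sketch of prepending retrieved snippets to a query before asking an LLM for its intent. The `retrieve` function and its in-memory index are hypothetical illustrations, not the paper's pipeline:

```python
# Minimal sketch of retrieval augmentation for query understanding,
# assuming a hypothetical `retrieve` function over some document index.

def retrieve(query, k=3):
    """Hypothetical retriever: return top-k text snippets for the query."""
    index = {
        "jaguar speed": ["The jaguar can reach speeds of 80 km/h...",
                         "Jaguar Cars' F-Type tops out near 300 km/h..."],
    }
    return index.get(query, [])[:k]

def augmented_prompt(query):
    """Prepend retrieved snippets so the LLM sees extra context for intent."""
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nClassify the intent of the query: {query}"

print(augmented_prompt("jaguar speed"))
```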

Transforming Sequence Tagging Into A Seq2Seq Task

no code implementations16 Mar 2022 Karthik Raman, Iftekhar Naim, Jiecao Chen, Kazuma Hashimoto, Kiran Yalasangi, Krishna Srinivasan

Pretrained, large, generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks.

Hallucination · Structured Prediction · +1
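
The core move the title describes, casting tagging as sequence-to-sequence generation, can be sketched as follows. The `token|tag` target format here is illustrative only; the paper's actual linearization scheme may differ:

```python
# Minimal sketch of casting sequence tagging as a seq2seq task:
# the source is the raw sentence, the target is a linearized tag string.

tokens = ["Karthik", "works", "at", "Google"]
tags   = ["B-PER",   "O",     "O",  "B-ORG"]

source = " ".join(tokens)
target = " ".join(f"{tok}|{tag}" for tok, tag in zip(tokens, tags))

print(source)  # "Karthik works at Google"
print(target)  # "Karthik|B-PER works|O at|O Google|B-ORG"
# A seq2seq LM is then trained to map `source` -> `target`; decoding the
# target string recovers one tag per input token.
```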

MURAL: Multimodal, Multitask Retrieval Across Languages

no code implementations10 Sep 2021 Aashi Jain, Mandy Guo, Krishna Srinivasan, Ting Chen, Sneha Kudugunta, Chao Jia, Yinfei Yang, Jason Baldridge

Both image-caption pairs and translation pairs provide the means to learn deep representations of and connections between languages.

Image-text matching · Retrieval · +5

DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries

no code implementations23 Oct 2020 Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Jiecao Chen

In particular, by requiring the model to predict the language-specific token, the MLM objective disincentivizes learning a language-agnostic representation, which is a key goal of multilingual pre-training.

Language Modelling · Masked Language Modeling · +1
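
A toy sketch of the underlying idea: when a token is masked, the acceptable targets can include cross-lingual dictionary translations rather than only the original language-specific token. The dictionary contents and data format below are illustrative assumptions, not the paper's implementation:

```python
# Toy sketch of the DICT-MLM idea: masked positions may be answered with a
# bilingual-dictionary synonym, nudging representations to be language-agnostic.

import random

bilingual_dict = {"house": ["maison", "casa"], "dog": ["chien", "perro"]}

def dict_mlm_example(tokens, mask_prob=0.3):
    """Mask tokens; targets include dictionary translations when available."""
    inputs, targets = [], []
    for tok in tokens:
        if random.random() < mask_prob:
            inputs.append("[MASK]")
            targets.append([tok] + bilingual_dict.get(tok, []))
        else:
            inputs.append(tok)
            targets.append(None)  # no prediction needed for unmasked tokens
    return inputs, targets

random.seed(0)
print(dict_mlm_example(["the", "dog", "ran", "to", "the", "house"]))
```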
