no code implementations • 28 Sep 2022 • Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani
Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query, they provide provenance items retrieved from an updateable knowledge base.
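A minimal sketch of the retrieve-then-generate setup described above. The `retrieve` and `generate` callables are hypothetical placeholders, not the paper's components; only the output shape (answer plus provenance) follows the passage.

```python
def answer_with_provenance(query, retrieve, generate, k=5):
    """Return a generated answer together with the passages that support it."""
    passages = retrieve(query, k)        # drawn from an updateable index
    answer = generate(query, passages)   # generation conditioned on retrievals
    return {"answer": answer, "provenance": passages}
```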
no code implementations • 7 Jul 2022 • Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani
This paper studies multi-task training of retrieval-augmented generation models for knowledge-intensive tasks.
no code implementations • 16 Mar 2022 • Karthik Raman, Iftekhar Naim, Jiecao Chen, Kazuma Hashimoto, Kiran Yalasangi, Krishna Srinivasan
Large pretrained generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks.
3 code implementations • 2 Mar 2021 • Krishna Srinivasan, Karthik Raman, Jiecao Chen, Michael Bendersky, Marc Najork
First, WIT is the largest multimodal dataset by number of image-text examples, roughly 3x larger than the next largest (at the time of writing).
no code implementations • 23 Oct 2020 • Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Jiecao Chen
In particular, by requiring the model to predict the language-specific token, the MLM objective disincentivizes learning a language-agnostic representation -- which is a key goal of multilingual pre-training.
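A minimal sketch of the standard MLM objective the passage critiques: the model must recover the exact language-specific token that was masked, which pushes its representations to encode language identity. The masking rate and label convention are the usual ones, but this is illustrative, not the paper's code.

```python
import random

def mask_for_mlm(token_ids, mask_id, mask_prob=0.15):
    """Return (inputs, labels) for masked language modeling.

    labels[i] holds the original token id where a mask was applied,
    and -100 (ignored by the loss) everywhere else.
    """
    inputs, labels = [], []
    for tok in token_ids:
        if random.random() < mask_prob:
            inputs.append(mask_id)
            labels.append(tok)   # target is the language-specific token
        else:
            inputs.append(tok)
            labels.append(-100)  # position not scored
    return inputs, labels
```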
no code implementations • Findings of the Association for Computational Linguistics 2020 • Jiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai, Ehsan Emadzadeh
Pre-trained models like BERT (Devlin et al., 2018) have dominated NLP/IR applications such as single-sentence classification, text-pair classification, and question answering.
no code implementations • NeurIPS 2019 • Ankit Singh Rawat, Jiecao Chen, Felix Yu, Ananda Theertha Suresh, Sanjiv Kumar
In settings where a large number of classes is involved, a common method to speed up training is to sample a subset of classes and use an estimate of the loss gradient based on those classes, a technique known as sampled softmax.
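A minimal sampled-softmax sketch, assuming uniform sampling over classes; practical implementations additionally correct the sampled logits for the sampling distribution (e.g. subtracting log q(y)), which is omitted here.

```python
import numpy as np

def sampled_softmax_loss(logits_fn, true_class, num_classes, num_samples, rng):
    """Approximate cross-entropy by scoring only a sampled subset of classes.

    logits_fn(class_ids) returns scores for just those classes, so the
    full output layer is never materialized.
    """
    negatives = rng.choice(num_classes, size=num_samples, replace=False)
    negatives = negatives[negatives != true_class]
    candidates = np.concatenate(([true_class], negatives))
    scores = logits_fn(candidates)
    # softmax over the sampled candidates only
    scores = scores - scores.max()
    log_probs = scores - np.log(np.exp(scores).sum())
    return -log_probs[0]  # negative log-likelihood of the true class
```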
no code implementations • 29 Oct 2018 • Jiecao Chen, Qin Zhang
In this paper we study how to perform distinct sampling in the streaming model where data contain near-duplicates.
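A minimal sketch of distinct sampling via min-wise hashing, extended with a hypothetical `canonicalize` step that maps near-duplicates to a single key; the paper's notion of near-duplicate groups is more general than this toy normalization.

```python
import hashlib

def canonicalize(item: str) -> str:
    # toy normalization: lowercase and collapse whitespace
    return " ".join(item.lower().split())

def distinct_sample(stream):
    """Keep the item whose canonical key has the smallest hash value.

    Because the hash depends only on the canonical key, each group of
    near-duplicates is sampled (approximately) uniformly, no matter how
    many times its members appear in the stream.
    """
    best = None
    for item in stream:
        h = hashlib.sha1(canonicalize(item).encode()).hexdigest()
        if best is None or h < best[0]:
            best = (h, item)
    return best[1] if best else None
```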
no code implementations • 16 Oct 2018 • Sashank J. Reddi, Satyen Kale, Felix Yu, Dan Holtmann-Rice, Jiecao Chen, Sanjiv Kumar
Furthermore, we identify a particularly intuitive class of loss functions in the aforementioned family and show that they are amenable to practical implementation in the large output space setting (i.e., computation is possible without evaluating scores of all labels) by developing a technique called Stochastic Negative Mining.
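A minimal sketch of the stochastic negative mining idea: sample a subset of classes, keep only the highest-scoring ("hardest") sampled negatives, and compute a margin loss against them. The hinge form, parameter names, and `score_fn` interface are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np

def snm_loss(score_fn, positive, num_classes, sample_size, top_k, rng, margin=1.0):
    """Hinge-style loss on the top-k hardest negatives from a random sample."""
    sampled = rng.choice(num_classes, size=sample_size, replace=False)
    sampled = sampled[sampled != positive]
    neg_scores = score_fn(sampled)
    hardest = np.sort(neg_scores)[-top_k:]        # highest-scoring negatives
    pos_score = score_fn(np.array([positive]))[0]
    return np.maximum(0.0, margin - pos_score + hardest).mean()
```

Only `sample_size + 1` scores are ever computed, which is what makes the approach viable when the label space is huge.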
no code implementations • NeurIPS 2018 • Jiecao Chen, Erfan Sadeqi Azer, Qin Zhang
We study classic $k$-means/median clustering, fundamental problems in unsupervised learning, in the setting where data are partitioned across multiple sites and we are allowed to discard a small portion of the data by labeling them as outliers.
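For intuition, here is a minimal centralized sketch of $k$-means with outliers in the spirit of the k-means-- heuristic (Chawla & Gionis, 2013): each Lloyd iteration sets aside the $z$ farthest points before recomputing centers. The paper's distributed algorithm targets the same objective but is considerably more involved.

```python
import numpy as np

def kmeans_with_outliers(X, k, z, iters=20, seed=0):
    """Lloyd-style k-means that ignores the z farthest points each round."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        d = np.linalg.norm(X[:, None] - centers[None], axis=2)   # (n, k)
        nearest = d.min(axis=1)
        keep = np.argsort(nearest)[:-z] if z > 0 else np.arange(len(X))
        assign = d[keep].argmin(axis=1)
        for j in range(k):
            pts = X[keep][assign == j]
            if len(pts):
                centers[j] = pts.mean(axis=0)
    d = np.linalg.norm(X[:, None] - centers[None], axis=2)
    outliers = np.argsort(d.min(axis=1))[-z:] if z > 0 else np.array([], dtype=int)
    return centers, outliers
```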
no code implementations • NeurIPS 2018 • Jiecao Chen, Qin Zhang, Yuan Zhou
We study the collaborative PAC learning problem recently proposed by Blum et al. (2017), in which $k$ players want to learn a target function collaboratively, such that the learned function approximates the target well on all players' distributions simultaneously.
no code implementations • ICML 2017 • Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White
In this work, we develop an approximately submodular criterion for this setting and an efficient online greedy submodular maximization algorithm for optimizing the criterion.
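A minimal threshold-based streaming sketch of greedy submodular maximization under a cardinality constraint; the paper's criterion and algorithm are specific to its setting, so treat `gain` and the fixed-threshold rule as illustrative assumptions.

```python
def streaming_greedy(stream, gain, k, threshold):
    """Add an arriving item when its marginal gain clears the threshold.

    gain(S, x) must return f(S + {x}) - f(S) for a monotone submodular f.
    """
    S = []
    for x in stream:
        if len(S) < k and gain(S, x) >= threshold:
            S.append(x)
    return S
```

Submodularity is what makes a single greedy pass sensible here: marginal gains only shrink as `S` grows, so an item rejected now would fare no better later against the same threshold.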
no code implementations • ICML 2017 • Jiecao Chen, Xi Chen, Qin Zhang, Yuan Zhou
We study the problem of selecting $K$ arms with the highest expected rewards in a stochastic $n$-armed bandit game.
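As a baseline illustration of the problem setup, here is a minimal uniform-sampling sketch that pulls every arm equally often and returns the $K$ largest empirical means; the paper studies adaptive strategies with far better sample complexity.

```python
import numpy as np

def naive_top_k(pull, n, K, budget_per_arm):
    """pull(i) draws one stochastic reward from arm i."""
    means = np.array([
        np.mean([pull(i) for _ in range(budget_per_arm)]) for i in range(n)
    ])
    return np.argsort(means)[-K:][::-1]  # indices of the K largest means
```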
no code implementations • NeurIPS 2016 • Jiecao Chen, He Sun, David P. Woodruff, Qin Zhang
We would like the quality of the clustering in the distributed setting to match that in the centralized setting for which all the data resides on a single site.