Search Results for author: Jiecao Chen

Found 14 papers, 1 papers with code

FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation

no code implementations28 Sep 2022 Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani

Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query they provide provenance items retrieved from an updateable knowledge base.

Open-Domain Question Answering Re-Ranking +2

Transforming Sequence Tagging Into A Seq2Seq Task

no code implementations16 Mar 2022 Karthik Raman, Iftekhar Naim, Jiecao Chen, Kazuma Hashimoto, Kiran Yalasangi, Krishna Srinivasan

Pretrained, large, generative language models (LMs) have had great success in a wide range of sequence tagging and structured prediction tasks.

Hallucination Structured Prediction +1

DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries

no code implementations23 Oct 2020 Aditi Chaudhary, Karthik Raman, Krishna Srinivasan, Jiecao Chen

In particular, by requiring the model to predict the language-specific token, the MLM objective disincentivizes learning a language-agnostic representation -- which is a key goal of multilingual pre-training.

Language Modelling Masked Language Modeling +1

Sampled Softmax with Random Fourier Features

no code implementations NeurIPS 2019 Ankit Singh Rawat, Jiecao Chen, Felix Yu, Ananda Theertha Suresh, Sanjiv Kumar

For the settings where a large number of classes are involved, a common method to speed up training is to sample a subset of classes and utilize an estimate of the loss gradient based on these classes, known as the sampled softmax method.

Distinct Sampling on Streaming Data with Near-Duplicates

no code implementations29 Oct 2018 Jiecao Chen, Qin Zhang

In this paper we study how to perform distinct sampling in the streaming model where data contain near-duplicates.

Data Structures and Algorithms

Stochastic Negative Mining for Learning with Large Output Spaces

no code implementations16 Oct 2018 Sashank J. Reddi, Satyen Kale, Felix Yu, Dan Holtmann-Rice, Jiecao Chen, Sanjiv Kumar

Furthermore, we identify a particularly intuitive class of loss functions in the aforementioned family and show that they are amenable to practical implementation in the large output space setting (i. e. computation is possible without evaluating scores of all labels) by developing a technique called Stochastic Negative Mining.

Retrieval

A Practical Algorithm for Distributed Clustering and Outlier Detection

no code implementations NeurIPS 2018 Jiecao Chen, Erfan Sadeqi Azer, Qin Zhang

We study the classic $k$-means/median clustering, which are fundamental problems in unsupervised learning, in the setting where data are partitioned across multiple sites, and where we are allowed to discard a small portion of the data by labeling them as outliers.

Clustering Outlier Detection

Tight Bounds for Collaborative PAC Learning via Multiplicative Weights

no code implementations NeurIPS 2018 Jiecao Chen, Qin Zhang, Yuan Zhou

We study the collaborative PAC learning problem recently proposed in Blum et al.~\cite{BHPQ17}, in which we have $k$ players and they want to learn a target function collaboratively, such that the learned function approximates the target function well on all players' distributions simultaneously.

PAC learning

Adapting Kernel Representations Online Using Submodular Maximization

no code implementations ICML 2017 Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White

In this work, we develop an approximately submodular criterion for this setting, and an efficient online greedy submodular maximization algorithm for optimizing the criterion.

Continual Learning

Adaptive Multiple-Arm Identification

no code implementations ICML 2017 Jiecao Chen, Xi Chen, Qin Zhang, Yuan Zhou

We study the problem of selecting $K$ arms with the highest expected rewards in a stochastic $n$-armed bandit game.

Communication-Optimal Distributed Clustering

no code implementations NeurIPS 2016 Jiecao Chen, He Sun, David P. Woodruff, Qin Zhang

We would like the quality of the clustering in the distributed setting to match that in the centralized setting for which all the data resides on a single site.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.