Search Results for author: Surin Ahn

Found 3 papers, 2 papers with code

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

2 code implementations2 Jul 2024 Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

With the pattern and sparse indices, we perform efficient sparse attention calculations via our optimized GPU kernels to significantly reduce the latency in the pre-filling stage of long-context LLMs.

Language Modelling Large Language Model

Uncertainty Quantification for Local Model Explanations Without Model Access

1 code implementation13 Jan 2023 Surin Ahn, Justin Grana, Yafet Tamene, Kristian Holsheimer

We present a model-agnostic algorithm for generating post-hoc explanations and uncertainty intervals for a machine learning model when only a static sample of inputs and outputs from the model is available, rather than direct access to the model itself.

regression Uncertainty Quantification

Global Multiclass Classification and Dataset Construction via Heterogeneous Local Experts

no code implementations21 May 2020 Surin Ahn, Ayfer Ozgur, Mert Pilanci

In the domains of dataset construction and crowdsourcing, a notable challenge is to aggregate labels from a heterogeneous set of labelers, each of whom is potentially an expert in some subset of tasks (and less reliable in others).

Classification Federated Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.