We present speculative sampling, an algorithm for accelerating transformer decoding by enabling the generation of multiple tokens from each transformer call.

Paper
Code

OpenICL: An Open-Source Framework for In-context Learning

shark-nlp/openicl • • 6 Mar 2023

However, the implementation of ICL is sophisticated due to the diverse retrieval and inference methods involved, as well as the varying pre-processing requirements for different models, datasets, and tasks.

Paper
Code

Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference

microsoft/FLAML • 8 Mar 2023

Large Language Models (LLMs) have sparked significant interest in their generative capabilities, leading to the development of various commercial applications.

Paper
Code

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

xinhaomei/wavcaps • • 30 Mar 2023

To address this data scarcity issue, we introduce WavCaps, the first large-scale weakly-labelled audio captioning dataset, comprising approximately 400k audio clips with paired captions.

Paper
Code

SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification

flexflow/flexflow • 16 May 2023

Our evaluation shows that SpecInfer outperforms existing LLM serving systems by 1. 5-2. 8x for distributed LLM inference and by 2. 6-3. 5x for offloading-based LLM inference, while preserving the same generative performance.

Paper
Code

Hierarchical Prompting Assists Large Language Model on Web Navigation

robert1003/ash-prompting • 23 May 2023

Large language models (LLMs) struggle on processing complicated observations in interactive decision making tasks.

Paper
Code

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

bradyfu/awesome-multimodal-large-language-models • 23 Jun 2023

Multimodal Large Language Model (MLLM) relies on the powerful LLM to perform multimodal tasks, showing amazing emergent abilities in recent studies, such as writing poems based on an image.

Paper
Code

Large Language Model

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result