16k

77 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Datasets


Most implemented papers

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

dao-ailab/flash-attention 27 May 2022

We also extend FlashAttention to block-sparse attention, yielding an approximate attention algorithm that is faster than any existing approximate attention method.

Long Range Arena: A Benchmark for Efficient Transformers

google-research/long-range-arena 8 Nov 2020

In the recent months, a wide spectrum of efficient, fast Transformers have been proposed to tackle this problem, more often than not claiming superior or comparable model quality to vanilla Transformer models.

Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset

google-research-datasets/dstc8-schema-guided-dialogue 12 Sep 2019

In this work, we introduce the the Schema-Guided Dialogue (SGD) dataset, containing over 16k multi-domain conversations spanning 16 domains.

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

thudm/longbench 28 Aug 2023

In this paper, we introduce LongBench, the first bilingual, multi-task benchmark for long context understanding, enabling a more rigorous evaluation of long context understanding.

Long-form factuality in large language models

google-deepmind/long-form-factuality 27 Mar 2024

Empirically, we demonstrate that LLM agents can outperform crowdsourced human annotators - on a set of ~16k individual facts, SAFE agrees with crowdsourced human annotators 72% of the time, and on a random subset of 100 disagreement cases, SAFE wins 76% of the time.

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

test-time-training/ttt-lm-pytorch 5 Jul 2024

We evaluate our instantiations at the scale of 125M to 1. 3B parameters, comparing with a strong Transformer and Mamba, a modern RNN.

Visual Semantic Role Labeling

s-gupta/v-coco 17 May 2015

In this paper we introduce the problem of Visual Semantic Role Labeling: given an image we want to detect people doing actions and localize the objects of interaction.

Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

firojalam/COVID-19-disinformation Findings (EMNLP) 2021

With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.

Investigating Efficiently Extending Transformers for Long Input Summarization

google-research/pegasus 8 Aug 2022

While large pretrained Transformer models have proven highly capable at tackling natural language tasks, handling long sequence inputs continues to be a significant challenge.

An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions

bnu-ivc/ccpg CVPR 2023

For the cloth-changing problem, video-based ReID is rarely studied due to the lack of a suitable cloth-changing benchmark, and gait recognition is often researched under controlled conditions.