Long-range modeling

45 papers with code • 2 benchmarks • 4 datasets

A new task for testing the long-sequence modeling capabilities and efficiency of language models.

Image credit: SCROLLS: Standardized CompaRison Over Long Language Sequences

Libraries

Use these libraries to find Long-range modeling models and implementations

Most implemented papers

T-former: An Efficient Transformer for Image Inpainting

dengyecode/t-former_image_inpainting 12 May 2023

Based on this attention, a network called $T$-former is designed for image inpainting.

MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration

guo-stone/mambamorph 25 Jan 2024

Capturing voxel-wise spatial correspondence across distinct modalities is crucial for medical image analysis.

VM-UNet: Vision Mamba UNet for Medical Image Segmentation

jcruan519/vm-unet 4 Feb 2024

To the best of our knowledge, this is the first medical image segmentation model built purely on SSMs.

V4D: 4D Convolutional Neural Networks for Video-level Representation Learning

MalongTech/research-v4d 18 Feb 2020

Most existing 3D CNNs for video representation learning are clip-based methods, and thus do not consider video-level temporal evolution of spatio-temporal features.

DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

whwu95/DSANet 25 May 2021

Long-range and short-range temporal modeling are two complementary and crucial aspects of video recognition.

Image Super-Resolution With Non-Local Sparse Attention

HarukiYqM/Non-Local-Sparse-Attention CVPR 2021

NLSA is designed to retain long-range modeling capability from NL operation while enjoying robustness and high-efficiency of sparse representation.
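The idea behind sparse attention of this kind is that each query attends only to a small subset of keys (e.g. its most similar ones) instead of all positions, cutting cost while retaining long-range reach. A minimal numpy sketch of top-k sparse attention — the function name, shapes, and top-k selection rule are illustrative assumptions, not the paper's exact NLSA mechanism:

```python
import numpy as np

def topk_sparse_attention(q, k, v, top_k=4):
    """Toy sparse attention: each query attends only to its top_k
    most similar keys, rather than all seq_len positions.
    q, k, v: (seq_len, dim) arrays."""
    scores = q @ k.T / np.sqrt(q.shape[-1])            # (L, L) similarities
    # keep the top_k scores per query row, mask the rest to -inf
    kth = np.partition(scores, -top_k, axis=-1)[:, -top_k][:, None]
    masked = np.where(scores >= kth, scores, -np.inf)
    # softmax over the surviving entries (exp(-inf) -> 0 weight)
    weights = np.exp(masked - masked.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
L, d = 16, 8
out = topk_sparse_attention(rng.normal(size=(L, d)),
                            rng.normal(size=(L, d)),
                            rng.normal(size=(L, d)))
print(out.shape)  # (16, 8)
```

In a real model the top-k selection is done per attention head and combined with hashing or bucketing so the selection itself stays sub-quadratic.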

Sparse Factorization of Large Square Matrices

ruslankhalitov/sparsefactorization 16 Sep 2021

The sparse factorization method is tested for a variety of synthetic and real-world square matrices.

LongT5: Efficient Text-To-Text Transformer for Long Sequences

google-research/longt5 Findings (NAACL) 2022

Recent work has shown that either (1) increasing the input length or (2) increasing model size can improve the performance of Transformer-based neural models.

Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks

leicheng-no/cdil-cnn 6 Jan 2022

Recurrent Neural Networks, Transformers, and Convolutional Neural Networks are three major techniques for learning from sequential data.
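Circular dilated convolutions combine wrap-around padding with exponentially growing dilation so a shallow stack covers very long sequences. A minimal numpy sketch of one such layer for a 1-D signal — the function name and symmetric tap layout are illustrative assumptions, not the paper's exact architecture:

```python
import numpy as np

def circular_dilated_conv(x, w, dilation):
    """1-D convolution with circular (wrap-around) padding and dilation.
    x: (seq_len,) signal, w: (kernel_size,) weights.
    Taps are placed symmetrically around each position, spaced by
    `dilation`, and indices wrap modulo the sequence length."""
    L, K = len(x), len(w)
    offsets = (np.arange(K) - K // 2) * dilation   # centred tap offsets
    out = np.zeros(L)
    for t in range(L):
        out[t] = sum(w[j] * x[(t + offsets[j]) % L] for j in range(K))
    return out

# identity kernel: centre tap only, so the signal passes through unchanged
x = np.arange(6, dtype=float)
y = circular_dilated_conv(x, np.array([0.0, 1.0, 0.0]), dilation=2)
print(np.allclose(y, x))  # True
```

Stacking such layers with dilation 1, 2, 4, ... doubles the receptive field at each layer, so coverage of a length-L sequence needs only O(log L) layers.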