Trending Research

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

mindspore-lab/mindone • 2023

We then explore the impact of finetuning our base model on high-quality data and train a text-to-video model that is competitive with closed-source video generation.

Image Generation • Image to Video Generation

309

1.27 stars / hour

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

mit-han-lab/qserve • 7 May 2024

The key insight driving QServe is that the efficiency of LLM serving on GPUs is critically influenced by operations on low-throughput CUDA cores.

Language Modelling • Large Language Model • +1

185

1.26 stars / hour

Autonomous LLM-driven research from data to human-verifiable research papers

technion-kishony-lab/data-to-paper • 24 Apr 2024

As AI promises to accelerate scientific discovery, it remains unclear whether fully AI-driven research is possible and whether it can adhere to key scientific values, such as transparency, traceability and verifiability.

233

1.18 stars / hour

Improving Diffusion Models for Virtual Try-on

yisol/IDM-VTON • 8 Mar 2024

Finally, we present a customization method using a pair of person-garment images, which significantly improves fidelity and authenticity.

Ranked #1 on Virtual Try-on on VITON-HD

Virtual Try-on

2,297

1.16 stars / hour

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

OS-Copilot/FRIDAY • 12 Feb 2024

Autonomous interaction with the computer has been a longstanding challenge with great potential, and the recent proliferation of large language models (LLMs) has markedly accelerated progress in building digital agents.

1,241

1.04 stars / hour

KAN: Kolmogorov-Arnold Networks

Blealtan/efficient-kan • 30 Apr 2024

Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs).

2,153

0.99 stars / hour

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

zzzhang-jx/docres • 7 May 2024

This underscores the potential of DocRes across a broader spectrum of document image restoration tasks.

Binarization • Deblurring • +3

126

0.97 stars / hour

TimeGPT-1

Nixtla/nixtla • 5 Oct 2023

In this paper, we introduce TimeGPT, the first foundation model for time series, capable of generating accurate predictions for diverse datasets not seen during training.

Time Series • Time Series Analysis

1,618

0.96 stars / hour

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

thudm/inf-dit • 7 May 2024

However, due to a quadratic increase in memory when generating ultra-high-resolution images (e.g., 4096×4096), the resolution of generated images is often limited to 1024×1024.

Image Generation • Super-Resolution

115

0.90 stars / hour

X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Molecular Design

ericlbuehler/mistral.rs • 11 Feb 2024

Starting with a set of pre-trained LoRA adapters, our gating strategy uses the hidden states to dynamically mix adapted layers, allowing the resulting X-LoRA model to draw upon different capabilities and create never-before-used deep layer-wise combinations to solve tasks.

graph construction • Knowledge Graphs • +2

1,610

0.79 stars / hour
