Trending Research

Score-Guided Diffusion for 3D Human Recovery

statho/scorehmr • • 14 Mar 2024

We present Score-Guided Human Mesh Recovery (ScoreHMR), an approach for solving inverse problems for 3D human pose and shape reconstruction.

Denoising Human Mesh Recovery

316

0.60 stars / hour

Paper
Code

Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids

junchenliu77/rip-nerf • • 3 May 2024

Despite significant advancements in Neural Radiance Fields (NeRFs), the renderings may still suffer from aliasing and blurring artifacts, since it remains a fundamental challenge to effectively and efficiently characterize anisotropic areas induced by the cone-casting procedure.

0.58 stars / hour

Paper
Code

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

gigaai-research/general-world-models-survey • • 6 May 2024

General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.

Autonomous Driving Decision Making +1

0.55 stars / hour

Paper
Code

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

ailab-cvc/seed-x • • 22 Apr 2024

We hope that our work will inspire future research into what can be achieved by versatile multimodal foundation models in real-world applications.

Image Generation

224

0.54 stars / hour

Paper
Code

WavCraft: Audio Editing and Generation with Natural Language Prompts

jinhualiang/wavcraft • • 14 Mar 2024

We introduce WavCraft, a collective system that leverages large language models (LLMs) to connect diverse task-specific models for audio content creation and editing.

In-Context Learning

289

0.53 stars / hour

Paper
Code

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

mindspore-lab/mindone • • None 2023

We then explore the impact of finetuning our base model on high-quality data and train a text-to-video model that is competitive with closed-source video generation.

Image Generation Image to Video Generation

204

0.52 stars / hour

Paper
Code

TimeGPT-1

Nixtla/nixtla • 5 Oct 2023

In this paper, we introduce TimeGPT, the first foundation model for time series, capable of generating accurate predictions for diverse datasets not seen during training.

Time Series Time Series Analysis

1,551

0.51 stars / hour

Paper
Code

AlphaMath Almost Zero: process Supervision without process

MARIO-Math-Reasoning/Super_MARIO • 6 May 2024

We proceed to train a step-level value model designed to improve the LLM's inference process in mathematical domains.

Mathematical Reasoning

0.51 stars / hour

Paper
Code

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner • • NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99. 3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot Instruction Following +2

2,563

0.49 stars / hour

Paper
Code

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

tencentarc/instantmesh • • 10 Apr 2024

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability.

Image to 3D

1,924

0.48 stars / hour

Paper
Code