Trending Research

Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels

baofff/U-ViT • • NeurIPS 2023

In an effort to further advance semi-supervised generative and classification tasks, we propose a simple yet effective training strategy called dual pseudo training (DPT), built upon strong semi-supervised learners and diffusion models.

Classification

701

0.35 stars / hour

Paper
Code

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ • • 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

3,105

0.35 stars / hour

Paper
Code

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

macaronlin/llama3-quantization • • 22 Apr 2024

This exploration holds the potential to unveil new insights and challenges for low-bit quantization of LLaMA3 and other forthcoming LLMs, especially in addressing performance degradation problems that suffer in LLM compression.

Language Modelling Large Language Model +1

0.34 stars / hour

Paper
Code

BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models

ledzy/badam • • 3 Apr 2024

This work presents BAdam, an optimizer that leverages the block coordinate optimization framework with Adam as the inner solver.

117

0.34 stars / hour

Paper
Code

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

tencentarc/brushnet • • 11 Mar 2024

Image inpainting, the process of restoring corrupted images, has seen significant advancements with the advent of diffusion models (DMs).

Image Inpainting

907

0.34 stars / hour

Paper
Code

HGRN2: Gated Linear RNNs with State Expansion

sustcsonglin/flash-linear-attention • • 11 Apr 2024

Hierarchically gated linear RNN (HGRN, Qin et al. 2023) has demonstrated competitive training speed and performance in language modeling, while offering efficient inference.

Image Classification Language Modelling

448

0.34 stars / hour

Paper
Code

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

showlab/show-1 • • 27 Sep 2023

In this paper, we are the first to propose a hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation.

Ranked #2 on Text-to-Video Generation on EvalCrafter Text-to-Video (ECTV) Dataset (using extra training data)

Text-to-Video Generation Video Alignment +1

970

0.33 stars / hour

Paper
Code

CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

qinghew/CharacterFactory • 24 Apr 2024

In this work, we propose CharacterFactory, a framework that allows sampling new characters with consistent identities in the latent space of GANs for diffusion models.

Consistent Character Generation Word Embeddings

0.33 stars / hour

Paper
Code

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

facebookresearch/purplellama • 19 Apr 2024

We present BenchmarkName, a novel benchmark to quantify LLM security risks and capabilities.

1,862

0.32 stars / hour

Paper
Code

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

crystalwlz/omegas • • 24 Apr 2024

Current scene reconstruction techniques frequently result in the loss of object detail textures and are unable to reconstruct object portions that are occluded or unseen in views.

3D Reconstruction Object +2

0.31 stars / hour

Paper
Code