Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels

baofff/U-ViT NeurIPS 2023

In an effort to further advance semi-supervised generative and classification tasks, we propose a simple yet effective training strategy called dual pseudo training (DPT), built upon strong semi-supervised learners and diffusion models.

Classification

701
0.35 stars / hour

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

fudan-generative-vision/champ 21 Mar 2024

In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in curernt human generative techniques.

Animated GIF Generation Image Animation +1

3,105
0.35 stars / hour

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

macaronlin/llama3-quantization 22 Apr 2024

This exploration holds the potential to unveil new insights and challenges for low-bit quantization of LLaMA3 and other forthcoming LLMs, especially in addressing performance degradation problems that suffer in LLM compression.

Language Modelling Large Language Model +1

59
0.34 stars / hour

BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models

ledzy/badam 3 Apr 2024

This work presents BAdam, an optimizer that leverages the block coordinate optimization framework with Adam as the inner solver.

117
0.34 stars / hour

BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

tencentarc/brushnet 11 Mar 2024

Image inpainting, the process of restoring corrupted images, has seen significant advancements with the advent of diffusion models (DMs).

Image Inpainting

907
0.34 stars / hour

HGRN2: Gated Linear RNNs with State Expansion

sustcsonglin/flash-linear-attention 11 Apr 2024

Hierarchically gated linear RNN (HGRN, Qin et al. 2023) has demonstrated competitive training speed and performance in language modeling, while offering efficient inference.

Image Classification Language Modelling

448
0.34 stars / hour

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

showlab/show-1 27 Sep 2023

In this paper, we are the first to propose a hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation.

Text-to-Video Generation Video Alignment +1

970
0.33 stars / hour

CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

qinghew/CharacterFactory 24 Apr 2024

In this work, we propose CharacterFactory, a framework that allows sampling new characters with consistent identities in the latent space of GANs for diffusion models.

Consistent Character Generation Word Embeddings

40
0.33 stars / hour

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

facebookresearch/purplellama 19 Apr 2024

We present BenchmarkName, a novel benchmark to quantify LLM security risks and capabilities.

1,862
0.32 stars / hour

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

crystalwlz/omegas 24 Apr 2024

Current scene reconstruction techniques frequently result in the loss of object detail textures and are unable to reconstruct object portions that are occluded or unseen in views.

3D Reconstruction Object +2

15
0.31 stars / hour