Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

picsart-ai-research/text2video-zero 23 Mar 2023

Recent text-to-video generation approaches rely on computationally heavy training and require large-scale video datasets.

Image Generation Text-to-Video Generation +3

1,748
5.44 stars / hour

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

zrrskywalker/llama-adapter 28 Mar 2023

We present LLaMA-Adapter, a lightweight adaption method to efficiently fine-tune LLaMA into an instruction-following model.

Instruction Following Language Modelling +2

720
5.37 stars / hour

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

kent0n-li/chatdoctor 24 Mar 2023

Recent large language models (LLMs) in the general domain, such as ChatGPT, have shown remarkable success in following instructions and producing human-like responses.

Medical Diagnosis

1,154
5.31 stars / hour

PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

shuhongchen/panic3d-anime-reconstruction 25 Mar 2023

We propose PAniC-3D, a system to reconstruct stylized 3D character heads directly from illustrated (p)ortraits of (ani)me (c)haracters.

3D Reconstruction Single-View 3D Reconstruction

263
4.67 stars / hour

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

showlab/Tune-A-Video 22 Dec 2022

To replicate the success of text-to-image (T2I) generation, recent works employ large-scale video datasets to train a text-to-video (T2V) generator.

Style Transfer Text-to-Video Generation +1

2,254
4.28 stars / hour

Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases

lianjiatech/belle 26 Mar 2023

However current research rarely studies the impact of different amounts of instruction data on model performance, especially in the real-world use cases.

2,592
2.95 stars / hour

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

junshutang/Make-It-3D 24 Mar 2023

In this work, we investigate the problem of creating high-fidelity 3D content from only a single image.

Text to 3D

353
2.60 stars / hour

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

BlinkDL/RWKV-LM 18 Nov 2022

We propose SmoothQuant, a training-free, accuracy-preserving, and general-purpose post-training quantization (PTQ) solution to enable 8-bit weight, 8-bit activation (W8A8) quantization for LLMs.

Quantization

4,490
2.47 stars / hour

Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation

Gorilla-Lab-SCUT/Fantasia3D 24 Mar 2023

Key to Fantasia3D is the disentangled modeling and learning of geometry and appearance.

Text to 3D

124
1.91 stars / hour

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

huggingface/peft 14 Oct 2021

Prompt tuning, which only tunes continuous prompts with a frozen language model, substantially reduces per-task storage and memory usage at training.

Language Modelling

2,694
1.83 stars / hour