DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

XavierXiao/Dreambooth-Stable-Diffusion 25 Aug 2022

Once the subject is embedded in the output domain of the model, the unique identifier can then be used to synthesize fully novel photorealistic images of the subject contextualized in different scenes.

Image Generation

1,538 stars
4.19 stars / hour

Learning to Learn with Generative Models of Neural Network Checkpoints

wpeebles/g.pt 26 Sep 2022

We explore a data-driven approach for learning to optimize neural networks.

reinforcement-learning

139 stars
2.29 stars / hour

Robust Speech Recognition via Large-Scale Weak Supervision

openai/whisper Preprint 2022

We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet.

Robust Speech Recognition, speech-recognition

10,067 stars
2.11 stars / hour

LAVIS: A Library for Language-Vision Intelligence

salesforce/lavis 15 Sep 2022

We introduce LAVIS, an open-source deep learning library for LAnguage-VISion research and applications.

Image Captioning, Image Retrieval, +6

714 stars
1.36 stars / hour

Efficient Few-Shot Learning Without Prompts

huggingface/setfit 22 Sep 2022

This simple framework requires no prompts or verbalizers, and achieves high accuracy with orders of magnitude fewer parameters than existing techniques.

Few-Shot Learning

294 stars
1.13 stars / hour

VToonify: Controllable High-Resolution Portrait Video Style Transfer

williamyang1991/vtoonify 22 Sep 2022

Although a series of successful portrait image toonification models built upon the powerful StyleGAN have been proposed, these image-oriented methods have obvious limitations when applied to videos, such as a fixed frame size, the requirement of face alignment, missing non-facial details, and temporal inconsistency.

Face Alignment, Style Transfer, +1

465 stars
0.97 stars / hour

Zero-Shot Text-Guided Object Generation with Dream Fields

shengyu-meng/dreamfields-3D CVPR 2022

Our method, Dream Fields, can generate the geometry and color of a wide range of objects without 3D supervision.

Neural Rendering

226 stars
0.87 stars / hour

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

IDEA-Research/detrex 7 Mar 2022

Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results.

Ranked #1 on Object Detection on COCO minival (using extra training data)

object-detection, Real-Time Object Detection

328 stars
0.81 stars / hour

EditEval: An Instruction-Based Benchmark for Text Improvements

facebookresearch/editeval 27 Sep 2022

Evaluation of text generation to date has primarily focused on content created sequentially, rather than improvements on a piece of text.

Text Generation

53 stars
0.79 stars / hour

towhee

towhee-io/towhee 22 Oct 2020

Towhee is a framework dedicated to making neural data processing pipelines simple and fast.

Audio Fingerprint, Contrastive Learning, +1

1,410 stars
0.73 stars / hour