A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases

google/learned_optimization 22 Sep 2022

We apply the resulting learned optimizer to a variety of neural network training tasks, where it outperforms the current state-of-the-art learned optimizer, at matched optimizer computational overhead, in both optimization performance and meta-training speed, and is capable of generalizing to tasks far different from those it was meta-trained on.

Inductive Bias

418
0.47 stars / hour

VeLO: Training Versatile Learned Optimizers by Scaling Up

google/learned_optimization 17 Nov 2022

While deep learning models have replaced hand-designed features across many domains, these models are still trained with hand-designed optimizers.

415
0.46 stars / hour

Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths

yingqinghe/lvdm 23 Nov 2022

Diffusion models (DMs) are another class of deep generative models and have recently achieved remarkable performance on various image synthesis tasks.

Denoising, Image Generation, +1

40
0.45 stars / hour

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

XavierXiao/Dreambooth-Stable-Diffusion 25 Aug 2022

Once the subject is embedded in the output domain of the model, the unique identifier can then be used to synthesize fully-novel photorealistic images of the subject contextualized in different scenes.

Image Generation

4,340
0.40 stars / hour

DeepPrivacy2: Towards Realistic Full-Body Anonymization

hukkelas/deep_privacy2 17 Nov 2022

Generative Adversarial Networks (GANs) are widely adopted for anonymization of human figures.

Face Anonymization

49
0.39 stars / hour

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities

flagai-open/flagai 12 Nov 2022

In this work, we present a conceptually simple and effective method to train a strong bilingual/multilingual multimodal representation model.

Contrastive Learning, Cross-Modal Retrieval, +11

638
0.39 stars / hour

Inversion-Based Creativity Transfer with Diffusion Models

zyxelsa/creativity-transfer 23 Nov 2022

In this paper, we introduce the task of "Creativity Transfer".

Denoising, Style Transfer, +1

35
0.36 stars / hour

Vision Transformers for Dense Prediction

isl-org/MiDaS ICCV 2021

We introduce dense vision transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks.

Monocular Depth Estimation, Semantic Segmentation

2,075
0.34 stars / hour

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

MineDojo/MineDojo 17 Jun 2022

Autonomous agents have made great strides in specialist domains like Atari games and Go.

Atari Games

778
0.33 stars / hour

Revisiting Image Pyramid Structure for High Resolution Salient Object Detection

plemeri/transparent-background 20 Sep 2022

Salient object detection (SOD) has been in the spotlight recently, yet has been studied less for high-resolution (HR) images.

Dichotomous Image Segmentation, Salient Object Detection

34
0.32 stars / hour