Fine-Tuning Language Models from Human Preferences

carperai/trlx 18 Sep 2019

Most work on reward learning has used simulated environments, but complex information about values is often expressed in natural language, and we believe reward learning for language is a key to making RL practical and safe for real-world tasks.

Language Modelling

0.48 stars / hour

StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces

williamyang1991/styleganex 10 Mar 2023

Recent advances in face manipulation using StyleGAN have produced impressive results.

0.48 stars / hour

SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving

weiyithu/surroundocc 16 Mar 2023

To perceive 3D scenes more comprehensively, we propose SurroundOcc, a method that predicts 3D occupancy from multi-camera images.

3D Object Detection Autonomous Driving +2

0.48 stars / hour

LoRA: Low-Rank Adaptation of Large Language Models

microsoft/LoRA ICLR 2022

We propose Low-Rank Adaptation, or LoRA, which freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks.

Language Modelling

0.45 stars / hour
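
The LoRA summary above can be illustrated with a small numerical sketch. This is not the microsoft/LoRA implementation; it is a NumPy toy that assumes a single linear layer, with hypothetical names (`lora_forward`, `W`, `A`, `B`, `alpha`, `r`) and arbitrary sizes chosen for illustration. The frozen weight `W` is left untouched, and only the two low-rank factors `A` and `B` would be trained.

```python
import numpy as np


def lora_forward(x, W, A, B, alpha, r):
    """Forward pass of one linear layer with a low-rank update.

    Computes y = x W^T + (alpha / r) * x A^T B^T, i.e. the frozen
    weight W plus the scaled rank-r update B @ A.
    """
    return x @ W.T + (alpha / r) * (x @ A.T) @ B.T


# Illustrative sizes only (assumptions, not values from the listing).
d_in, d_out, r, alpha = 64, 32, 4, 8
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in))      # pre-trained weight, kept frozen
A = rng.standard_normal((r, d_in)) * 0.01   # trainable factor, small random init
B = np.zeros((d_out, r))                    # trainable factor, zero init

x = rng.standard_normal((5, d_in))          # a batch of 5 inputs
y = lora_forward(x, W, A, B, alpha, r)

frozen_params = W.size                      # parameters that stay fixed
trainable_params = A.size + B.size          # parameters that would be updated
```

Because `B` starts at zero, the layer initially reproduces the frozen model's output exactly, and in this toy setup the trainable factors hold 384 parameters against 2048 frozen ones.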


microsoft/unilm NAACL 2021

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Contrastive Learning Cross-Lingual Transfer +1

0.43 stars / hour

Prismer: A Vision-Language Model with An Ensemble of Experts

nvlabs/prismer 4 Mar 2023

Recent vision-language models have shown impressive multi-modal generation capabilities.

Few-Shot Learning Image Captioning +2

0.42 stars / hour

BiFormer: Vision Transformer with Bi-Level Routing Attention

rayleizhu/biformer 15 Mar 2023

As the core building block of vision transformers, attention is a powerful tool to capture long-range dependency.

Ranked #3 on Object Detection on COCO 2017 (mAP metric)

Image Classification Object Detection +2

0.40 stars / hour

T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models

tencentarc/t2i-adapter 16 Feb 2023

Large-scale text-to-image (T2I) models have shown a remarkable generative ability, demonstrating that they can learn complex structures and meaningful semantics.

0.40 stars / hour

DIRE for Diffusion-Generated Image Detection

zhendongwang6/dire 16 Mar 2023

We find that existing detectors struggle to detect images generated by diffusion models, even if we include generated images from a specific diffusion model in their training data.

0.39 stars / hour