Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review

alex-petrenko/sample-factory 2 May 2018

The framework of reinforcement learning or optimal control provides a mathematical formalization of intelligent decision making that is powerful and broadly applicable.

Decision Making reinforcement-learning +1

457
0.27 stars / hour

Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach

qitianwu/IDCF 9 Jul 2020

The first model follows conventional matrix factorization which factorizes a group of key users' rating matrix to obtain meta latents.

Collaborative Filtering Matrix Completion +2

41
0.26 stars / hour

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

opendilab/ace 29 Nov 2022

In the learning phase, each agent minimizes the TD error that is dependent on how the subsequent agents have reacted to their chosen action.

Decision Making Q-Learning +2

89
0.26 stars / hour

Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark

roboflow-ai/roboflow-100-benchmark 24 Nov 2022

The evaluation of object detection models is usually performed by optimizing a single metric, e. g. mAP, on a fixed set of datasets, e. g. Microsoft COCO and Pascal VOC.

2D object detection Image Retrieval +13

89
0.25 stars / hour

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

facebookresearch/diplomacy_cicero Science 2022

Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge.

774
0.23 stars / hour

OpenFE: Automated Feature Generation beyond Expert-level Performance

zhangtp1996/openfe 22 Nov 2022

The major challenge in automated feature generation is to efficiently and accurately identify useful features from a vast pool of candidate features.

Feature Importance

78
0.23 stars / hour

Galactica: A Large Language Model for Science

paperswithcode/galai 16 Nov 2022

We believe these results demonstrate the potential for language models as a new interface for science.

Anachronisms Bias Detection +13

1,743
0.23 stars / hour

Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

google-research/nerf-from-image 21 Nov 2022

Neural Radiance Fields (NeRF) coupled with GANs represent a promising direction in the area of 3D reconstruction from a single view, owing to their ability to efficiently model arbitrary topologies.

3D Reconstruction Pose Estimation

100
0.22 stars / hour

MetaFormer Baselines for Vision

facebookresearch/xformers 24 Oct 2022

By simply applying depthwise separable convolutions as token mixer in the bottom stages and vanilla self-attention in the top stages, the resulting model CAFormer sets a new record on ImageNet-1K: it achieves an accuracy of 85. 5% at 224x224 resolution, under normal supervised training without external data or distillation.

Image Classification

1,801
0.22 stars / hour

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

XavierXiao/Dreambooth-Stable-Diffusion 25 Aug 2022

Once the subject is embedded in the output domain of the model, the unique identifier can then be used to synthesize fully-novel photorealistic images of the subject contextualized in different scenes.

Image Generation

4,505
0.21 stars / hour