Pen and Paper Exercises in Machine Learning

michaelgutmann/ml-pen-and-paper-exercises 27 Jun 2022

This is a collection of (mostly) pen-and-paper exercises in machine learning.

Variational Inference

469
2.39 stars / hour

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Vegetebird/StridedTransformer-Pose3D 26 Mar 2021

The modified VTE is termed as Strided Transformer Encoder (STE), which is built upon the outputs of VTE.

Monocular 3D Human Pose Estimation

185
0.91 stars / hour

Multi-Graph Fusion Networks for Urban Region Embedding

wushangbin/mgfn 24 Jan 2022

Human mobility data contains rich but abundant information, which yields to the comprehensive region embeddings for cross domain tasks.

Crime Prediction

202
0.89 stars / hour

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

lucidrains/parti-pytorch 22 Jun 2022

We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge.

Machine Translation Text to image generation +1

251
0.86 stars / hour

ProGen2: Exploring the Boundaries of Protein Language Models

salesforce/progen 27 Jun 2022

Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design.

67
0.83 stars / hour

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

tjiiv-cprg/epro-pnp CVPR 2022

The 2D-3D coordinates and corresponding weights are treated as intermediate variables learned by minimizing the KL divergence between the predicted and target pose distribution.

3D Object Detection 6D Pose Estimation using RGB +2

570
0.74 stars / hour

Ivy: Templated Deep Learning for Inter-Framework Portability

ivy-dl/ivy 4 Feb 2021

We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.

2,958
0.64 stars / hour

BokehMe: When Neural Rendering Meets Classical Rendering

juewenpeng/bokehme CVPR 2022

Based on this formulation, we implement the classical renderer by a scattering-based method and propose a two-stage neural renderer to fix the erroneous areas from the classical renderer.

Neural Rendering

73
0.54 stars / hour

Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning

tianyu0207/RTFM ICCV 2021

To address this issue, we introduce a novel and theoretically sound method, named Robust Temporal Feature Magnitude learning (RTFM), which trains a feature magnitude learning function to effectively recognise the positive instances, substantially improving the robustness of the MIL approach to the negative instances from abnormal videos.

Anomaly Detection In Surveillance Videos Contrastive Learning +1

198
0.49 stars / hour

Free-Form Image Inpainting with Gated Convolution

zuruoke/watermark-removal ICCV 2019

We present a generative image inpainting system to complete images with free-form mask and guidance.

feature selection Image Inpainting

292
0.49 stars / hour