Pen and Paper Exercises in Machine Learning

michaelgutmann/ml-pen-and-paper-exercises 27 Jun 2022

This is a collection of (mostly) pen-and-paper exercises in machine learning.

1,085
2.86 stars / hour

Instant Neural Graphics Primitives with a Multiresolution Hash Encoding

kwea123/ngp_pl 16 Jan 2022

Neural graphics primitives, parameterized by fully connected neural networks, can be costly to train and evaluate.

3D Reconstruction 3D Shape Reconstruction +2

190
1.56 stars / hour

DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

keonlee9420/DailyTalk 3 Jul 2022

We sampled, modified, and recorded 2, 541 dialogues from the open-domain dialogue dataset DailyDialog which are adequately long to represent context of each dialogue.

45
1.00 stars / hour

Back to MLP: A Simple Baseline for Human Motion Prediction

dulucas/simlpe 4 Jul 2022

This paper tackles the problem of human motion prediction, consisting in forecasting future body poses from historically observed sequences.

Human motion prediction motion prediction

23
0.88 stars / hour

LViT: Language meets Vision Transformer in Medical Image Segmentation

huanglizi/lvit 29 Jun 2022

In our model, medical text annotation is introduced to compensate for the quality deficiency in image data.

Medical Image Segmentation Semantic Segmentation

104
0.81 stars / hour

Disentangling Random and Cyclic Effects in Time-Lapse Sequences

harskish/tlgan 4 Jul 2022

We introduce the problem of disentangling time-lapse sequences in a way that allows separate, after-the-fact control of overall trends, cyclic effects, and random effects in the images, and describe a technique based on data-driven generative models that achieves this goal.

21
0.67 stars / hour

Text2Human: Text-Driven Controllable Human Image Generation

yumingj/deepfashion-multimodal 31 May 2022

In this work, we present a text-driven controllable framework, Text2Human, for a high-quality and diverse human generation.

Human Parsing Image Generation

136
0.59 stars / hour

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

niraharon/bot-sort 29 Jun 2022

The goal of multi-object tracking (MOT) is detecting and tracking all the objects in a scene, while keeping a unique identifier for each object.

 Ranked #1 on Multi-Object Tracking on MOT20 (using extra training data)

Multi-Object Tracking

57
0.54 stars / hour

Forecasting Future World Events with Neural Networks

andyzoujm/autocast 30 Jun 2022

We test language models on our forecasting task and find that performance is far below a human expert baseline.

Decision Making Language Modelling

54
0.46 stars / hour

Ivy: Templated Deep Learning for Inter-Framework Portability

ivy-dl/ivy 4 Feb 2021

We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.

3,086
0.44 stars / hour