Pen and Paper Exercises in Machine Learning

michaelgutmann/ml-pen-and-paper-exercises 27 Jun 2022

This is a collection of (mostly) pen-and-paper exercises in machine learning.

2.31 stars / hour

Instant Neural Graphics Primitives with a Multiresolution Hash Encoding

kwea123/ngp_pl 16 Jan 2022

Neural graphics primitives, parameterized by fully connected neural networks, can be costly to train and evaluate.

3D Reconstruction 3D Shape Reconstruction +2

1.45 stars / hour

DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

keonlee9420/DailyTalk 3 Jul 2022

We sampled, modified, and recorded 2,541 dialogues from the open-domain dialogue dataset DailyDialog, which are long enough to represent the context of each dialogue.

1.06 stars / hour

Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation

haomo-ai/motionseg3d 5 Jul 2022

We also use a point refinement module via 3D sparse convolution to fuse the information from both LiDAR range image and point cloud representations and reduce the artifacts on the borders of the objects.

Autonomous Driving Semantic Segmentation

0.79 stars / hour

Back to MLP: A Simple Baseline for Human Motion Prediction

dulucas/simlpe 4 Jul 2022

This paper tackles the problem of human motion prediction, which consists of forecasting future body poses from historically observed sequences.

Human motion prediction motion prediction

0.79 stars / hour

LViT: Language meets Vision Transformer in Medical Image Segmentation

huanglizi/lvit 29 Jun 2022

In our model, medical text annotation is introduced to compensate for the quality deficiency in image data.

Medical Image Segmentation Semantic Segmentation

0.78 stars / hour

AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture

lizhe00/avatarcap 5 Jul 2022

Then, given a monocular RGB video of this subject, our method integrates information from both the image observation and the avatar prior, and accordingly reconstructs high-fidelity 3D textured models with dynamic details regardless of the visibility.

0.75 stars / hour

Disentangling Random and Cyclic Effects in Time-Lapse Sequences

harskish/tlgan 4 Jul 2022

We introduce the problem of disentangling time-lapse sequences in a way that allows separate, after-the-fact control of overall trends, cyclic effects, and random effects in the images, and describe a technique based on data-driven generative models that achieves this goal.

0.65 stars / hour

Text2Human: Text-Driven Controllable Human Image Generation

yumingj/deepfashion-multimodal 31 May 2022

In this work, we present a text-driven controllable framework, Text2Human, for high-quality and diverse human generation.

Human Parsing Image Generation

0.59 stars / hour

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

niraharon/bot-sort 29 Jun 2022

The goal of multi-object tracking (MOT) is detecting and tracking all the objects in a scene, while keeping a unique identifier for each object.

Ranked #1 on Multi-Object Tracking on MOT20 (using extra training data)

Multi-Object Tracking

0.54 stars / hour