Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language

microsoft/2D-TAN 4 Dec 2020

It is a challenging problem because a target moment may take place in the context of other temporal moments in the untrimmed video.

361
0.51 stars / hour

Package for Fast ABC-Boost

pltrees/abcboost 18 Jul 2022

Although the gain formula in Li (2010) was derived for logistic regression loss, it is a generic formula for loss functions with second-derivatives.

Multi-class Classification

72
0.49 stars / hour

FRA-RIR: Fast Random Approximation of the Image-source Method

yluo42/fra-rir 8 Aug 2022

The training of modern speech processing systems often requires a large amount of simulated room impulse response (RIR) data in order to allow the systems to generalize well in real-world, reverberant environments.

Denoising Speech Denoising

56
0.41 stars / hour

Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models

compvis/latent-diffusion 26 Jul 2022

In RDMs, a set of nearest neighbors is retrieved from an external database during training for each training instance, and the diffusion model is conditioned on these informative samples.

Image Generation Prompt Engineering

2,614
0.40 stars / hour

Hybrid Spectrogram and Waveform Source Separation

facebookresearch/demucs 5 Nov 2021

Source separation models either work on the spectrogram or waveform domain.

Music Source Separation

3,910
0.40 stars / hour

GAUDI: A Neural Architect for Immersive 3D Scene Generation

apple/ml-gaudi 27 Jul 2022

We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera.

Image Generation Scene Generation

365
0.39 stars / hour

Label-Free Synthetic Pretraining of Object Detectors

princeton-vl/solid 8 Aug 2022

Our "SOLID" approach consists of two main components: (1) generating synthetic images using a collection of unlabelled 3D models with optimized scene arrangement; (2) pretraining an object detector on "instance detection" task - given a query image depicting an object, detecting all instances of the exact same object in a target image.

26
0.39 stars / hour

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

bytedance/next-vit 12 Jul 2022

Then, Next Hybrid Strategy (NHS) is designed to stack NCB and NTB in an efficient hybrid paradigm, which boosts performance in various downstream tasks.

67
0.39 stars / hour

Artificial Intelligence and Machine Learning for Quantum Technologies

ml4qtech/collection 7 Aug 2022

In recent years, the dramatic progress in machine learning has begun to impact many areas of science and technology significantly.

27
0.35 stars / hour

ParlAI

facebookresearch/ParlAI ICLR 2019

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Dialogue Generation

9,327
0.35 stars / hour