NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion

microsoft/nuwa 24 Nov 2021

To cover language, image, and video at the same time for different scenarios, a 3D transformer encoder-decoder framework is designed, which can not only deal with videos as 3D data but also adapt to texts and images as 1D and 2D data, respectively.

Text-to-Image Generation Video Generation

Semi-supervised Implicit Scene Completion from Sparse LiDAR

open-air-sun/sisc 29 Nov 2021

Recent advances show that semi-supervised implicit representation learning can be achieved through physical constraints like Eikonal equations.

Representation Learning

OpenUE: An Open Toolkit of Universal Extraction from Text

zjunlp/openue EMNLP 2020

We introduce a prototype model and provide an open-source and extensible toolkit called OpenUE for various extraction tasks.

Event Extraction Intent Detection

Robust High-Resolution Video Matting with Temporal Guidance

PeterL1n/RobustVideoMatting 25 Aug 2021

We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance.

Video Matting

Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation

SysCV/pcan NeurIPS 2021

We propose Prototypical Cross-Attention Network (PCAN), capable of leveraging rich spatio-temporal information for online multiple object tracking and segmentation.

Multi-Object Tracking and Segmentation Multiple Object Track and Segmentation

Vector Quantized Diffusion Model for Text-to-Image Synthesis

microsoft/vq-diffusion 29 Nov 2021

Our experiments indicate that the VQ-Diffusion model with the reparameterization is fifteen times faster than traditional autoregressive (AR) methods while achieving better image quality.

Denoising Text-to-Image Generation

The Devil is the Classifier: Investigating Long Tail Relation Classification with Decoupling Analysis

zjunlp/deepke 15 Sep 2020

Long-tailed relation classification is a challenging problem as the head classes may dominate the training phase, thereby leading to the deterioration of the tail performance.

General Classification Relation Classification

PaddlePaddle/PaddleRec 6 Jul 2020

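A large-scale recommendation algorithm library covering classic and state-of-the-art recommender system algorithms, including LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, DeepWalk, SSR, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESMM, MAML, xDeepFM, DeepFEFM, NFM, AFM, RALM, Deep Crossing, PNN, BST, AutoInt, FGCNN, FLEN, and ListWise, along with classic recommender system datasets such as criteo and movielens.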

Click-Through Rate Prediction

VaxNeRF: Revisiting the Classic for Voxel-Accelerated Neural Radiance Field

naruya/vaxnerf 25 Nov 2021

We hope VaxNeRF -- a careful combination of a classic technique with a deep method (that arguably replaced it) -- can empower and accelerate new NeRF extensions and applications, with its simplicity, portability, and reliable performance gains.

3D Reconstruction Meta-Learning

