Empirical Analysis of Training Strategies of Transformer-based Japanese Chit-chat Systems

nttcslab/japanese-dialog-transformers 11 Sep 2021

In recent years, several high-performance conversational systems have been proposed based on the Transformer encoder-decoder model.

Evaluating Large Language Models Trained on Code

microsoft/PythonProgrammingPuzzles 7 Jul 2021

We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities.

Code Generation, Language Modelling

Robust High-Resolution Video Matting with Temporal Guidance

PeterL1n/RobustVideoMatting 25 Aug 2021

We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance.

Video Matting

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

renyurui/pirender 17 Sep 2021

The proposed model can generate photo-realistic portrait images with accurate movements according to intuitive modifications.

Image Generation, Neural Rendering

An End-to-End Transformer Model for 3D Object Detection

facebookresearch/3detr 16 Sep 2021

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point clouds.

3D Object Detection

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

xinntao/Real-ESRGAN 22 Jul 2021

Though many attempts have been made in blind super-resolution to restore low-resolution images with unknown and complex degradations, they are still far from addressing general real-world degraded images.

Video Super-Resolution

Physics-based Deep Learning

thunil/Physics-Based-Deep-Learning 11 Sep 2021

This digital book contains a practical and comprehensive introduction to everything related to deep learning in the context of physical simulations.

Physical Simulations

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

RainbowRui/Landmark-Driven-Facial-Expression-Recognition CVPR 2020

Annotating a qualitative large-scale facial expression dataset is extremely difficult due to the uncertainties caused by ambiguous facial expressions, low-quality facial images, and the subjectiveness of annotators.

Facial Expression Recognition

PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models

ElementAI/picard 10 Sep 2021

Large pre-trained language models for textual data have an unconstrained output space; at each decoding step, they can produce any of 10,000s of sub-word tokens.

Dialogue State Tracking, Semantic Parsing +1

LibFewShot: A Comprehensive Library for Few-shot Learning

rl-vig/libfewshot 10 Sep 2021

Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmark datasets with multiple backbone architectures to evaluate common pitfalls and effects of different training tricks.

Data Augmentation, Few-Shot Image Classification +1

