CVPR 2020

HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation

CVPR 2020 HRNet/Higher-HRNet-Human-Pose-Estimation

Bottom-up human pose estimation methods have difficulties in predicting the correct pose for small persons due to challenges in scale variation.

MULTI-PERSON POSE ESTIMATION REPRESENTATION LEARNING

Show, Edit and Tell: A Framework for Editing Image Captions

CVPR 2020 fawazsammani/show-edit-tell

Specifically, our caption-editing model consisting of two sub-modules: (1) EditNet, a language module with an adaptive copy mechanism (Copy-LSTM) and a Selective Copy Memory Attention mechanism (SCMA), and (2) DCNet, an LSTM-based denoising auto-encoder.

DENOISING IMAGE CAPTIONING

Show, Edit and Tell: A Framework for Editing Image Captions

CVPR 2020 fawazsammani/show-edit-tell

Specifically, our caption-editing model consisting of two sub-modules: (1) EditNet, a language module with an adaptive copy mechanism (Copy-LSTM) and a Selective Copy Memory Attention mechanism (SCMA), and (2) DCNet, an LSTM-based denoising auto-encoder.

DENOISING IMAGE CAPTIONING