OCR-free Document Understanding Transformer

clovaai/donut 30 Nov 2021

Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs.

Optical Character Recognition

Towards Robust Blind Face Restoration with Codebook Lookup Transformer

sczhou/codeformer 22 Jun 2022

In this paper, we demonstrate that a learned discrete codebook prior in a small proxy space largely reduces the uncertainty and ambiguity of restoration mapping by casting blind face restoration as a code prediction task, while providing rich visual atoms for generating high-quality faces.

Blind Face Restoration

DAMO-YOLO : A Report on Real-Time Object Detection Design

tinyvision/damo-yolo 23 Nov 2022

In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.

Neural Architecture Search, Object Detection

TorchGeo: Deep Learning With Geospatial Data

microsoft/torchgeo 17 Nov 2021

Deep learning methods are particularly promising for modeling many remote sensing tasks given the success of deep neural networks in similar computer vision tasks and the sheer volume of remotely sensed imagery available.

Transfer Learning

Planar Object Tracking via Weighted Optical Flow

serycjon/WOFT 24 Jan 2023

We propose WOFT -- a novel method for planar object tracking that estimates a full 8 degrees-of-freedom pose, i.e. the homography w.r.t.

Object Tracking, Optical Flow Estimation

On the Expressive Power of Geometric Graph Neural Networks

chaitjo/geometric-gnn-dojo 23 Jan 2023

The expressive power of Graph Neural Networks (GNNs) has been studied extensively through the Weisfeiler-Leman (WL) graph isomorphism test.

Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators

cleanlab/cleanlab 13 Oct 2022

Many algorithms also rely solely on annotator statistics, ignoring the features of the examples from which the annotations derive.

