DAMO-YOLO : A Report on Real-Time Object Detection Design

tinyvision/damo-yolo 23 Nov 2022

In this report, we present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.

Neural Architecture Search, Object Detection

992
0.60 stars / hour

OCR-free Document Understanding Transformer

clovaai/donut 30 Nov 2021

Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs.

Optical Character Recognition

1,151
0.60 stars / hour

Multiview Compressive Coding for 3D Reconstruction

facebookresearch/mcc 19 Jan 2023

We introduce a simple framework that operates on 3D points of single objects or whole scenes coupled with category-agnostic large-scale training from diverse RGB-D videos.

3D Reconstruction, Self-Supervised Learning

181
0.50 stars / hour

On the Expressive Power of Geometric Graph Neural Networks

chaitjo/geometric-gnn-dojo 23 Jan 2023

The expressive power of Graph Neural Networks (GNNs) has been studied extensively through the Weisfeiler-Leman (WL) graph isomorphism test.

35
0.46 stars / hour

TorchGeo: Deep Learning With Geospatial Data

microsoft/torchgeo 17 Nov 2021

Deep learning methods are particularly promising for modeling many remote sensing tasks given the success of deep neural networks in similar computer vision tasks and the sheer volume of remotely sensed imagery available.

Transfer Learning

1,385
0.36 stars / hour

Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators

cleanlab/cleanlab 13 Oct 2022

Many algorithms also rely solely on annotator statistics, ignoring the features of the examples from which the annotations derive.

4,916
0.35 stars / hour

GLM-130B: An Open Bilingual Pre-trained Model

thudm/glm-130b 5 Oct 2022

We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model with 130 billion parameters.

Language Modelling, Multi-task Language Understanding

1,688
0.35 stars / hour

A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction

hustvl/saunet 24 Jan 2023

We present a simple, efficient, and scalable unfolding network, SAUNet, to simplify the network design with an adaptive alternate optimization framework for hyperspectral image (HSI) reconstruction.

Image Reconstruction

16
0.32 stars / hour