Search Results

Mobile-Former: Bridging MobileNet and Transformer

BR-IDL/PaddleViT 12 Aug 2021

We present Mobile-Former, a parallel design of MobileNet and Transformer with a two-way bridge in between.

Object Detection

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

BR-IDL/PaddleViT ICCV 2021

This paper presents a new vision Transformer, called Swin Transformer, that capably serves as a general-purpose backbone for computer vision.

Ranked #3 on Semantic Segmentation on FoodSeg103 (using extra training data)

Image Classification Instance Segmentation +2

CycleMLP: A MLP-like Architecture for Dense Prediction

BR-IDL/PaddleViT 21 Jul 2021

We build a family of models that surpass existing MLPs and achieve a comparable accuracy (83. 2%) on ImageNet-1K classification compared to the state-of-the-art Transformer such as Swin Transformer (83. 3%) but using fewer parameters and FLOPs.

Image Classification Instance Segmentation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.