Search Results for author: Cheng-Ze Lu

Found 6 papers, 4 papers with code

Delving Deeper into Data Scaling in Masked Image Modeling

no code implementations • 24 May 2023 • Cheng-Ze Lu, Xiaojie Jin, Qibin Hou, Jun Hao Liew, Ming-Ming Cheng, Jiashi Feng

The study reveals that: 1) MIM can be viewed as an effective method to improve the model capacity when the scale of the training data is relatively small; 2) Strong reconstruction targets can endow the models with increased capacities on downstream tasks; 3) MIM pre-training is data-agnostic under most scenarios, which means that the strategy of sampling pre-training data is non-critical.

Self-Supervised Learning

CMAE-V: Contrastive Masked Autoencoders for Video Action Recognition

no code implementations • 15 Jan 2023 • Cheng-Ze Lu, Xiaojie Jin, Zhicheng Huang, Qibin Hou, Ming-Ming Cheng, Jiashi Feng

Contrastive Masked Autoencoder (CMAE), a new self-supervised framework, has shown potential for learning expressive feature representations in image recognition.

Action Recognition, Temporal Action Localization

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition

1 code implementation • 22 Nov 2022 • Qibin Hou, Cheng-Ze Lu, Ming-Ming Cheng, Jiashi Feng

This paper does not attempt to design a state-of-the-art method for visual recognition but investigates a more efficient way to make use of convolutions to encode spatial features.

Object Detection +1

Stars: 128

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

3 code implementations • 18 Sep 2022 • Meng-Hao Guo, Cheng-Ze Lu, Qibin Hou, ZhengNing Liu, Ming-Ming Cheng, Shi-Min Hu

Notably, SegNeXt outperforms EfficientNet-L2 w/ NAS-FPN and achieves 90.6% mIoU on the Pascal VOC 2012 test leaderboard using only 1/10 of its parameters.

Ranked #1 on Semantic Segmentation on iSAID

Segmentation, Semantic Segmentation

Stars: 7,405

Towards An End-to-End Framework for Flow-Guided Video Inpainting

2 code implementations • CVPR 2022 • Zhen Li, Cheng-Ze Lu, Jianhua Qin, Chun-Le Guo, Ming-Ming Cheng

Optical flow, which captures motion information across frames, is exploited in recent video inpainting methods through propagating pixels along its trajectories.

Ranked #2 on Seeing Beyond the Visible on KITTI360-EX

Hallucination, Optical Flow Estimation +2

Stars: 964

Visual Attention Network

17 code implementations • 20 Feb 2022 • Meng-Hao Guo, Cheng-Ze Lu, Zheng-Ning Liu, Ming-Ming Cheng, Shi-Min Hu

In this paper, we propose a novel linear attention named large kernel attention (LKA) to enable self-adaptive and long-range correlations in self-attention while avoiding its shortcomings.

Ranked #1 on Panoptic Segmentation on COCO panoptic

Image Classification, Instance Segmentation +5

Stars: 124,889
