Search Results for author: Wenming Tan

Found 18 papers, 10 papers with code

Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation

1 code implementation ICCV 2023 Heng Zhao, Shenxing Wei, Dahu Shi, Wenming Tan, Zheyang Li, Ye Ren, Xing Wei, Yi Yang, ShiLiang Pu

Taking the symmetry properties of objects into consideration, we design a symmetry-aware matching loss to facilitate the learning of dense point-wise geometry features and improve the performance considerably.

6D Pose Estimation 6D Pose Estimation using RGB +3

Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection

1 code implementation ICCV 2023 Liangqi Li, Jiaxu Miao, Dahu Shi, Wenming Tan, Ye Ren, Yi Yang, ShiLiang Pu

Current methods for open-vocabulary object detection (OVOD) rely on a pre-trained vision-language model (VLM) to acquire the recognition ability.

Knowledge Distillation Language Modelling +2

SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization

1 code implementation NIPS 2022 Zheng Chuanyang, Zheyang Li, Kai Zhang, Zhi Yang, Wenming Tan, Jun Xiao, Ye Ren, ShiLiang Pu

In this paper, we introduce joint importance, which integrates essential structural-aware interactions between components for the first time, to perform collaborative pruning.

object-detection Object Detection

Unified Normalization for Accelerating and Stabilizing Transformers

1 code implementation2 Aug 2022 Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, ShiLiang Pu

To tackle these issues, we propose Unified Normalization (UN), which can speed up the inference by being fused with other linear operations and achieve comparable performance on par with LN.

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding

no code implementations ACL 2022 Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, ShiLiang Pu, Fei Wu

To achieve effective grounding under a limited annotation budget, we investigate one-shot video grounding, and learn to ground natural language in all video frames with solely one frame labeled, in an end-to-end manner.

Descriptive Representation Learning +1

UWC: Unit-wise Calibration Towards Rapid Network Compression

no code implementations17 Jan 2022 Chen Lin, Zheyang Li, Bo Peng, Haoji Hu, Wenming Tan, Ye Ren, ShiLiang Pu

This paper introduces a post-training quantization~(PTQ) method achieving highly efficient Convolutional Neural Network~ (CNN) quantization with high performance.

Quantization

End-to-End Multi-Person Pose Estimation With Transformers

1 code implementation CVPR 2022 Dahu Shi, Xing Wei, Liangqi Li, Ye Ren, Wenming Tan

Current methods of multi-person pose estimation typically treat the localization and association of body joints separately.

Multi-Person Pose Estimation

Scene-Adaptive Attention Network for Crowd Counting

no code implementations31 Dec 2021 Xing Wei, Yuanrui Kang, Jihao Yang, Yunfeng Qiu, Dahu Shi, Wenming Tan, Yihong Gong

First of all, we design a deformable attention in-built Transformer backbone, which learns adaptive feature representations with deformable sampling locations and dynamic attention weights.

Crowd Counting

SOIT: Segmenting Objects with Instance-Aware Transformers

1 code implementation21 Dec 2021 Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, Wenming Tan

The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer.

Instance Segmentation Segmentation +1

Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition

1 code implementation13 May 2021 Hui Jiang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenqi Ren, Fei Wu, Wenming Tan

In this work, we excavate the implicit task, character counting within the traditional text recognition, without additional labor annotation cost.

Optical Character Recognition (OCR) Scene Text Recognition

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

1 code implementation13 May 2021 Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, ShiLiang Pu, Yi Niu, Wenqi Ren, Wenming Tan, Fei Wu

In this paper, we aim to obtain more reliable aligned bounding boxes by fully utilizing the visual information from both text regions in proposed local features and cell relations in global features.

Table Recognition

Rethinking Pseudo-labeled Sample Mining for Semi-Supervised Object Detection

no code implementations1 Jan 2021 Duo Li, Sanli Tang, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenming Tan, Fei Wu, Xiaokang Yang

However, the impact of the pseudo-labeled samples' quality as well as the mining strategies for high quality training sample have rarely been studied in SSL.

object-detection Object Detection +1

PolarDet: A Fast, More Precise Detector for Rotated Target in Aerial Images

no code implementations17 Oct 2020 Pengbo Zhao, Zhenshen Qu, Yingjia Bu, Wenming Tan, Ye Ren, ShiLiang Pu

Fast and precise object detection for high-resolution aerial images has been a challenging task over the years.

Ranked #35 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +1

MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion

no code implementations23 Sep 2020 Zehan Zhang, Ming Zhang, Zhidong Liang, Xian Zhao, Ming Yang, Wenming Tan, ShiLiang Pu

Experimental results on the KITTI dataset demonstrate significant improvement in filtering false positive over the approach using only point cloud data.

Autonomous Driving

Extreme Network Compression via Filter Group Approximation

no code implementations ECCV 2018 Bo Peng, Wenming Tan, Zheyang Li, Shun Zhang, Di Xie, ShiLiang Pu

In this paper we propose a novel decomposition method based on filter group approximation, which can significantly reduce the redundancy of deep convolutional neural networks (CNNs) while maintaining the majority of feature representation.

General Classification Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.