Search Results for author: Wankou Yang

Found 33 papers, 22 papers with code

Object-level Geometric Structure Preserving for Natural Image Stitching

1 code implementation • 20 Feb 2024 • Wenxiao Cai, Wankou Yang

Current methodologies exhibit the ability to preserve local geometric structures, yet fall short in maintaining relationships between these geometric structures.

Image Stitching

Correlation-Embedded Transformer Tracking: A Single-Branch Framework

1 code implementation • 23 Jan 2024 • Fei Xie, Wankou Yang, Chunyu Wang, Lei Chu, Yue Cao, Chao Ma, Wenjun Zeng

Thus, we reformulate the two-branch Siamese tracking as a conceptually simple, fully transformer-based Single-Branch Tracking pipeline, dubbed SBT.

Feature Correlation, Visual Object Tracking

SSPNet: Scale and Spatial Priors Guided Generalizable and Interpretable Pedestrian Attribute Recognition

1 code implementation • 11 Dec 2023 • Jifeng Shen, Teng Guo, Xin Zuo, Heng Fan, Wankou Yang

The AFSS module learns to provide reasonable scale prior information for different attribute groups, allowing the model to focus on different levels of feature maps with varying semantic granularity.

Attribute, Pedestrian Attribute Recognition

CLIP for Lightweight Semantic Segmentation

no code implementations • 11 Oct 2023 • Ke Jin, Wankou Yang

Later works, such as DenseCLIP and LSeg, extend this paradigm to dense prediction, including semantic segmentation, and have achieved excellent results.

Segmentation, Semantic Segmentation

ADNet: Lane Shape Prediction via Anchor Decomposition

2 code implementations • ICCV 2023 • Lingyu Xiao, Xiang Li, Sen Yang, Wankou Yang

In this paper, we revisit the limitations of anchor-based lane detection methods, which have predominantly focused on fixed anchors that stem from the edges of the image, disregarding their versatility and quality.

Lane Detection

VDD: Varied Drone Dataset for Semantic Segmentation

1 code implementation • 23 May 2023 • Wenxiao Cai, Ke Jin, Jinyan Hou, Cong Guo, Letian Wu, Wankou Yang

We expect that our dataset will generate considerable interest in drone image segmentation and serve as a foundation for other drone vision tasks.

Image Segmentation, Segmentation +1

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation

no code implementations • 13 Apr 2023 • Letian Wu, Wenyao Zhang, Tengping Jiang, Wankou Yang, Xin Jin, Wenjun Zeng

Based on that, we build upon the CLIP model as a backbone which we extend with a One-Way [CLS] token navigation from text to the visual branch that enables zero-shot dense prediction, dubbed ClsCLIP.

Few-Shot Semantic Segmentation, Language Modelling +4

Exploit CAM by itself: Complementary Learning System for Weakly Supervised Semantic Segmentation

no code implementations • 4 Mar 2023 • Jiren Mai, Fei Zhang, Junjie Ye, Marcus Kalander, Xian Zhang, Wankou Yang, Tongliang Liu, Bo Han

Motivated by this simple but effective learning pattern, we propose a General-Specific Learning Mechanism (GSLM) to explicitly drive a coarse-grained CAM to a fine-grained pseudo mask.

General Knowledge, Hippocampus +2

EfficientFace: An Efficient Deep Network with Feature Enhancement for Accurate Face Detection

no code implementations • 23 Feb 2023 • Guangtao Wang, Jun Li, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang

Besides, this is conducive to estimating the locations of faces and enhancing the descriptive power of face features.

Descriptive, Face Detection

Probabilistic Decomposition Transformer for Time Series Forecasting

1 code implementation • 31 Oct 2022 • Junlong Tong, Liping Xie, Wankou Yang, Kanjian Zhang

The Transformer is employed to learn temporal patterns and implement primary probabilistic forecasts, while the conditional generative model is used to achieve non-autoregressive hierarchical probabilistic forecasts by introducing latent space feature representations.

Time Series, Time Series Forecasting

Finding Point with Image: A Simple and Efficient Method for UAV Self-Localization

no code implementations • 13 Aug 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiahao Chen, Wankou Yang

To validate the practicality of our framework, we construct a paired dataset, namely UL14, that consists of UAV and satellite views.

Image Retrieval, Retrieval +1

Correlation-Aware Deep Tracking

1 code implementation • CVPR 2022 • Fei Xie, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng

In contrast to the Siamese-like feature extraction, our network deeply embeds cross-image feature correlation in multiple layers of the feature network.

Feature Correlation, Visual Object Tracking

Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

1 code implementation • 23 Jan 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiedong Zhuang, Wankou Yang

Last, we enhance the Recall@K metric and introduce a new measurement, SDM@K, to evaluate the performance of a trained model from both the retrieval and localization perspectives simultaneously.

Metric Learning, Representation Learning

Video Based Fall Detection Using Human Poses

1 code implementation • 29 Jul 2021 • Ziwei Chen, Yiye Wang, Wankou Yang

Video-based fall detection accuracy has improved greatly thanks to recent progress in deep convolutional neural networks.

Action Recognition

TokenPose: Learning Keypoint Tokens for Human Pose Estimation

1 code implementation • ICCV 2021 • YanJie Li, Shoukui Zhang, Zhicheng Wang, Sen Yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou

Most existing CNN-based methods do well in visual representation but lack the ability to explicitly learn the constraint relationships between keypoints.

Pose Estimation

SIENet: Spatial Information Enhancement Network for 3D Object Detection from Point Cloud

1 code implementation • 29 Mar 2021 • Ziyu Li, Yuncong Yao, Zhibin Quan, Wankou Yang, Jin Xie

Specifically, we design the Spatial Information Enhancement (SIE) module to predict the spatial shapes of the foreground points within proposals, and extract the structure information to learn the representative features for further box refinement.

3D Object Detection, Autonomous Vehicles +3

Separable Batch Normalization for Robust Facial Landmark Localization with Cross-protocol Network Training

no code implementations • 17 Jan 2021 • Shuangping Jin, ZhenHua Feng, Wankou Yang, Josef Kittler

Different from the standard BN layer that uses all the training data to calculate a single set of parameters, SepBN considers that the samples of a training dataset may belong to different sub-domains.

Face Alignment

TransPose: Keypoint Localization via Transformer

1 code implementation • ICCV 2021 • Sen Yang, Zhibin Quan, Mu Nie, Wankou Yang

While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear.

Ranked #3 on Pose Estimation on OCHuman (Validation AP metric)

Keypoint Detection, Multi-Person Pose Estimation

Learning Spatio-Appearance Memory Network for High-Performance Visual Tracking

1 code implementation • 21 Sep 2020 • Fei Xie, Wankou Yang, Bo Liu, Kaihua Zhang, Wanli Xue, WangMeng Zuo

Existing visual object tracking usually learns a bounding-box based template to match the targets across frames, which cannot accurately learn a pixel-wise representation, thereby being limited in handling severe appearance variations.

Segmentation, Semantic Segmentation +5

Unsupervised Eyeglasses Removal in the Wild

1 code implementation • 16 Sep 2019 • Bingwen Hu, Zhedong Zheng, Ping Liu, Wankou Yang, Mingwu Ren

Given two facial images with and without eyeglasses, the proposed model learns to swap the eye area in two faces.

Face Reconstruction, Face Verification +3

Pose Neural Fabrics Search

2 code implementations • 16 Sep 2019 • Sen Yang, Wankou Yang, Zhen Cui

Neural Architecture Search (NAS) technologies have emerged in many domains to jointly learn the architectures and weights of the neural network.

Image Classification, Keypoint Detection +3

Refining Image Categorization by Exploiting Web Images and General Corpus

no code implementations • 16 Mar 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang

To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantically rich (sub-)categories.

Image Categorization

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun

Crowd counting is an important task in computer vision, which has many applications in video surveillance.

Attribute, Crowd Counting
