Search Results for author: Shengwu Xiong

Found 13 papers, 6 papers with code

ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot Learning

1 code implementation26 Apr 2023 Yi Rong, Xiongbo Lu, Zhaoyang Sun, Yaxiong Chen, Shengwu Xiong

With this definition, the ESPT-augmented FSL objective promotes learning more transferable feature representations that capture the local spatial features of different images and their inter-relational structural information in each input episode, thus enabling the model to generalize better to new categories with only a few samples.

Few-Shot Image Classification Few-Shot Learning +1

SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal

no code implementations7 Dec 2021 Zhaoyang Sun, Yaxiong Chen, Shengwu Xiong

Makeup transfer is not only to extract the makeup style of the reference image, but also to render the makeup style to the semantic corresponding position of the target image.

Semantic correspondence

Benchmark Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance

1 code implementation ICCV 2021 Xiaohan Yu, Yang Zhao, Yongsheng Gao, Xiaohui Yuan, Shengwu Xiong

The proposed UFG image dataset and evaluation protocols is intended to serve as a benchmark platform that can advance research of visual classification from approaching human performance to beyond human ability, via facilitating benchmark data of artificial intelligence (AI) not to be limited by the labels of human intelligence (HI).

Fine-Grained Visual Categorization

Scene Text Detection with Selected Anchor

no code implementations19 Aug 2020 Anna Zhu, Hang Du, Shengwu Xiong

Object proposal technique with dense anchoring scheme for scene text detection were applied frequently to achieve high recall.

Region Proposal Scene Text Detection +1

Multi-components System for Automatic Arabic Diacritization

1 code implementation8 Apr 2020 Hamza Abbad, Shengwu Xiong

In this paper, we propose an approach to tackle the problem of the automatic restoration of Arabic diacritics that includes three components stacked in a pipeline: a deep learning model which is a multi-layer recurrent neural network with LSTM and Dense layers, a character-level rule-based corrector which applies deterministic operations to prevent some errors, and a word-level statistical corrector which uses the context and the distance information to fix some diacritization issues.

Arabic Text Diacritization

Local Facial Makeup Transfer via Disentangled Representation

no code implementations27 Mar 2020 Zhaoyang Sun, Wenxuan Liu, Feng Liu, Ryan Wen Liu, Shengwu Xiong

In this paper, we propose a novel unified adversarial disentangling network to further decompose face images into four independent components, i. e., personal identity, lips makeup style, eyes makeup style and face makeup style.

Facial Makeup Transfer

Patchy Image Structure Classification Using Multi-Orientation Region Transform

1 code implementation2 Dec 2019 Xiaohan Yu, Yang Zhao, Yongsheng Gao, Shengwu Xiong, Xiaohui Yuan

To address above limitations, this paper proposes a novel Multi-Orientation Region Transform (MORT), which can effectively characterize both contour and structure features simultaneously, for patchy image structure classification.

Classification General Classification

From Species to Cultivar: Soybean Cultivar Recognition using Multiscale Sliding Chord Matching of Leaf Images

no code implementations11 Oct 2019 Bin Wang, Yongsheng Gao, Xiaohan Yu, Xiaohui Yuan, Shengwu Xiong, Xianzhong Feng

Encouraging experimental results of the proposed method in comparison to the state-of-the-art leaf species recognition methods demonstrate the availability of cultivar information in soybean leaves and effectiveness of the proposed MSCM for soybean cultivar identification, which may advance the research in leaf recognition from species to cultivar.

MobileFAN: Transferring Deep Hidden Representation for Face Alignment

no code implementations11 Aug 2019 Yang Zhao, Yifan Liu, Chunhua Shen, Yongsheng Gao, Shengwu Xiong

To this end, we propose an effective lightweight model, namely Mobile Face Alignment Network (MobileFAN), using a simple backbone MobileNetV2 as the encoder and three deconvolutional layers as the decoder.

Face Alignment Facial Landmark Detection

Directional Regularized Tensor Modeling for Video Rain Streaks Removal

1 code implementation19 Feb 2019 Zhaoyang Sun, Shengwu Xiong, Ryan Wen Liu

Outdoor videos sometimes contain unexpected rain streaks due to the rainy weather, which bring negative effects on subsequent computer vision applications, e. g., video surveillance, object recognition and tracking, etc.

Object Recognition Rain Removal

Robust Facial Landmark Localization Based on Texture and Pose Correlated Initialization

no code implementations15 May 2018 Yiyun Pan, Junwei Zhou, Yongsheng Gao, Shengwu Xiong

In this paper, we propose a Robust Initialization for Cascaded Pose Regression (RICPR) by providing texture and pose correlated initial shapes for the testing face.

Face Alignment regression

Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification

1 code implementation COLING 2016 Abdelghani Dahou, Shengwu Xiong, Junwei Zhou, Mohamed Houcine Haddoud, Pengfei Duan

Moreover, a convolutional neural network trained on top of pre-trained Arabic word embeddings is used for sentiment classification to evaluate the quality of these word embeddings.

General Classification Sentiment Analysis +2

Cannot find the paper you are looking for? You can Submit a new open access paper.