Search Results for author: Yongsheng Gao

Found 33 papers, 11 papers with code

A Novel Line Integral Transform for 2D Affine-Invariant Shape Retrieval

no code implementations ECCV 2020 Bin Wang, Yongsheng Gao

While conducting the trace transform once only generates a single feature and multiple trace transforms of different functionals are needed to derive more to make the descriptors informative.

Retrieval

Towards Temporally Consistent Referring Video Object Segmentation

1 code implementation28 Mar 2024 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Mubarak Shah, Ajmal Mian

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects.

Ranked #3 on Referring Video Object Segmentation on Refer-YouTube-VOS (using extra training data)

Object Referring Video Object Segmentation +4

A Language Model based Framework for New Concept Placement in Ontologies

1 code implementation27 Feb 2024 Hang Dong, Jiaoyan Chen, Yuan He, Yongsheng Gao, Ian Horrocks

In all steps, we propose to leverage neural methods, where we apply embedding-based methods and contrastive learning with Pre-trained Language Models (PLMs) such as BERT for edge search, and adapt a BERT fine-tuning-based multi-label Edge-Cross-encoder, and Large Language Models (LLMs) such as GPT series, FLAN-T5, and Llama 2, for edge selection.

Contrastive Learning Entity Linking +1

Spectrum-guided Multi-granularity Referring Video Object Segmentation

1 code implementation ICCV 2023 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

To address the drift problem, we propose a Spectrum-guided Multi-granularity (SgMg) approach, which performs direct segmentation on the encoded features and employs visual details to further optimize the masks.

 Ranked #1 on Referring Expression Segmentation on J-HMDB (using extra training data)

Object Referring Expression Segmentation +4

Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification

no code implementations11 Jul 2023 Yi Liao, Yongsheng Gao, Weichuan Zhang

However, all the CAM-based methods (e. g., CAM, Grad-CAM, and Relevance-CAM) can only be used for interpreting CNN models with fully-connected (FC) layers as a classifier.

Classification Contrastive Learning +4

Fan-Beam Binarization Difference Projection (FB-BDP): A Novel Local Object Descriptor for Fine-Grained Leaf Image Retrieval

no code implementations ICCV 2023 Xin Chen, Bin Wang, Yongsheng Gao

Fine-grained leaf image retrieval (FGLIR) aims to search similar leaf images in subspecies level which involves very high interclass visual similarity and accordingly poses great challenges to leaf image description.

Binarization Image Retrieval +1

Region Aware Video Object Segmentation with Deep Motion Modeling

no code implementations21 Jul 2022 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory.

Object Segmentation +3

Imitation of Manipulation Skills Using Multiple Geometries

no code implementations2 Mar 2022 Boyang Ti, Yongsheng Gao, Jie Zhao, Sylvain Calinon

Daily manipulation tasks are characterized by geometric primitives related to actions and object shapes.

Mask-Guided Feature Extraction and Augmentation for Ultra-Fine-Grained Visual Categorization

no code implementations16 Sep 2021 Zicheng Pan, Xiaohan Yu, Miaohua Zhang, Yongsheng Gao

The advantage of the proposed method is that the feature detection and extraction model only requires a small amount of target region samples with bounding boxes for training, then it can automatically locate the target area for a large number of images in the dataset at a high detection accuracy.

Fine-Grained Visual Categorization

Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

1 code implementation27 Jul 2021 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation.

Segmentation Semantic Segmentation +2

EAR-NET: Error Attention Refining Network For Retinal Vessel Segmentation

1 code implementation3 Jul 2021 Jun Wang, Yang Zhao, Linglong Qian, Xiaohan Yu, Yongsheng Gao

The precise detection of blood vessels in retinal images is crucial to the early diagnosis of the retinal vascular diseases, e. g., diabetic, hypertensive and solar retinopathies.

Retinal Vessel Segmentation Segmentation +1

Image Feature Information Extraction for Interest Point Detection: A Review

no code implementations15 Jun 2021 Junfeng Jing, Tian Gao, Weichuan Zhang, Yongsheng Gao, Changming Sun

The existing popular datasets and evaluation standards are provided and the performances for eighteen state-of-the-art approaches are evaluated and discussed.

Interest Point Detection

NDPNet: A novel non-linear data projection network for few-shot fine-grained image classification

no code implementations13 Jun 2021 Weichuan Zhang, Xuefang Liu, Zhe Xue, Yongsheng Gao, Changming Sun

Metric-based few-shot fine-grained image classification (FSFGIC) aims to learn a transferable feature embedding network by estimating the similarities between query images and support classes from very few examples.

Few-Shot Learning Fine-Grained Image Classification +1

Mask Guided Attention For Fine-Grained Patchy Image Classification

2 code implementations4 Feb 2021 Jun Wang, Xiaohan Yu, Yongsheng Gao

Specifically, the proposed MGA integrates a pre-trained semantic segmentation model that produces auxiliary supervision signal, i. e., patchy attention mask, enabling a discriminative representation learning.

Classification General Classification +3

Benchmark Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance

1 code implementation ICCV 2021 Xiaohan Yu, Yang Zhao, Yongsheng Gao, Xiaohui Yuan, Shengwu Xiong

The proposed UFG image dataset and evaluation protocols is intended to serve as a benchmark platform that can advance research of visual classification from approaching human performance to beyond human ability, via facilitating benchmark data of artificial intelligence (AI) not to be limited by the labels of human intelligence (HI).

Fine-Grained Visual Categorization

Multi-layer Feature Aggregation for Deep Scene Parsing Models

no code implementations4 Nov 2020 Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang, Qiang Wu

The proposed module can auto-select the intermediate visual features to correlate the spatial and semantic information.

Scene Parsing Semantic Segmentation

Parameter Efficient Deep Neural Networks with Bilinear Projections

1 code implementation3 Nov 2020 Litao Yu, Yongsheng Gao, Jun Zhou, Jian Zhang

Recent research on deep neural networks (DNNs) has primarily focused on improving the model accuracy.

Robust Tensor Decomposition for Image Representation Based on Generalized Correntropy

no code implementations10 May 2020 Miaohua Zhang, Yongsheng Gao, Changming Sun, Michael Blumenstein

Traditional tensor decomposition methods, e. g., two dimensional principal component analysis and two dimensional singular value decomposition, that minimize mean square errors, are sensitive to outliers.

Clustering Face Reconstruction +3

A Unified Weight Learning and Low-Rank Regression Model for Robust Complex Error Modeling

no code implementations10 May 2020 Miaohua Zhang, Yongsheng Gao, Jun Zhou

For the structured error caused by occlusions or disguises, we propose a GC function based rank approximation to measure the rank of error matrices.

Face Recognition regression +1

A Robust Matching Pursuit Algorithm Using Information Theoretic Learning

no code implementations10 May 2020 Miaohua Zhang, Yongsheng Gao, Changming Sun, Michael Blumenstein

Current orthogonal matching pursuit (OMP) algorithms calculate the correlation between two vectors using the inner product operation and minimize the mean square error, which are both suboptimal when there are non-Gaussian noises or outliers in the observation data.

Image Reconstruction

A Generalized Kernel Risk Sensitive Loss for Robust Two-Dimensional Singular Value Decomposition

no code implementations10 May 2020 Miaohua Zhang, Yongsheng Gao

Two-dimensional singular decomposition (2DSVD) has been widely used for image processing tasks, such as image reconstruction, classification, and clustering.

Clustering Image Reconstruction

Deep Residual-Dense Lattice Network for Speech Enhancement

2 code implementations27 Feb 2020 Mohammad Nikzad, Aaron Nicolson, Yongsheng Gao, Jun Zhou, Kuldip K. Paliwal, Fanhua Shang

Motivated by this, we propose the residual-dense lattice network (RDL-Net), which is a new CNN for speech enhancement that employs both residual and dense aggregations without over-allocating parameters for feature re-usage.

Speech Enhancement

Patchy Image Structure Classification Using Multi-Orientation Region Transform

1 code implementation2 Dec 2019 Xiaohan Yu, Yang Zhao, Yongsheng Gao, Shengwu Xiong, Xiaohui Yuan

To address above limitations, this paper proposes a novel Multi-Orientation Region Transform (MORT), which can effectively characterize both contour and structure features simultaneously, for patchy image structure classification.

Classification General Classification

From Species to Cultivar: Soybean Cultivar Recognition using Multiscale Sliding Chord Matching of Leaf Images

no code implementations11 Oct 2019 Bin Wang, Yongsheng Gao, Xiaohan Yu, Xiaohui Yuan, Shengwu Xiong, Xianzhong Feng

Encouraging experimental results of the proposed method in comparison to the state-of-the-art leaf species recognition methods demonstrate the availability of cultivar information in soybean leaves and effectiveness of the proposed MSCM for soybean cultivar identification, which may advance the research in leaf recognition from species to cultivar.

MobileFAN: Transferring Deep Hidden Representation for Face Alignment

no code implementations11 Aug 2019 Yang Zhao, Yifan Liu, Chunhua Shen, Yongsheng Gao, Shengwu Xiong

To this end, we propose an effective lightweight model, namely Mobile Face Alignment Network (MobileFAN), using a simple backbone MobileNetV2 as the encoder and three deconvolutional layers as the decoder.

Face Alignment Facial Landmark Detection

Robust Facial Landmark Localization Based on Texture and Pose Correlated Initialization

no code implementations15 May 2018 Yiyun Pan, Junwei Zhou, Yongsheng Gao, Shengwu Xiong

In this paper, we propose a Robust Initialization for Cascaded Pose Regression (RICPR) by providing texture and pose correlated initial shapes for the testing face.

Face Alignment regression

Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes?

no code implementations CVPR 2017 Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle

A novel chord bunch walks (CBW) descriptor is developed through the chord walking that effectively integrates the shape image function over the walked chord to reflect the contour features and the inner properties of the shape.

Translation

Face Recognition using Optimal Representation Ensemble

no code implementations3 Oct 2011 Hanxi Li, Chunhua Shen, Yongsheng Gao

It also overwhelms other modular heuristics on the faces with random occlusions, extreme expressions and disguises.

Face Recognition Model Selection

Cannot find the paper you are looking for? You can Submit a new open access paper.