Search Results for author: Zhanyu Ma

Found 75 papers, 34 papers with code

GINet: Graph Interaction Network for Scene Parsing

1 code implementation • ECCV 2020 • Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo

GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph.

Scene Parsing

8,228

Paper
Code

DemoFusion: Democratising High-Resolution Image Generation With No $$$

1 code implementation • 24 Nov 2023 • Ruoyi Du, Dongliang Chang, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma

High-resolution image generation with Generative Artificial Intelligence (GenAI) has immense potential but, due to the enormous capital investment required for training, it is increasingly centralised to a few large corporations, and hidden behind paywalls.

Image Generation

1,855

Paper
Code

The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

3 code implementations • 11 Feb 2020 • Dongliang Chang, Yifeng Ding, Jiyang Xie, Ayan Kumar Bhunia, Xiaoxu Li, Zhanyu Ma, Ming Wu, Jun Guo, Yi-Zhe Song

The proposed loss function, termed as mutual-channel loss (MC-Loss), consists of two channel-specific components: a discriminality component and a diversity component.

Ranked #29 on Fine-Grained Image Classification on FGVC Aircraft

Fine-Grained Image Classification General Classification +1

301

Paper
Code

Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches

5 code implementations • ECCV 2020 • Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, Jun Guo

In this work, we propose a novel framework for fine-grained visual classification to tackle these problems.

Ranked #17 on Fine-Grained Image Classification on Stanford Cars

Classification Fine-Grained Image Classification +1

211

Paper
Code

Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis

1 code implementation • ACL 2021 • Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang, Eduard Hovy

To overcome these challenges, in this paper, we propose a dual graph convolutional networks (DualGCN) model that considers the complementarity of syntax structures and semantic correlations simultaneously.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

126

Paper
Code

Your "Flamingo" is My "Bird": Fine-Grained, or Not

1 code implementation • CVPR 2021 • Dongliang Chang, Kaiyue Pang, Yixiao Zheng, Zhanyu Ma, Yi-Zhe Song, Jun Guo

For that, we re-envisage the traditional setting of FGVC, from single-label classification, to that of top-down traversal of a pre-defined coarse-to-fine label hierarchy -- so that our answer becomes "bird"-->"Phoenicopteriformes"-->"Phoenicopteridae"-->"flamingo".

Ranked #16 on Fine-Grained Image Classification on FGVC Aircraft

Disentanglement Fine-Grained Image Classification +1

Paper
Code

On-the-Fly Category Discovery

1 code implementation • CVPR 2023 • Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma

Our code is available at https://github. com/PRIS-CV/On-the-fly-Category-Discovery.

Disentanglement Novel Class Discovery

Paper
Code

Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot Image Classification

1 code implementation • 30 Nov 2022 • Jijie Wu, Dongliang Chang, Aneeshan Sain, Xiaoxu Li, Zhanyu Ma, Jie Cao, Jun Guo, Yi-Zhe Song

Conventional few-shot learning methods however cannot be naively adopted for this fine-grained setting -- a quick pilot study reveals that they in fact push for the opposite (i. e., lower inter-class variations and higher intra-class variations).

Few-Shot Image Classification Few-Shot Learning +2

Paper
Code

OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer

1 code implementation • 20 Apr 2020 • Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo

A deep neural network of multiple nonlinear layers forms a large function space, which can easily lead to overfitting when it encounters small-sample data.

Classification General Classification

Paper
Code

BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

1 code implementation • 29 Nov 2020 • Xiaoxu Li, Jijie Wu, Zhuo Sun, Zhanyu Ma, Jie Cao, Jing-Hao Xue

Motivated by this, we propose a so-called \textit{Bi-Similarity Network} (\textit{BSNet}) that consists of a single embedding module and a bi-similarity module of two similarity measures.

Few-Shot Learning Fine-Grained Image Classification +1

Paper
Code

Duplex Contextual Relation Network for Polyp Segmentation

1 code implementation • 11 Mar 2021 • Zijin Yin, Kongming Liang, Zhanyu Ma, Jun Guo

However, previous methods only focus on learning the dependencies between the position within an individual image and ignore the contextual relation across different images.

Position Relation +1

Paper
Code

Progressive Co-Attention Network for Fine-grained Visual Classification

1 code implementation • 21 Jan 2021 • Tian Zhang, Dongliang Chang, Zhanyu Ma, Jun Guo

Fine-grained visual classification aims to recognize images belonging to multiple sub-categories within a same category.

Ranked #32 on Fine-Grained Image Classification on FGVC Aircraft

Classification Fine-Grained Image Classification +1

Paper
Code

Mind the Gap: Enlarging the Domain Gap in Open Set Domain Adaptation

2 code implementations • 8 Mar 2020 • Dongliang Chang, Aneeshan Sain, Zhanyu Ma, Yi-Zhe Song, Jun Guo

The key insight lies with how we exploit the mutually beneficial information between two networks; (a) to separate samples of known and unknown classes, (b) to maximize the domain confusion between source and target domain without the influence of unknown samples.

Unsupervised Domain Adaptation

Paper
Code

Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization

1 code implementation • 11 Oct 2020 • Jiyang Xie, Zhanyu Ma, and Jianjun Lei, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan, Jun Guo

Due to lack of data, overfitting ubiquitously exists in real-world applications of deep neural networks (DNNs).

Network Pruning text-classification +1

Paper
Code

Making a Bird AI Expert Work for You and Me

1 code implementation • 6 Dec 2021 • Dongliang Chang, Kaiyue Pang, Ruoyi Du, Zhanyu Ma, Yi-Zhe Song, Jun Guo

1 lays out our approach in answering this question.

Fine-Grained Image Classification

Paper
Code

Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features

2 code implementations • 31 Jan 2021 • Dongliang Chang, Yixiao Zheng, Zhanyu Ma, Ruoyi Du, Kongming Liang

Finally, we can obtain multiple discriminative regions on high-level feature channels and obtain multiple more minute regions within these discriminative regions on middle-level feature channels.

Fine-Grained Image Classification General Classification

Paper
Code

ScaleNet: Searching for the Model to Scale

1 code implementation • 15 Jul 2022 • Jiyang Xie, Xiu Su, Shan You, Zhanyu Ma, Fei Wang, Chen Qian

Recently, community has paid increasing attention on model scaling and contributed to developing a model family with a wide spectrum of scales.

Paper
Code

Dual-attention Guided Dropblock Module for Weakly Supervised Object Localization

1 code implementation • 9 Mar 2020 • Junhui Yin, Siqing Zhang, Dongliang Chang, Zhanyu Ma, Jun Guo

This module contains two key components, the channel attention guided dropout (CAGD) and the spatial attention guided dropblock (SAGD).

Weakly-Supervised Object Localization

Paper
Code

Super-Resolution Information Enhancement For Crowd Counting

1 code implementation • 13 Mar 2023 • Jiahao Xie, Wei Xu, Dingkang Liang, Zhanyu Ma, Kongming Liang, Weidong Liu, Rui Wang, Ling Jin

As the proposed method requires SR labels, we further propose a Super-Resolution Crowd Counting dataset (SR-Crowd).

Crowd Counting Super-Resolution

Paper
Code

Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action Recognition

1 code implementation • 18 Sep 2023 • Ming-Zhe Li, Zhen Jia, Zhang Zhang, Zhanyu Ma, Liang Wang

In order to solve this dilemma, we propose a multi-semantic fusion (MSF) model for improving the performance of GZSSAR, where two kinds of class-level textual descriptions (i. e., action descriptions and motion descriptions), are collected as auxiliary semantic information to enhance the learning efficacy of generalizable skeleton features.

Ranked #1 on Generalized Zero Shot skeletal action recognition on NTU RGB+D 120

Action Recognition Generalized Zero Shot skeletal action recognition +1

Paper
Code

SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

1 code implementation • CVPR 2018 • Peng Xu, Yongye Huang, Tongtong Yuan, Kaiyue Pang, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo

Key to our network design is the embedding of unique characteristics of human sketch, where (i) a two-branch CNN-RNN architecture is adapted to explore the temporal ordering of strokes, and (ii) a novel hashing loss is specifically designed to accommodate both the temporal and abstract traits of sketches.

Deep Hashing Sketch Recognition

Paper
Code

Knowledge Transfer Based Fine-grained Visual Classification

1 code implementation • 21 Dec 2020 • Siqing Zhang, Ruoyi Du, Dongliang Chang, Zhanyu Ma, Jun Guo

Convolution neural networks (CNNs), which employ the cross entropy loss (CE-loss) as the loss function, show poor performance since the model can only learn the most discriminative part and ignore other meaningful regions.

Ranked #36 on Fine-Grained Image Classification on CUB-200-2011

Classification Fine-Grained Image Classification +2

Paper
Code

Learning Invariant Visual Representations for Compositional Zero-Shot Learning

1 code implementation • 1 Jun 2022 • Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo

Compositional Zero-Shot Learning (CZSL) aims to recognize novel compositions using knowledge learned from seen attribute-object compositions in the training set.

Attribute Compositional Zero-Shot Learning +2

Paper
Code

Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data

1 code implementation • 6 Dec 2021 • Ruoyi Du, Dongliang Chang, Zhanyu Ma, Yi-Zhe Song, Jun Guo

Despite great strides made on fine-grained visual classification (FGVC), current methods are still heavily reliant on fully-supervised paradigms where ample expert labels are called for.

Fine-Grained Image Classification

Paper
Code

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

1 code implementation • 2 Mar 2024 • Zijin Yin, Kongming Liang, Bing Li, Zhanyu Ma, Jun Guo

We evaluate a broad variety of semantic segmentation models, spanning from conventional close-set models to recent open-vocabulary large models on their robustness to different types of variations.

Attribute Benchmarking +2

Paper
Code

CMF: Cascaded Multi-model Fusion for Referring Image Segmentation

1 code implementation • 16 Jun 2021 • Jianhua Yang, Yan Huang, Zhanyu Ma, Liang Wang

To solve this problem, we propose a simple yet effective Cascaded Multi-modal Fusion (CMF) module, which stacks multiple atrous convolutional layers in parallel and further introduces a cascaded branch to fuse visual and linguistic features.

Image Segmentation Segmentation +1

Paper
Code

ReMarNet: Conjoint Relation and Margin Learning for Small-Sample Image Classification

1 code implementation • 27 Jun 2020 • Xiaoxu Li, Liyun Yu, Xiaochen Yang, Zhanyu Ma, Jing-Hao Xue, Jie Cao, Jun Guo

Despite achieving state-of-the-art performance, deep learning methods generally require a large amount of labeled data during training and may suffer from overfitting when the sample size is small.

Classification General Classification +3

Paper
Code

Multi-View Active Fine-Grained Visual Recognition

1 code implementation • ICCV 2023 • Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, Zhanyu Ma

Despite the remarkable progress of Fine-grained visual classification (FGVC) with years of history, it is still limited to recognizing 2 images.

Fine-Grained Image Classification Fine-Grained Visual Recognition

Paper
Code

HumanRecon: Neural Reconstruction of Dynamic Human Using Geometric Cues and Physical Priors

1 code implementation • 26 Nov 2023 • Junhui Yin, Wei Yin, Hao Chen, Xuqian Ren, Zhanyu Ma, Jun Guo, Yifan Liu

These priors ensure the color rendered along rays to be robust to view direction and reduce the inherent ambiguities of density estimated along rays.

Novel View Synthesis

Paper
Code

GPCA: A Probabilistic Framework for Gaussian Process Embedded Channel Attention

1 code implementation • 10 Mar 2020 • Jiyang Xie, Dongliang Chang, Zhanyu Ma, Guo-Qiang Zhang, Jun Guo

In this paper, we propose Gaussian process embedded channel attention (GPCA) module and further interpret the channel attention schemes in a probabilistic way.

Image Classification

Paper
Code

Caption Feature Space Regularization for Audio Captioning

1 code implementation • 18 Apr 2022 • Yiming Zhang, Hong Yu, Ruoyi Du, Zhanyu Ma, Yuan Dong

To eliminate this negative effect, in this paper, we propose a two-stage framework for audio captioning: (i) in the first stage, via the contrastive learning, we construct a proxy feature space to reduce the distances between captions correlated to the same audio, and (ii) in the second stage, the proxy feature space is utilized as additional supervision to encourage the model to be optimized in the direction that benefits all the correlated captions.

Audio captioning Contrastive Learning

Paper
Code

Multi-View Active Fine-Grained Recognition

1 code implementation • 2 Jun 2022 • Ruoyi Du, Wenqing Yu, Heqing Wang, Dongliang Chang, Ting-En Lin, Yongbin Li, Zhanyu Ma

As fine-grained visual classification (FGVC) being developed for decades, great works related have exposed a key direction -- finding discriminative local regions and revealing subtle differences.

Fine-Grained Image Classification

Paper
Code

Competing Ratio Loss for Discriminative Multi-class Image Classification

1 code implementation • 25 Dec 2019 • Ke Zhang, Yurong Guo, Xinsheng Wang, Dongliang Chang, Zhenbing Zhao, Zhanyu Ma, Tony X. Han

However, during the training of the deep convolutional neural network, the value of NLLR is not always positive or negative, which severely affects the convergence of NLLR.

Age Estimation Classification +3

Paper
Code

Structured DropConnect for Uncertainty Inference in Image Classification

1 code implementation • 16 Jun 2021 • Wenqing Zheng, Jiyang Xie, Weidong Liu, Zhanyu Ma

For image classification tasks, we propose a structured DropConnect (SDC) framework to model the output of a deep neural network by a Dirichlet distribution.

Classification Image Classification +1

Paper
Code

Fine-Grained Age Estimation in the wild with Attention LSTM Networks

no code implementations • 26 May 2018 • Ke Zhang, Na Liu, Xingfang Yuan, Xinyao Guo, Ce Gao, Zhenbing Zhao, Zhanyu Ma

Then, we fine-tune the ResNets or the RoR on the target age datasets to extract the global features of face images.

Ranked #3 on Age And Gender Classification on Adience Age (using extra training data)

Age And Gender Classification Age Estimation +1

Paper
Add Code

Decorrelation of Neutral Vector Variables: Theory and Applications

no code implementations • 30 May 2017 • Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang, Jun Guo

In this paper, we propose novel strategies for neutral vector variable decorrelation.

Paper
Add Code

Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval

no code implementations • 28 May 2017 • Peng Xu, Qiyue Yin, Yongye Huang, Yi-Zhe Song, Zhanyu Ma, Liang Wang, Tao Xiang, W. Bastiaan Kleijn, Jun Guo

Sketch-based image retrieval (SBIR) is challenging due to the inherent domain-gap between sketch and photo.

Ranked #5 on Sketch-Based Image Retrieval on Chairs

Image-text matching Retrieval +2

Paper
Add Code

DNN Filter Bank Cepstral Coefficients for Spoofing Detection

no code implementations • 13 Feb 2017 • Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo

In order to improve the reliability of speaker verification systems, we develop a new filter bank based cepstral feature, deep neural network filter bank cepstral coefficients (DNN-FBCC), to distinguish between natural and spoofed speech.

Speaker Verification Speech Synthesis

Paper
Add Code

BALSON: Bayesian Least Squares Optimization with Nonnegative L1-Norm Constraint

no code implementations • 8 Jul 2018 • Jiyang Xie, Zhanyu Ma, Guo-Qiang Zhang, Jing-Hao Xue, Jen-Tzung Chien, Zhiqing Lin, Jun Guo

In order to explicitly characterize the nonnegative L1-norm constraint of the parameters, we further approximate the true posterior distribution by a Dirichlet distribution.

Paper
Add Code

Infinite Mixture of Inverted Dirichlet Distributions

no code implementations • 27 Jul 2018 • Zhanyu Ma, Yuping Lai

In this work, we develop a novel Bayesian estimation method for the Dirichlet process (DP) mixture of the inverted Dirichlet distributions, which has been shown to be very flexible for modeling vectors with positive elements.

Variational Inference

Paper
Add Code

SEA: A Combined Model for Heat Demand Prediction

no code implementations • 28 Jul 2018 • Jiyang Xie, Jiaxin Guo, Zhanyu Ma, Jing-Hao Xue, Qie Sun, Hailong Li, Jun Guo

ENN and ARIMA are used to predict seasonal and trend components, respectively.

Paper
Add Code

Dirichlet Mixture Model based VQ Performance Prediction for Line Spectral Frequency

no code implementations • 2 Aug 2018 • Zhanyu Ma

In this paper, we continue our previous work on the Dirichlet mixture model (DMM)-based VQ to derive the performance bound of the LSF VQ.

Quantization

Paper
Add Code

Classification of EEG Signal based on non-Gaussian Neutral Vector

no code implementations • 2 Aug 2018 • Zhanyu Ma

In the design of brain-computer interface systems, classification of Electroencephalogram (EEG) signals is the essential part and a challenging task.

Brain Computer Interface Classification +3

Paper
Add Code

Mobile big data analysis with machine learning

no code implementations • 2 Aug 2018 • Jiyang Xie, Zeyu Song, Yupeng Li, Zhanyu Ma

Finally, we summarize the main challenges and future development directions of mobile big data analysis.

BIG-bench Machine Learning speech-recognition +1

Paper
Add Code

Impacts of Weather Conditions on District Heat System

no code implementations • 2 Aug 2018 • Jiyang Xie, Zhanyu Ma, Jun Guo

Using artificial neural network for the prediction of heat demand has attracted more and more attention.

Paper
Add Code

Histogram Transform-based Speaker Identification

no code implementations • 2 Aug 2018 • Zhanyu Ma, Hong Yu

A novel text-independent speaker identification (SI) method is proposed.

Speaker Identification

Paper
Add Code

Deep Neural Network for Analysis of DNA Methylation Data

no code implementations • 2 Aug 2018 • Hong Yu, Zhanyu Ma

Many researches demonstrated that the DNA methylation, which occurs in the context of a CpG, has strong correlation with diseases, including cancer.

Paper
Add Code

Language Identification with Deep Bottleneck Features

no code implementations • 18 Sep 2018 • Zhanyu Ma, Hong Yu

In order to improve the SLD accuracy of short utterances a phase vocoder based time-scale modification(TSM) method is used to reduce and increase speech rated of the test utterance.

Language Identification Transfer Learning

Paper
Add Code

On the Convergence of Extended Variational Inference for Non-Gaussian Statistical Models

no code implementations • 13 Feb 2019 • Zhanyu Ma, Jalil Taghia, Jun Guo

Recently, an improved framework, namely the extended variational inference (EVI), has been introduced and applied to derive analytically tractable solution by employing lower-bound approximation to the variational objective function.

Variational Inference

Paper
Add Code

Channel Max Pooling Layer for Fine-Grained Vehicle Classification

no code implementations • 14 Feb 2019 • Zhanyu Ma, Dongliang Chang, Xiaoxu Li

Experimental results on two fine-grained vehicle datasets, the Stanford Cars-196 dataset and the Comp Cars dataset, demonstrate that the proposed layer could improve classification accuracies of deep neural networks on fine-grained vehicle classification in the situation that a massive of parameters are reduced.

Classification Fine-Grained Vehicle Classification +1

Paper
Add Code

Deep Zero-Shot Learning for Scene Sketch

no code implementations • 11 May 2019 • Yao Xie, Peng Xu, Zhanyu Ma

We introduce a novel problem of scene sketch zero-shot learning (SSZSL), which is a challenging task, since (i) different from photo, the gap between common semantic domain (e. g., word vector) and sketch is too huge to exploit common semantic knowledge as the bridge for knowledge transfer, and (ii) compared with single-object sketch, more expressive feature representation for scene sketch is required to accommodate its high-level of abstraction and complexity.

Transfer Learning Zero-Shot Learning

Paper
Add Code

Competing Ratio Loss for Discriminative Multi-class Image Classification

no code implementations • 31 Jul 2019 • Ke Zhang, Xinsheng Wang, Yurong Guo, Zhenbing Zhao, Zhanyu Ma, Tony X. Han

A lot of studies of image classification based on deep convolutional neural network focus on the network structure to improve the image classification performance.

Age Estimation Classification +3

Paper
Add Code

Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval

no code implementations • 10 Nov 2019 • Jianjun Lei, Yuxin Song, Bo Peng, Zhanyu Ma, Ling Shao, Yi-Zhe Song

How to align abstract sketches and natural images into a common high-level semantic space remains a key problem in SBIR.

Retrieval Sketch-Based Image Retrieval

Paper
Add Code

Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

no code implementations • 9 Feb 2020 • Yifeng Ding, Shaoguo Wen, Jiyang Xie, Dongliang Chang, Zhanyu Ma, Zhongwei Si, Haibin Ling

Classifying the sub-categories of an object from the same super-category (e. g. bird species, car and aircraft models) in fine-grained visual classification (FGVC) highly relies on discriminative feature representation and accurate region localization.

Fine-Grained Image Classification General Classification

Paper
Add Code

Fine-Grained Instance-Level Sketch-Based Video Retrieval

no code implementations • 21 Feb 2020 • Peng Xu, Kun Liu, Tao Xiang, Timothy M. Hospedales, Zhanyu Ma, Jun Guo, Yi-Zhe Song

Existing sketch-analysis work studies sketches depicting static objects or scenes.

Cross-Modal Retrieval Image Retrieval +2

Paper
Add Code

OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement

no code implementations • 8 Mar 2020 • Fangyi Zhu, Jenq-Neng Hwang, Zhanyu Ma, Guang Chen, Jun Guo

Thereafter, we construct a new dataset, providing consistent object-sentence pairs, to facilitate effective cross-modal learning.

Object Sentence +1

Paper
Add Code

A Concise Review of Recent Few-shot Meta-learning Methods

no code implementations • 22 May 2020 • Xiaoxu Li, Zhuo Sun, Jing-Hao Xue, Zhanyu Ma

Few-shot meta-learning has been recently reviving with expectations to mimic humanity's fast adaption to new concepts based on prior knowledge.

Meta-Learning

Paper
Add Code

SSKD: Self-Supervised Knowledge Distillation for Cross Domain Adaptive Person Re-Identification

no code implementations • 13 Sep 2020 • Junhui Yin, Jiayan Qiu, Siqing Zhang, Zhanyu Ma, Jun Guo

To this end, we propose a Self-Supervised Knowledge Distillation (SSKD) technique containing two modules, the identity learning and the soft label learning.

Clustering Domain Adaptive Person Re-Identification +2

Paper
Add Code

CC-Loss: Channel Correlation Loss For Image Classification

no code implementations • 12 Oct 2020 • Zeyu Song, Dongliang Chang, Zhanyu Ma, Xiaoxu Li, Zheng-Hua Tan

The loss function is a key component in deep learning models.

Classification General Classification +1

Paper
Add Code

Actor and Action Modular Network for Text-based Video Segmentation

no code implementations • 2 Nov 2020 • Jianhua Yang, Yan Huang, Kai Niu, Linjiang Huang, Zhanyu Ma, Liang Wang

Previous methods fail to explicitly align the video content with the textual query in a fine-grained manner according to the actor and its action, due to the problem of \emph{semantic asymmetry}.

Ranked #9 on Referring Expression Segmentation on J-HMDB

Action Segmentation Action Understanding +5

Paper
Add Code

DS-UI: Dual-Supervised Mixture of Gaussian Mixture Models for Uncertainty Inference

no code implementations • 17 Nov 2020 • Jiyang Xie, Zhanyu Ma, Jing-Hao Xue, Guoqiang Zhang, Jun Guo

In the DS-UI, we combine the classifier of a DNN, i. e., the last fully-connected (FC) layer, with a mixture of Gaussian mixture models (MoGMM) to obtain an MoGMM-FC layer.

Paper
Add Code

Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting

no code implementations • 15 Dec 2020 • Wei Xu, Dingkang Liang, Yixiao Zheng, Zhanyu Ma

In this paper, we propose a simple yet efficient counting network based on point-level annotations.

Object Object Counting

Paper
Add Code

TLRM: Task-level Relation Module for GNN-based Few-Shot Learning

no code implementations • 25 Jan 2021 • Yurong Guo, Zhanyu Ma, Xiaoxu Li, Yuan Dong

We consider this method of measuring relation of samples only models the sample-to-sample relation, while neglects the specificity of different tasks.

Few-Shot Learning Relation +1

Paper
Add Code

Grad-CAM guided channel-spatial attention module for fine-grained visual classification

no code implementations • 24 Jan 2021 • Shuai Xu, Dongliang Chang, Jiyang Xie, Zhanyu Ma

The proposed method outperforms the SOTA attention modules in the FGVC task.

Ranked #21 on Fine-Grained Image Classification on FGVC Aircraft

Fine-Grained Image Classification General Classification +1

Paper
Add Code

DF^2AM: Dual-level Feature Fusion and Affinity Modeling for RGB-Infrared Cross-modality Person Re-identification

no code implementations • 1 Apr 2021 • Junhui Yin, Zhanyu Ma, Jiyang Xie, Shibo Nie, Kongming Liang, Jun Guo

Meanwhile, to further mining the relationships between global features from person images, we propose an Affinities Modeling (AM) module to obtain the optimal intra- and inter-modality image matching.

Cross-Modality Person Re-identification Person Re-Identification

Paper
Add Code

Unsupervised Person Re-identification via Simultaneous Clustering and Consistency Learning

no code implementations • 1 Apr 2021 • Junhui Yin, Jiayan Qiu, Siqing Zhang, Jiyang Xie, Zhanyu Ma, Jun Guo

Unsupervised person re-identification (re-ID) has become an important topic due to its potential to resolve the scalability problem of supervised re-ID models.

Clustering Unsupervised Person Re-Identification

Paper
Add Code

Deep Metric Learning for Few-Shot Image Classification: A Review of Recent Developments

no code implementations • 17 May 2021 • Xiaoxu Li, Xiaochen Yang, Zhanyu Ma, Jing-Hao Xue

Few-shot image classification is a challenging problem that aims to achieve the human level of recognition based only on a small number of training images.

Classification Few-Shot Image Classification +3

Paper
Add Code

Channel DropBlock: An Improved Regularization Method for Fine-Grained Visual Classification

no code implementations • 7 Jun 2021 • Yifeng Ding, Shuwei Dong, Yujun Tong, Zhanyu Ma, Bo Xiao, Haibin Ling

Classifying the sub-categories of an object from the same super-category (e. g., bird) in a fine-grained visual classification (FGVC) task highly relies on mining multiple discriminative features.

Fine-Grained Image Classification

Paper
Add Code

Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification

no code implementations • 21 Jun 2021 • Chenyu Guo, Jiyang Xie, Kongming Liang, Xian Sun, Zhanyu Ma

Then, attention mechanisms are used after feature fusion to extract spatial and channel information while linking the high-level semantic information and the low-level texture features, which can better locate the discriminative regions for the FGVC.

Fine-Grained Image Classification

Paper
Add Code

Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction

no code implementations • 20 Jan 2022 • Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, Zhanyu Ma

Adaptation to out-of-distribution data is a meta-challenge for all statistical learning algorithms that strongly rely on the i. i. d.

Data Augmentation Disentanglement +2

Paper
Add Code

HCLD: A Hierarchical Framework for Zero-shot Cross-lingual Dialogue System

no code implementations • COLING 2022 • Zhanyu Ma, Jian Ye, Xurui Yang, Jianfeng Liu

Recently, many task-oriented dialogue systems need to serve users in different languages.

Intent Detection Sentence +3

Paper
Add Code

An Erudite Fine-Grained Visual Classification Model

no code implementations • CVPR 2023 • Dongliang Chang, Yujun Tong, Ruoyi Du, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma

Therefore, we first propose a feature disentanglement module and a feature re-fusion module to reduce negative transfer and boost positive transfer between different datasets.

Classification Disentanglement +2