Search Results for author: Mang Ye

Found 37 papers, 26 papers with code

Transformer for Object Re-Identification: A Survey

no code implementations • 13 Jan 2024 • Mang Ye, Shuoyi Chen, Chenyue Li, Wei-Shi Zheng, David Crandall, Bo Du

Object Re-Identification (Re-ID) aims to identify and retrieve specific objects from varying viewpoints.

Negative Pre-aware for Noisy Cross-modal Matching

1 code implementation • 10 Dec 2023 • Xu Zhang, Hao Li, Mang Ye

Since clean samples are more easily distinguished by GMM with increasing noise, the memory bank can still maintain high quality at a high noise ratio.

Image-text matching Image-to-Text Retrieval +2

Federated Learning for Generalization, Robustness, Fairness: A Survey and Benchmark

1 code implementation • 12 Nov 2023 • Wenke Huang, Mang Ye, Zekun Shi, Guancheng Wan, He Li, Bo Du, Qiang Yang

In this survey, we provide a systematic overview of the important and recent developments of research on federated learning.

Fairness Federated Learning +1

Rotation Invariant Transformer for Recognizing Object in UAVs

3 code implementations • ACM Multimedia 2022 • Shuoyi Chen, Mang Ye, Bo Du

Existing methods are usually designed for city cameras and are incapable of handling the rotation issue in UAV scenarios.

Object Person Re-Identification +1

Generalizable Heterogeneous Federated Cross-Correlation and Instance Similarity Learning

2 code implementations • 28 Sep 2023 • Wenke Huang, Mang Ye, Zekun Shi, Bo Du

Federated learning is an important privacy-preserving multi-party learning paradigm, involving collaborative learning with others and local updating on private data.

Domain Generalization Federated Learning +1

Rethinking Client Drift in Federated Learning: A Logit Perspective

no code implementations • 20 Aug 2023 • Yunlu Yan, Chun-Mei Feng, Mang Ye, WangMeng Zuo, Ping Li, Rick Siow Mong Goh, Lei Zhu, C. L. Philip Chen

Concretely, FedCSD introduces a class prototype similarity distillation to align the local logits with the refined global logits that are weighted by the similarity between local logits and the global prototype.

Federated Learning

An Empirical Study of CLIP for Text-based Person Search

1 code implementation • 19 Aug 2023 • Min Cao, Yang Bai, Ziyin Zeng, Mang Ye, Min Zhang

TBPS, as a fine-grained cross-modal retrieval task, is also facing the rise of research on the CLIP-based TBPS.

Ranked #4 on Text based Person Retrieval on RSTPReid

Cross-Modal Retrieval Data Augmentation +5

Heterogeneous Federated Learning: State-of-the-art and Research Challenges

2 code implementations • 20 Jul 2023 • Mang Ye, Xiuwen Fang, Bo Du, Pong C. Yuen, DaCheng Tao

Therefore, a systematic survey of the research challenges and state of the art on this topic is essential.

Federated Learning

Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution

1 code implementation • 1 Jun 2023 • Wuxuan Shi, Mang Ye, Bo Du

(2) For the cross-modality gap, we propose a novel Symmetric Uncertainty scheme to remove parts of RGB information harmful to the recovery of HR depth maps.

Super-Resolution

Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval

1 code implementation • CVPR 2023 • Ding Jiang, Mang Ye

To alleviate these issues, we present IRRA: a cross-modal Implicit Relation Reasoning and Aligning framework that learns relations between local visual-textual tokens and enhances global image-text matching without requiring additional prior supervision.

Ranked #5 on Text based Person Retrieval on RSTPReid (using extra training data)

Image-text matching Language Modelling +8

171

Towards Modality-Agnostic Person Re-Identification With Descriptive Query

1 code implementation • CVPR 2023 • Cuiqun Chen, Mang Ye, Ding Jiang

Person re-identification (ReID) with descriptive query (text or sketch) provides an important supplement for general image-image paradigms, which is usually studied in a single cross-modality matching manner, e.g., text-to-image or sketch-to-photo.

Descriptive Person Re-Identification +1

Robust Heterogeneous Federated Learning under Data Corruption

1 code implementation • ICCV 2023 • Xiuwen Fang, Mang Ye, Xiyuan Yang

Model heterogeneous federated learning is a realistic and challenging problem.

Data Augmentation Federated Learning +1

Prototype Reminiscence and Augmented Asymmetric Knowledge Aggregation for Non-Exemplar Class-Incremental Learning

no code implementations • ICCV 2023 • Wuxuan Shi, Mang Ye

However, since the model continuously learns new knowledge, the stored prototypical representations cannot correctly model the properties of old classes in the presence of knowledge updates.

Class Incremental Learning Incremental Learning

Rethinking Federated Learning With Domain Shift: A Prototype View

2 code implementations • CVPR 2023 • Wenke Huang, Mang Ye, Zekun Shi, He Li, Bo Du

The private model presents degenerative performance on other domains (with domain shift).

Federated Learning Privacy Preserving

Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning

1 code implementation • CVPR 2023 • Zesen Wu, Mang Ye

In response, we devise a Progressive Graph Matching method to globally mine cross-modality correspondences under cluster imbalance scenarios.

Contrastive Learning Graph Matching +1

Towards Grand Unified Representation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation • ICCV 2023 • Bin Yang, Jun Chen, Mang Ye

The grand unified representation lies in two aspects: 1) GUR adopts a bottom-up domain learning strategy with a cross-memory association embedding module to explore the information of hierarchical domains, i.e., intra-camera, inter-camera, and inter-modality domains, learning a unified and robust representation against hierarchical discrepancy.

Person Re-Identification Representation Learning

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning

1 code implementation • 28 Nov 2022 • Xian Zhong, Zipeng Li, Shuqin Chen, Kui Jiang, Chen Chen, Mang Ye

In this paper, we introduce a novel Refined Semantic enhancement method towards Frequency Diffusion (RSFD), a captioning model that constantly perceives the linguistic representation of the infrequent tokens.

FAD Video Captioning

Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation • ACM MM 2022 • Bin Yang, Mang Ye, Jun Chen, Zesen Wu

Visible-infrared person re-identification (VI-ReID) aims at searching out the corresponding infrared (visible) images from a gallery set captured by other spectrum cameras.

Contrastive Learning Person Re-Identification

Few-Shot Model Agnostic Federated Learning

2 code implementations • Proceedings of the 30th ACM International Conference on Multimedia 2022 • Wenke Huang, Mang Ye, Bo Du, Xiang Gao

To address these issues, this paper presents a novel framework with two main parts: 1) model agnostic federated learning, which performs public-private communication by unifying the model prediction outputs on the shared public datasets; 2) latent embedding adaptation, which addresses the domain gap with an adversarial learning scheme to discriminate the public and private domains.

Federated Learning

Pyramidal Transformer with Conv-Patchify for Person Re-identification

2 code implementations • Proceedings of the 30th ACM International Conference on Multimedia 2022 • He Li, Mang Ye, Cong Wang, Bo Du

Robust and discriminative feature extraction is the key component in person re-identification (Re-ID).

Person Re-Identification Re-Ranking +1

Learnable Privacy-Preserving Anonymization for Pedestrian Images

1 code implementation • 24 Jul 2022 • Junwu Zhang, Mang Ye, Yao Yang

We further propose a progressive training strategy to improve the performance, which iteratively upgrades the initial anonymization supervision.

Person Re-Identification Privacy Preserving

Learn From Others and Be Yourself in Heterogeneous Federated Learning

1 code implementation • CVPR 2022 • Wenke Huang, Mang Ye, Bo Du

Federated learning has emerged as an important distributed learning paradigm, which normally involves collaborative updating with others and local updating on private data.

Continual Learning Federated Learning +2

Robust Federated Learning With Noisy and Heterogeneous Clients

4 code implementations • CVPR 2022 • Xiuwen Fang, Mang Ye

Model heterogeneous federated learning is a challenging task since each client independently designs its own model.

Federated Learning

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

7 code implementations • 29 Nov 2021 • Eric Zhongcong Xu, Zeyang Song, Satoshi Tsutsui, Chao Feng, Mang Ye, Mike Zheng Shou

Audio-visual speaker diarization aims at detecting "who spoke when" using both auditory and visual signals.

Relation Network speaker-diarization +1

4,966

The Multi-Modal Video Reasoning and Analyzing Competition

no code implementations • 18 Aug 2021 • Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021.

Action Recognition Person Re-Identification +3

TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval

no code implementations • 5 May 2021 • Yongbiao Chen, Sheng Zhang, Fangxin Liu, Zhigang Chang, Mang Ye, Zhengwei Qi

Until now, the deep hashing for the image retrieval community has been dominated by convolutional neural network architectures, e.g., ResNet (He et al., 2016).

Deep Hashing Image Retrieval

Person Re-Identification by Context-aware Part Attention and Multi-Head Collaborative Learning

no code implementations • IEEE Transactions on Information Forensics and Security 2021 • Dongming Wu, Mang Ye, Gaojie Lin, Xin Gao, Jianbing Shen

In addition, we propose a novel multi-head collaborative training scheme to improve the performance, which is collaboratively supervised by multiple heads with the same structure but different parameters.

Video-Based Person Re-Identification

Channel Augmented Joint Learning for Visible-Infrared Recognition

1 code implementation • ICCV 2021 • Mang Ye, Weijian Ruan, Bo Du, Mike Zheng Shou

This paper introduces a powerful channel augmented joint learning strategy for the visible-infrared recognition problem.

Data Augmentation Metric Learning

Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation

no code implementations • ICCV 2021 • Xin Hao, Sanyuan Zhao, Mang Ye, Jianbing Shen

Cross-modality person re-identification is a challenging task due to large cross-modality discrepancy and intra-modality variations.

Cross-Modality Person Re-identification Person Re-Identification

Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification

no code implementations • 12 Dec 2020 • Can Zhang, Hong Liu, Wei Guo, Mang Ye

RGB-Infrared person re-identification (RGB-IR Re-ID) aims to match persons from heterogeneous images captured by visible and thermal cameras, which is of great significance for surveillance systems under poor lighting conditions.

Person Re-Identification

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

5 code implementations • ECCV 2020 • Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo

In this paper, we propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID.

Person Re-Identification Retrieval

334

Probabilistic Structural Latent Representation for Unsupervised Embedding

1 code implementation • 22 Jun 2020 • Mang Ye, Jianbing Shen

Unsupervised embedding learning aims at extracting low-dimensional visually meaningful representations from large-scale unlabeled images, which can then be directly used for similarity-based search.

Data Augmentation

Probabilistic Structural Latent Representation for Unsupervised Embedding

no code implementations • CVPR 2020 • Mang Ye, Jianbing Shen

Unsupervised embedding learning aims at extracting low-dimensional visually meaningful representations from large-scale unlabeled images, which can then be directly used for similarity-based search.

Ranked #57 on Image Classification on STL-10

Data Augmentation Image Classification

Deep Learning for Person Re-identification: A Survey and Outlook

5 code implementations • 13 Jan 2020 • Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, Steven C. H. Hoi

The widely studied closed-world setting is usually applied under various research-oriented assumptions, and has achieved inspiring success using deep learning techniques on a number of datasets.

Ranked #1 on Cross-Modal Person Re-Identification on RegDB-C

Cross-Modal Person Re-Identification Metric Learning +2

3,946

Unsupervised Embedding Learning via Invariant and Spreading Instance Feature

1 code implementation • CVPR 2019 • Mang Ye, Xu Zhang, Pong C. Yuen, Shih-Fu Chang

This paper studies the unsupervised embedding learning problem, which requires an effective similarity measurement between samples in low-dimensional embedding space.

Data Augmentation

202
