Search Results for author: Mang Ye

Found 54 papers, 37 papers with code

Robust Asymmetric Heterogeneous Federated Learning with Corrupted Clients

1 code implementation12 Mar 2025 Xiuwen Fang, Mang Ye, Bo Du

This paper studies a challenging robust federated learning task with model heterogeneous and data corrupted clients, where the clients have different local model structures.

Contrastive Learning Data Augmentation +1

Privacy-Enhancing Paradigms within Federated Multi-Agent Systems

no code implementations11 Mar 2025 Zitong Shi, Guancheng Wan, Wenke Huang, Guibin Zhang, Jiawei Shao, Mang Ye, Carl Yang

LLM-based Multi-Agent Systems (MAS) have proven highly effective in solving complex problems by integrating multiple agents, each performing different roles.

RAG Retrieval

Synthetic Data is an Elegant GIFT for Continual Vision-Language Models

no code implementations6 Mar 2025 Bin Wu, Wuxuan Shi, Jinqiao Wang, Mang Ye

Pre-trained Vision-Language Models (VLMs) require Continual Learning (CL) to efficiently update their knowledge and adapt to various downstream tasks without retraining from scratch.

Continual Learning Image Generation

A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations

1 code implementation14 Feb 2025 Mang Ye, Xuankun Rong, Wenke Huang, Bo Du, Nenghai Yu, DaCheng Tao

With the rapid advancement of Large Vision-Language Models (LVLMs), ensuring their safety has emerged as a crucial area of research.

Survey

Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning

2 code implementations17 Nov 2024 Wenke Huang, Jian Liang, Zekun Shi, Didi Zhu, Guancheng Wan, He Li, Bo Du, DaCheng Tao, Mang Ye

To balance the trade-off between generalization and specialization, we propose measuring the parameter importance for both pre-trained and fine-tuning distributions, based on frozen pre-trained weight magnitude and accumulated fine-tuning gradient values.

Image Captioning Language Modeling +5

FedSSP: Federated Graph Learning with Spectral Knowledge and Personalized Preference

1 code implementation26 Oct 2024 Zihan Tan, Guancheng Wan, Wenke Huang, Mang Ye

Personalized Federated Graph Learning (pFGL) facilitates the decentralized training of Graph Neural Networks (GNNs) without compromising privacy while accommodating personalized requirements for non-IID participants.

Graph Learning

Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification

no code implementations9 Oct 2024 Chenyue Li, Shuoyi Chen, Mang Ye

Wildlife ReID involves utilizing visual technology to identify specific individuals of wild animals in different scenarios, holding significant importance for wildlife conservation, ecological research, and environmental monitoring.

Domain Generalization

Federated Graph Semantic and Structural Learning

1 code implementation27 Jun 2024 Wenke Huang, Guancheng Wan, Mang Ye, Bo Du

First, for node-level semantics, we find that contrasting nodes from distinct classes is beneficial to provide a well-performing discrimination.

Graph Learning Graph Neural Network

EmoLLM: Multimodal Emotional Understanding Meets Large Language Models

1 code implementation24 Jun 2024 Qu Yang, Mang Ye, Bo Du

Experimental results demonstrate that EmoLLM significantly elevates multimodal emotional understanding performance, with an average improvement of 12. 1% across multiple foundation models on EmoBench.

Emotional Intelligence

CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

no code implementations9 Jun 2024 Tangfei Liao, Xiaoqin Zhang, Guobao Xiao, Min Li, Tao Wang, Mang Ye

To tackle these challenges, we propose a pre-training method to acquire a generic inliers-consistent representation by reconstructing masked correspondences, providing a strong initial representation for downstream tasks.

Representation Learning

Vertical Federated Learning for Effectiveness, Security, Applicability: A Survey

1 code implementation25 May 2024 Mang Ye, Wei Shen, Bo Du, Eduard Snezhko, Vassili Kovalev, Pong C. Yuen

Vertical Federated Learning (VFL) is a privacy-preserving distributed learning paradigm where different parties collaboratively learn models using partitioned features of shared samples, without leaking private data.

Privacy Preserving Survey +1

All in One Framework for Multimodal Re-identification in the Wild

no code implementations CVPR 2024 He Li, Mang Ye, Ming Zhang, Bo Du

In Re-identification (ReID), recent advancements yield noteworthy progress in both unimodal and cross-modal retrieval tasks.

All Cross-Modal Retrieval +2

Semi-supervised Text-based Person Search

no code implementations28 Apr 2024 Daming Gao, Yang Bai, Min Cao, Hao Dou, Mang Ye, Min Zhang

Text-based person search (TBPS) aims to retrieve images of a specific person from a large image gallery based on a natural language description.

Image Captioning Person Search +2

Transformer for Object Re-Identification: A Survey

1 code implementation13 Jan 2024 Mang Ye, Shuoyi Chen, Chenyue Li, Wei-Shi Zheng, David Crandall, Bo Du

Object Re-identification (Re-ID) aims to identify specific objects across different times and scenes, which is a widely researched task in computer vision.

Object Survey

FedAS: Bridging Inconsistency in Personalized Federated Learning

2 code implementations CVPR 2024 Xiyuan Yang, Wenke Huang, Mang Ye

In PFL clients update their shared parameters to communicate and learn from others while keeping personalized parts unchanged leading to poor coordination between these two components.

Personalized Federated Learning

Shallow-Deep Collaborative Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation CVPR 2024 Bin Yang, Jun Chen, Mang Ye

Unsupervised visible-infrared person re-identification (US-VI-ReID) centers on learning a cross-modality retrieval model without labels reducing the reliance on expensive cross-modality manual annotation.

Collaborative Ranking Contrastive Learning +1

Negative Pre-aware for Noisy Cross-modal Matching

2 code implementations10 Dec 2023 Xu Zhang, Hao Li, Mang Ye

Since clean samples are easier distinguished by GMM with increasing noise, the memory bank can still maintain high quality at a high noise ratio.

Cross-modal retrieval with noisy correspondence Image-text matching +3

Federated Learning for Generalization, Robustness, Fairness: A Survey and Benchmark

1 code implementation12 Nov 2023 Wenke Huang, Mang Ye, Zekun Shi, Guancheng Wan, He Li, Bo Du, Qiang Yang

In this survey, we provide a systematic overview of the important and recent developments of research on federated learning.

Fairness Federated Learning +2

Rotation Invariant Transformer for Recognizing Object in UAVs

3 code implementations ACM Multimedia 2022 Shuoyi Chen, Mang Ye, Bo Du

Existing methods are usually designed for city cameras, incapable of handing the rotation issue in UAV scenarios.

Object Person Re-Identification +1

Generalizable Heterogeneous Federated Cross-Correlation and Instance Similarity Learning

2 code implementations28 Sep 2023 Wenke Huang, Mang Ye, Zekun Shi, Bo Du

Federated learning is an important privacy-preserving multi-party learning paradigm, involving collaborative learning with others and local updating on private data.

Domain Generalization Federated Learning +1

Rethinking Client Drift in Federated Learning: A Logit Perspective

no code implementations20 Aug 2023 Yunlu Yan, Chun-Mei Feng, Mang Ye, WangMeng Zuo, Ping Li, Rick Siow Mong Goh, Lei Zhu, C. L. Philip Chen

Concretely, FedCSD introduces a class prototype similarity distillation to align the local logits with the refined global logits that are weighted by the similarity between local logits and the global prototype.

Federated Learning

An Empirical Study of CLIP for Text-based Person Search

1 code implementation19 Aug 2023 Min Cao, Yang Bai, Ziyin Zeng, Mang Ye, Min Zhang

TPBS, as a fine-grained cross-modal retrieval task, is also facing the rise of research on the CLIP-based TBPS.

Cross-Modal Retrieval Data Augmentation +5

Heterogeneous Federated Learning: State-of-the-art and Research Challenges

2 code implementations20 Jul 2023 Mang Ye, Xiuwen Fang, Bo Du, Pong C. Yuen, DaCheng Tao

Therefore, a systematic survey on this topic about the research challenges and state-of-the-art is essential.

Federated Learning Survey

Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution

1 code implementation1 Jun 2023 Wuxuan Shi, Mang Ye, Bo Du

(2) For the cross-modality gap, we propose a novel Symmetric Uncertainty scheme to remove parts of RGB information harmful to the recovery of HR depth maps.

Super-Resolution

Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval

1 code implementation CVPR 2023 Ding Jiang, Mang Ye

To alleviate these issues, we present IRRA: a cross-modal Implicit Relation Reasoning and Aligning framework that learns relations between local visual-textual tokens and enhances global image-text matching without requiring additional prior supervision.

Image-text matching Language Modeling +11

Towards Modality-Agnostic Person Re-Identification With Descriptive Query

1 code implementation CVPR 2023 Cuiqun Chen, Mang Ye, Ding Jiang

Person re-identification (ReID) with descriptive query (text or sketch) provides an important supplement for general image-image paradigms, which is usually studied in a single cross-modality matching manner, e. g., text-to-image or sketch-to-photo.

Descriptive Person Re-Identification +1

Towards Grand Unified Representation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation ICCV 2023 Bin Yang, Jun Chen, Mang Ye

The grand unified representation lies in two aspects: 1) GUR adopts a bottom-up domain learning strategy with a cross-memory association embedding module to explore the information of hierarchical domains, i. e., intra-camera, inter-camera, and inter-modality domains, learning a unified and robust representation against hierarchical discrepancy.

Person Re-Identification Representation Learning

Prototype Reminiscence and Augmented Asymmetric Knowledge Aggregation for Non-Exemplar Class-Incremental Learning

no code implementations ICCV 2023 Wuxuan Shi, Mang Ye

However, since the model continuously learns new knowledge, the stored prototypical representations cannot correctly model the properties of old classes in the existence of knowledge updates.

class-incremental learning Class Incremental Learning +1

Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning

1 code implementation CVPR 2023 Zesen Wu, Mang Ye

In response, we devise a Progressive Graph Matching method to globally mine cross-modality correspondences under cluster imbalance scenarios.

Contrastive Learning Graph Matching +1

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning

1 code implementation28 Nov 2022 Xian Zhong, Zipeng Li, Shuqin Chen, Kui Jiang, Chen Chen, Mang Ye

In this paper, we introduce a novel Refined Semantic enhancement method towards Frequency Diffusion (RSFD), a captioning model that constantly perceives the linguistic representation of the infrequent tokens.

FAD Video Captioning

Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification

1 code implementation ACM MM 2022 Bin Yang, Mang Ye, Jun Chen, Zesen Wu

Visible infrared person re-identification (VI-ReID) aims at searching out the corresponding infrared (visible) images from a gallery set captured by other spectrum cameras.

Contrastive Learning Person Re-Identification

Few-Shot Model Agnostic Federated Learning

4 code implementations Proceedings of the 30th ACM International Conference on Multimedia 2022 Wenke Huang, Mang Ye, Bo Du, Xiang Gao

To address these issues, this paper presents a novel framework with two main parts: 1) model agnostic federated learning, it performs public-private communication by unifying the model prediction outputs on the shared public datasets; 2) latent embedding adaptation, it addresses the domain gap with an adversarial learning scheme to discriminate the public and private domains.

Federated Learning model

Learnable Privacy-Preserving Anonymization for Pedestrian Images

1 code implementation24 Jul 2022 Junwu Zhang, Mang Ye, Yao Yang

We further propose a progressive training strategy to improve the performance, which iteratively upgrades the initial anonymization supervision.

Decoder Person Re-Identification +1

Learn From Others and Be Yourself in Heterogeneous Federated Learning

1 code implementation CVPR 2022 Wenke Huang, Mang Ye, Bo Du

Federated learning has emerged as an important distributed learning paradigm, which normally involves collaborative updating with others and local updating on private data.

Continual Learning Federated Learning +2

Robust Federated Learning With Noisy and Heterogeneous Clients

5 code implementations CVPR 2022 Xiuwen Fang, Mang Ye

Model heterogeneous federated learning is a challenging task since each client independently designs its own model.

Federated Learning

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

8 code implementations29 Nov 2021 Eric Zhongcong Xu, Zeyang Song, Satoshi Tsutsui, Chao Feng, Mang Ye, Mike Zheng Shou

Audio-visual speaker diarization aims at detecting "who spoke when" using both auditory and visual signals.

Relation Network speaker-diarization +1

TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval

no code implementations5 May 2021 Yongbiao Chen, Sheng Zhang, Fangxin Liu, Zhigang Chang, Mang Ye, Zhengwei Qi

Until now, the deep hashing for the image retrieval community has been dominated by convolutional neural network architectures, e. g. \texttt{Resnet}\cite{he2016deep}.

Deep Hashing Image Retrieval

Person Re-Identification by Context-aware Part Attention and Multi-Head Collaborative Learning

no code implementations IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY 2021 Dongming Wu, Mang Ye, Gaojie Lin, Xin Gao, Jianbing Shen

In addition, we propose a novel multi-head collaborative training scheme to improve the performance, which is collaboratively supervised by multiple heads with the same structure but different parameters.

Video-Based Person Re-Identification

Channel Augmented Joint Learning for Visible-Infrared Recognition

2 code implementations ICCV 2021 Mang Ye, Weijian Ruan, Bo Du, Mike Zheng Shou

This paper introduces a powerful channel augmented joint learning strategy for the visible-infrared recognition problem.

Data Augmentation Diversity +1

Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification

no code implementations12 Dec 2020 Can Zhang, Hong Liu, Wei Guo, Mang Ye

RGB-Infrared person re-identification (RGB-IR Re-ID) aims to match persons from heterogeneous images captured by visible and thermal cameras, which is of great significance in the surveillance system under poor light conditions.

Person Re-Identification

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification

5 code implementations ECCV 2020 Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo

In this paper, we propose a novel dynamic dual-attentive aggregation (DDAG) learning method by mining both intra-modality part-level and cross-modality graph-level contextual cues for VI-ReID.

Person Re-Identification Retrieval

Probabilistic Structural Latent Representation for Unsupervised Embedding

1 code implementation22 Jun 2020 Mang Ye, Jianbing Shen∗

Unsupervised embedding learning aims at extracting low-dimensional visually meaningful representations from large-scale unlabeled images, which can then be directly used for similarity-based search.

Data Augmentation

Probabilistic Structural Latent Representation for Unsupervised Embedding

no code implementations CVPR 2020 Mang Ye, Jianbing Shen

Unsupervised embedding learning aims at extracting low-dimensional visually meaningful representations from large-scale unlabeled images, which can then be directly used for similarity-based search.

Data Augmentation Image Classification

Deep Learning for Person Re-identification: A Survey and Outlook

7 code implementations13 Jan 2020 Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, Steven C. H. Hoi

The widely studied closed-world setting is usually applied under various research-oriented assumptions, and has achieved inspiring success using deep learning techniques on a number of datasets.

Cross-Modal Person Re-Identification Deep Learning +4

Unsupervised Embedding Learning via Invariant and Spreading Instance Feature

1 code implementation CVPR 2019 Mang Ye, Xu Zhang, Pong C. Yuen, Shih-Fu Chang

This paper studies the unsupervised embedding learning problem, which requires an effective similarity measurement between samples in low-dimensional embedding space.

Data Augmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.