Search Results for author: Xiangfeng Wang

Found 41 papers, 10 papers with code

Scalable Reinforcement Learning for Virtual Machine Scheduling

no code implementations 1 Mar 2025 Junjie Sheng, Jiehao Wu, Haochuan Cui, Yiqiu Hu, Wenli Zhou, Lei Zhu, Qian Peng, Wenhao Li, Xiangfeng Wang

This paper introduces a scalable RL framework, called Cluster Value Decomposition Reinforcement Learning (CVD-RL), to surmount the scalability hurdles inherent in large-scale VMS.

Cloud Computing reinforcement-learning +3

A Survey of Automatic Prompt Engineering: An Optimization Perspective

no code implementations 17 Feb 2025 Wenwu Li, Xiangfeng Wang, Wenhao Li, Bo Jin

The rise of foundation models has shifted focus from resource-intensive fine-tuning to prompt engineering, a paradigm that steers model behavior through input design rather than weight updates.

cross-modal alignment Prompt Engineering +1

Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review

no code implementations 17 Feb 2025 Di Wu, Xian Wei, Guang Chen, Hao Shen, Xiangfeng Wang, Wenhao Li, Bo Jin

Embodied multi-agent systems (EMAS) have attracted growing attention for their potential to address complex, real-world challenges in areas such as logistics and robotics.

GraphThought: Graph Combinatorial Optimization with Thought Generation

no code implementations 17 Feb 2025 Zixiao Huang, Lifeng Guo, Junjie Sheng, Haosheng Chen, Wenhao Li, Bo Jin, Changhong Lu, Xiangfeng Wang

Large language models (LLMs) have demonstrated remarkable capabilities across various domains, especially in text processing and generative tasks.

Combinatorial Optimization

SkyRover: A Modular Simulator for Cross-Domain Pathfinding

no code implementations 13 Feb 2025 Wenhui Ma, Wenhao Li, Bo Jin, Changhong Lu, Xiangfeng Wang

Unmanned Aerial Vehicles (UAVs) and Automated Guided Vehicles (AGVs) increasingly collaborate in logistics, surveillance, inspection, and other tasks.

Benchmarking

TrackDiffuser: Nearly Model-Free Bayesian Filtering with Diffusion Model

no code implementations 8 Feb 2025 Yangguang He, Wenhao Li, Minzhe Li, Juan Zhang, Xiangfeng Wang, Bo Jin

State estimation remains a fundamental challenge across numerous domains, from autonomous driving and aircraft tracking to quantum system control.

Autonomous Driving model

Verbalized Bayesian Persuasion

no code implementations 3 Feb 2025 Wenhao Li, Yue Lin, Xiangfeng Wang, Bo Jin, Hongyuan Zha, Baoxiang Wang

Information design (ID) explores how a sender influences the optimal behavior of receivers to achieve specific objectives.

Persuasion Strategies

FPPL: An Efficient and Non-IID Robust Federated Continual Learning Framework

1 code implementation 4 Nov 2024 Yuchen He, Chuyun Shen, Xiangfeng Wang, Bo Jin

In this work, an efficient and non-IID robust federated continual learning framework, called Federated Prototype-Augmented Prompt Learning (FPPL), is proposed.

Continual Learning Contrastive Learning +2

Masked Autoencoders are Parameter-Efficient Federated Continual Learners

1 code implementation 4 Nov 2024 Yuchen He, Xiangfeng Wang

On the server side, it reconstructs the uploaded restore information to capture the data distribution across previous tasks and different clients, using these reconstructed images to fine-tune discriminative prompt and classifier parameters tailored for classification, thereby alleviating catastrophic forgetting and non-IID issues on a global scale.

Continual Learning Federated Learning +1

Interactive 3D Medical Image Segmentation with SAM 2

1 code implementation 5 Aug 2024 Chuyun Shen, Wenhao Li, Yuhang Shi, Xiangfeng Wang

The Segment Anything Model (SAM), though effective for 2D images, requires expensive semi-auto slice-by-slice annotations for 3D medical images.

Image Segmentation Medical Image Segmentation +2

In-Context Former: Lightning-fast Compressing Context for Large Language Model

no code implementations 19 Jun 2024 Xiangfeng Wang, Zaiyi Chen, Zheyong Xie, Tong Xu, Yongyi He, Enhong Chen

With the rising popularity of Transformer-based large language models (LLMs), reducing their high inference costs has become a significant research focus.

Language Modeling Language Modelling +2

Complementary Information Mutual Learning for Multimodality Medical Image Segmentation

no code implementations 5 Jan 2024 Chuyun Shen, Wenhao Li, Haoqing Chen, Xiaoling Wang, Fengping Zhu, Yuxin Li, Xiangfeng Wang, Bo Jin

CIML adopts the idea of addition and removes inter-modal redundant information through inductive bias-driven task decomposition and message passing-based redundancy filtering.

Image Segmentation Inductive Bias +4

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

1 code implementation 6 Dec 2023 Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks?

Benchmarking Decision Making +2

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

no code implementations 8 Jun 2023 Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang

Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and empirically, but there has been a lack of theoretical understanding of RO, let alone designing provably RO-free methods.

Multi-agent Reinforcement Learning

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

no code implementations 18 May 2023 Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha

The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved.

Decision Making Diversity +5

CoLa-Diff: Conditional Latent Diffusion Model for Multi-Modal MRI Synthesis

1 code implementation 24 Mar 2023 Lan Jiang, Ye Mao, Xi Chen, Xiangfeng Wang, Chao Li

Diffusion models have emerged as an effective technique for image synthesis by modelling complex and variable data distributions.

CoLA Image Generation

Boundary-aware Supervoxel-level Iteratively Refined Interactive 3D Image Segmentation with Multi-agent Reinforcement Learning

no code implementations 19 Mar 2023 Chaofan Ma, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Yanfeng Wang, Ya Zhang

Interactive segmentation has recently been explored to effectively and efficiently harvest high-quality segmentation masks by iteratively incorporating user hints.

Image Segmentation Interactive Segmentation +6

Learning Roles with Emergent Social Value Orientations

no code implementations 31 Jan 2023 Wenhao Li, Xiangfeng Wang, Bo Jin, Jingyi Lu, Hongyuan Zha

Social dilemmas can be considered situations where individual rationality leads to collective irrationality.

Multi-agent Reinforcement Learning Role Embedding

Decentralized Entropic Optimal Transport for Distributed Distribution Comparison

no code implementations 28 Jan 2023 Xiangfeng Wang, Hongteng Xu, Moyi Yang

Distributed distribution comparison aims to measure the distance between the distributions whose data are scattered across different agents in a distributed system and cannot even be shared directly among the agents.

Domain Adaptation Privacy Preserving

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

no code implementations 29 Nov 2022 Haochuan Cui, Junjie Sheng, Bo Jin, Yiqiu Hu, Li Su, Lei Zhu, Wenli Zhou, Xiangfeng Wang

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences.

Cloud Computing Scheduling

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning

no code implementations 21 Nov 2022 Junjie Sheng, Lu Wang, Fangkai Yang, Bo Qiao, Hang Dong, Xiangfeng Wang, Bo Jin, Jun Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

To address these two limitations, this paper formulates the oversubscription for cloud as a chance-constrained optimization problem and proposes an effective Chance Constrained Multi-Agent Reinforcement Learning (C2MARL) method to solve this problem.

Multi-agent Reinforcement Learning reinforcement-learning +2

Obtaining Dyadic Fairness by Optimal Transport

1 code implementation 9 Feb 2022 Moyi Yang, Junjie Sheng, Xiangfeng Wang, Wenyan Liu, Bo Jin, Jun Wang, Hongyuan Zha

Fairness is regarded as a critical metric for machine learning models and as an important component of trustworthy machine learning.

Fairness Link Prediction

Multi-Agent Path Finding with Prioritized Communication Learning

1 code implementation 8 Feb 2022 Wenhao Li, Hongjun Chen, Bo Jin, Wenzhe Tan, Hongyuan Zha, Xiangfeng Wang

The learning-based, fully decentralized framework has been introduced to alleviate real-time problems and simultaneously pursue optimal planning policy.

Multi-Agent Path Finding Multi-agent Reinforcement Learning +1

VMAgent: Scheduling Simulator for Reinforcement Learning

2 code implementations 9 Dec 2021 Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.

Cloud Computing reinforcement-learning +3

Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration

no code implementations 15 Nov 2021 Wenhao Li, Qisen Xu, Chuyun Shen, Bin Hu, Fengping Zhu, Yuxin Li, Bo Jin, Xiangfeng Wang

Based on the confidential information, a self-adaptive reward function is designed to provide more detailed feedback, and a simulated label generation mechanism is proposed on unsupervised data to reduce over-reliance on labeled data.

Image Segmentation Interactive Segmentation +4

Dealing with Non-Stationarity in MARL via Trust-Region Decomposition

no code implementations ICLR 2022 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Hongyuan Zha

In this paper, we introduce a novel notion, the $\delta$-measurement, to explicitly measure the non-stationarity of a policy sequence, which can be further proved to be bounded by the KL-divergence of consecutive joint policies.

Multi-agent Reinforcement Learning

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

no code implementations 9 Feb 2021 Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha

In order to improve the efficiency of cooperation and exploration, we propose a structured diversification emergence MARL framework named Rochico based on reinforced organization control and hierarchical consensus learning.

Multi-agent Reinforcement Learning

Fair Differential Privacy Can Mitigate the Disparate Impact on Model Accuracy

no code implementations 1 Jan 2021 Wenyan Liu, Xiangfeng Wang, Xingjian Lu, Junhong Cheng, Bo Jin, Xiaoling Wang, Hongyuan Zha

This paper proposes a fair differential privacy algorithm (FairDP) to mitigate the disparate impact on model accuracy for each class.

Fairness

FDA3: Federated Defense Against Adversarial Attacks for Cloud-Based IIoT Applications

no code implementations 28 Jun 2020 Yunfei Song, Tian Liu, Tongquan Wei, Xiangfeng Wang, Zhe Tao, Mingsong Chen

Along with the proliferation of Artificial Intelligence (AI) and Internet of Things (IoT) techniques, various kinds of adversarial attacks are increasingly emerging to fool Deep Neural Networks (DNNs) used by Industrial IoT (IIoT) applications.

Federated Learning

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

no code implementations 17 Apr 2020 Wenhao Li, Bo Jin, Xiangfeng Wang, Junchi Yan, Hongyuan Zha

Traditional centralized multi-agent reinforcement learning (MARL) algorithms are sometimes impractical in complicated applications, due to non-interactivity between agents, the curse of dimensionality, and computational complexity.

Multi-agent Reinforcement Learning Reinforcement Learning +3

HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem

no code implementations 11 Feb 2020 Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha

In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward.

Meta-Learning Meta Reinforcement Learning +3

Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning

no code implementations CVPR 2020 Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Ya Zhang, Yan-Feng Wang

We here propose to model the dynamic process of iterative interactive image segmentation as a Markov decision process (MDP) and solve it with reinforcement learning (RL).

Image Segmentation Medical Image Segmentation +6

Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

no code implementations 20 Nov 2019 Jun-Jie Wang, Xiangfeng Wang, Bo Jin, Junchi Yan, Wenjie Zhang, Hongyuan Zha

To this end, we propose a novel heterogeneous graph-based knowledge transfer method (HGKT) for GZSL, agnostic to unseen classes and instances, by leveraging graph neural network.

Generalized Zero-Shot Learning Graph Neural Network +1

A Fast Proximal Point Method for Computing Exact Wasserstein Distance

1 code implementation 12 Feb 2018 Yujia Xie, Xiangfeng Wang, Ruijia Wang, Hongyuan Zha

However, as we will demonstrate, regularized variations with a large regularization parameter will degrade the performance in several important machine learning applications, and a small regularization parameter will fail due to numerical stability issues with existing algorithms.

BIG-bench Machine Learning

Deep Extreme Multi-label Learning

1 code implementation 12 Apr 2017 Wenjie Zhang, Junchi Yan, Xiangfeng Wang, Hongyuan Zha

Extreme multi-label learning (XML) or classification has been a practical and important problem since the boom of big data.

Classification Extreme Multi-Label Classification +3

Asynchronous Distributed ADMM for Large-Scale Optimization- Part I: Algorithm and Convergence Analysis

no code implementations 9 Sep 2015 Tsung-Hui Chang, Mingyi Hong, Wei-Cheng Liao, Xiangfeng Wang

By formulating the learning problem as a consensus problem, the ADMM can be used to solve the consensus problem in a fully parallel fashion over a computer network with a star topology.

Distributed Optimization

Asynchronous Distributed ADMM for Large-Scale Optimization- Part II: Linear Convergence Analysis and Numerical Performance

no code implementations 9 Sep 2015 Tsung-Hui Chang, Wei-Cheng Liao, Mingyi Hong, Xiangfeng Wang

Unfortunately, a direct synchronous implementation of such algorithm does not scale well with the problem size, as the algorithm speed is limited by the slowest computing nodes.

Joint Active Learning with Feature Selection via CUR Matrix Decomposition

no code implementations 4 Mar 2015 Changsheng Li, Xiangfeng Wang, Weishan Dong, Junchi Yan, Qingshan Liu, Hongyuan Zha

In particular, our method runs in one-shot without the procedure of iterative sample selection for progressive labeling.

Active Learning feature selection

Dynamic Structure Embedded Online Multiple-Output Regression for Stream Data

no code implementations 18 Dec 2014 Changsheng Li, Fan Wei, Weishan Dong, Qingshan Liu, Xiangfeng Wang, Xin Zhang

MORES can dynamically learn the structure of the coefficients change in each update step to facilitate the model's continuous refinement.

regression
