Search Results for author: Mengyue Yang

Found 21 papers, 6 papers with code

Beyond Prior Limits: Addressing Distribution Misalignment in Particle Filtering

no code implementations30 Jan 2025 Yiwei Shi, Jingyu Hu, Yu Zhang, Mengyue Yang, Weinan Zhang, Cunjia Liu, Weiru Liu

Particle filtering is a Bayesian inference method and a fundamental tool in state estimation for dynamic systems, but its effectiveness is often limited by the constraints of the initial prior distribution, a phenomenon we define as the Prior Boundary Phenomenon.

Bayesian Inference

Attention-Driven Hierarchical Reinforcement Learning with Particle Filtering for Source Localization in Dynamic Fields

no code implementations22 Jan 2025 Yiwei Shi, Mengyue Yang, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu

In many real-world scenarios, such as gas leak detection or environmental pollutant tracking, solving the Inverse Source Localization and Characterization problem involves navigating complex, dynamic fields with sparse and noisy observations.

Bayesian Inference Computational Efficiency +4

Natural Language Reinforcement Learning

1 code implementation21 Nov 2024 Xidong Feng, Ziyu Wan, Haotian Fu, Bo Liu, Mengyue Yang, Girish A. Koushik, Zhiyuan Hu, Ying Wen, Jun Wang

Reinforcement Learning (RL) mathematically formulates decision-making with Markov Decision Process (MDP).

Decision Making reinforcement-learning +2

Causal Representation Learning from Multimodal Biomedical Observations

no code implementations10 Nov 2024 Yuewen Sun, Lingjing Kong, Guangyi Chen, Loka Li, Gongxu Luo, Zijian Li, Yixuan Zhang, Yujia Zheng, Mengyue Yang, Petar Stojanov, Eran Segal, Eric P. Xing, Kun Zhang

Theoretically, we consider a nonparametric latent distribution (c. f., parametric assumptions in previous work) that allows for causal relationships across potentially different modalities.

Representation Learning

Efficient Reinforcement Learning with Large Language Model Priors

no code implementations10 Oct 2024 Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou Ammar, Jun Wang

In sequential decision-making (SDM) tasks, methods like reinforcement learning (RL) and heuristic search have made notable advances in specific cases.

Bayesian Inference Decision Making +9

Seeking the Sufficiency and Necessity Causal Features in Multimodal Representation Learning

no code implementations29 Aug 2024 BoYu Chen, Junjie Liu, Zhu Li, Mengyue Yang

We address these challenges by first conceptualizing multimodal representations as comprising modality-invariant and modality-specific components.

Representation Learning

Attaining Human`s Desirable Outcomes in Human-AI Interaction via Structural Causal Games

no code implementations26 May 2024 Anjie Liu, Jianhong Wang, Haoxuan Li, Xu Chen, Jun Wang, Samuel Kaski, Mengyue Yang

In human-AI interaction, a prominent goal is to attain human`s desirable outcome with the assistance of AI agents, which can be ideally delineated as a problem of seeking the optimal Nash Equilibrium that matches the human`s desirable outcome.

Natural Language Reinforcement Learning

no code implementations11 Feb 2024 Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang

Reinforcement Learning (RL) has shown remarkable abilities in learning policies for decision-making tasks.

Decision Making reinforcement-learning +2

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

no code implementations23 Jan 2024 Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian McAuley

Subsequently, we minimize the mutual information between the observation estimation and the relevance estimation conditioned on the input features.

Learning-To-Rank Recommendation Systems

Specify Robust Causal Representation from Mixed Observations

1 code implementation21 Oct 2023 Mengyue Yang, Xinyu Cai, Furui Liu, Weinan Zhang, Jun Wang

Under the hypothesis that the intrinsic latent factors follow some casual generative models, we argue that by learning a causal representation, which is the minimal sufficient causes of the whole system, we can improve the robustness and generalization performance of machine learning models.

Invariant Learning via Probability of Sufficient and Necessary Causes

1 code implementation NeurIPS 2023 Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

To capture the information of sufficient and necessary causes, we employ a classical concept, the probability of sufficiency and necessary causes (PNS), which indicates the probability of whether one is the necessary and sufficient cause.

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank

no code implementations5 Aug 2023 Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang

Notice that these ranking metrics do not consider the effects of the contextual dependence among the items in the list, we design a new family of simulation-based ranking metrics, where existing metrics can be regarded as special cases.

Learning-To-Rank

ChessGPT: Bridging Policy Learning and Language Modeling

1 code implementation NeurIPS 2023 Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Thus, we propose ChessGPT, a GPT model bridging policy learning and language modeling by integrating data from these two sources in Chess games.

Decision Making Language Modeling +1

Generalizable Information Theoretic Causal Representation

no code implementations17 Feb 2022 Mengyue Yang, Xinyu Cai, Furui Liu, Xu Chen, Zhitang Chen, Jianye Hao, Jun Wang

It is evidence that representation learning can improve model's performance over multiple downstream tasks in many real-world scenarios, such as image classification and recommender systems.

counterfactual Image Classification +2

Debiased Recommendation with User Feature Balancing

no code implementations16 Jan 2022 Mengyue Yang, Guohao Cai, Furui Liu, Zhenhua Dong, Xiuqiang He, Jianye Hao, Jun Wang, Xu Chen

To alleviate these problems, in this paper, we propose a novel debiased recommendation framework based on user feature balancing.

Causal Inference Recommendation Systems

Informative Robust Causal Representation for Generalizable Deep Learning

no code implementations29 Sep 2021 Mengyue Yang, Furui Liu, Xu Chen, Zhitang Chen, Jianye Hao, Jun Wang

In many real-world scenarios, such as image classification and recommender systems, it is evidence that representation learning can improve model's performance over multiple downstream tasks.

counterfactual Deep Learning +3

Top-N Recommendation with Counterfactual User Preference Simulation

no code implementations2 Sep 2021 Mengyue Yang, Quanyu Dai, Zhenhua Dong, Xu Chen, Xiuqiang He, Jun Wang

To alleviate this problem, in this paper, we propose to reformulate the recommendation task within the causal inference framework, which enables us to counterfactually simulate user ranking-based preferences to handle the data scarce problem.

Causal Inference counterfactual +1

Causal World Models by Unsupervised Deconfounding of Physical Dynamics

no code implementations28 Dec 2020 Minne Li, Mengyue Yang, Furui Liu, Xu Chen, Zhitang Chen, Jun Wang

The capability of imagining internally with a mental model of the world is vitally important for human cognition.

counterfactual

CausalVAE: Structured Causal Disentanglement in Variational Autoencoder

2 code implementations CVPR 2021 Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang

Learning disentanglement aims at finding a low dimensional representation which consists of multiple explanatory and generative factors of the observational data.

counterfactual Disentanglement

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

1 code implementation2 Apr 2020 Mengyue Yang, Qingyang Li, Zhiwei Qin, Jieping Ye

In this paper, we propose a hierarchical adaptive contextual bandit method (HATCH) to conduct the policy learning of contextual bandits with a budget constraint.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.