Search Results for author: Sirui Chen

Found 9 papers, 4 papers with code

UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems

no code implementations17 Jan 2024 Changshuo Zhang, Sirui Chen, Xiao Zhang, Sunhao Dai, Weijie Yu, Jun Xu

Reinforcement learning (RL) has gained traction for enhancing user long-term experiences in recommender systems by effectively exploring users' interests.

Fairness Recommendation Systems +1

Controllable Multi-Objective Re-ranking with Policy Hypernetworks

1 code implementation8 Jun 2023 Sirui Chen, YuAn Wang, Zijing Wen, Zhiyu Li, Changshuo Zhang, Xiao Zhang, Quan Lin, Cheng Zhu, Jun Xu

In this paper, we propose a framework called controllable multi-objective re-ranking (CMR) which incorporates a hypernetwork to generate parameters for a re-ranking model according to different preference weights.

Recommendation Systems Re-Ranking

STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning

1 code implementation15 Apr 2023 Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du

It first decomposes the global return back to each time step, then utilizes the Shapley Value to redistribute the individual payoff from the decomposed global reward.

Multi-agent Reinforcement Learning reinforcement-learning

P-MMF: Provider Max-min Fairness Re-ranking in Recommender System

1 code implementation12 Mar 2023 Chen Xu, Sirui Chen, Jun Xu, Weiran Shen, Xiao Zhang, Gang Wang, Zhenghua Dong

In this paper, we proposed an online re-ranking model named Provider Max-min Fairness Re-ranking (P-MMF) to tackle the problem.

Fairness Recommendation Systems +1

CEP3: Community Event Prediction with Neural Point Process on Graph

no code implementations21 May 2022 Xuhong Wang, Sirui Chen, Yixuan He, Minjie Wang, Quan Gan, Yupu Yang, Junchi Yan

Many real world applications can be formulated as event forecasting on Continuous Time Dynamic Graphs (CTDGs) where the occurrence of a timed event between two entities is represented as an edge along with its occurrence timestamp in the graphs. However, most previous works approach the problem in compromised settings, either formulating it as a link prediction task on the graph given the event time or a time prediction problem given which event will happen next.

Link Prediction

Reinforcement Re-ranking with 2D Grid-based Recommendation Panels

no code implementations11 Apr 2022 Sirui Chen, Xiao Zhang, Xu Chen, Zhiyu Li, YuAn Wang, Quan Lin, Jun Xu

Then, it defines \emph{the MDP discrete time steps as the ranks in the initial ranking list, and the actions as the prediction of the user-item preference and the selection of the slots}.

Recommendation Systems Re-Ranking

DiffSRL: Learning Dynamical State Representation for Deformable Object Manipulation with Differentiable Simulator

1 code implementation24 Oct 2021 Sirui Chen, Yunhao Liu, Jialong Li, Shang Wen Yao, Tingxiang Fan, Jia Pan

We propose DiffSRL, a dynamic state representation learning pipeline utilizing differentiable simulation that can embed complex dynamics models as part of the end-to-end training.

Deformable Object Manipulation Motion Planning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.