Search Results for author: Jiyan Jiang

Found 6 papers, 2 papers with code

Multi-Objective Online Learning

no code implementations29 Sep 2021 Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Lihong Gu, Xiaodong Zeng, Wenwu Zhu

This paper presents a systematic study of multi-objective online learning.

Asynchronous Decentralized Online Learning

no code implementations NeurIPS 2021 Jiyan Jiang, Wenpeng Zhang, Jinjie Gu, Wenwu Zhu

To overcome this problem, we study decentralized online learning in the asynchronous setting, which allows different learners to work at their own pace.

Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems

no code implementations25 Aug 2023 Tianchi Cai, Shenliao Bao, Jiyan Jiang, Shiji Zhou, Wenpeng Zhang, Lihong Gu, Jinjie Gu, Guannan Zhang

Model-free RL-based recommender systems have recently received increasing research attention due to their capability to handle partial feedback and long-term rewards.

Recommendation Systems reinforcement-learning

Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

no code implementations6 Sep 2023 Tianchi Cai, Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Xierui Song, Li Yu, Lihong Gu, Xiaodong Zeng, Jinjie Gu, Guannan Zhang

We further show that this method is guaranteed to converge to the optimal policy, which cannot be achieved by previous value-based reinforcement learning methods for marketing budget allocation.

Marketing reinforcement-learning

ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference

1 code implementation5 Dec 2023 Tianchi Cai, Xierui Song, Jiyan Jiang, Fei Teng, Jinjie Gu, Guannan Zhang

Aligning language models to human expectations, e. g., being helpful and harmless, has become a pressing challenge for large language models.

Language Modelling Large Language Model

Cannot find the paper you are looking for? You can Submit a new open access paper.