no code implementations • ICML 2020 • Chaosheng Dong, Bo Zeng
Numerical results confirm the effectiveness of our model and the computational efficacy of algorithms.
1 code implementation • 24 May 2024 • Zhe Li, Bicheng Ying, Zidong Liu, Chaosheng Dong, Haibo Yang
This paper proposes a novel dimension-free communication algorithm -- DeComFL, which leverages the zeroth-order optimization techniques and reduces the communication cost from $\mathscr{O}(d)$ to $\mathscr{O}(1)$ by transmitting only a constant number of scalar values between clients and the server in each round, regardless of the dimension $d$ of the model parameters.
no code implementations • 22 Apr 2024 • Yanhui Guo, Shaoyuan Xu, Jinmiao Fu, Jia Liu, Chaosheng Dong, Bryan Wang
This paper introduces \textbf{Q-tuning}, a novel approach for continual prompt tuning that enables the lifelong learning of a pre-trained language model.
no code implementations • 11 Feb 2024 • Chaosheng Dong, Yijia Wang
This paper studies generalized inverse reinforcement learning (GIRL) in Markov decision processes (MDPs), that is, the problem of learning the basic components of an MDP given observed behavior (policy) that might not be optimal.
no code implementations • 8 Nov 2023 • Tianchen Zhou, Jia Liu, Yang Jiao, Chaosheng Dong, Yetian Chen, Yan Gao, Yi Sun
Online learning to rank (ONL2R) is a foundational problem for recommender systems and has received increasing attention in recent years.
no code implementations • NeurIPS 2023 • Haibo Yang, Zhuqing Liu, Jia Liu, Chaosheng Dong, Michinari Momma
In recent years, multi-objective optimization (MOO) emerges as a foundational problem underpinning many multi-agent multi-task learning applications.
no code implementations • 25 Jun 2023 • Yuchen Zhuang, Xin Shen, Yan Zhao, Chaosheng Dong, Ming Wang, Jin Li, Chao Zhang
The detection of the underlying shopping intentions of users based on their historical interactions is a crucial aspect for e-commerce platforms, such as Amazon, to enhance the convenience and efficiency of their customers' shopping experiences.
no code implementations • 19 Jun 2023 • Minghe Zhang, Chaosheng Dong, Jinmiao Fu, Tianchen Zhou, Jia Liang, Jia Liu, Bo Liu, Michinari Momma, Bryan Wang, Yan Gao, Yi Sun
In this paper, we introduce AdaSelection, an adaptive sub-sampling method to identify the most informative sub-samples within each minibatch to speed up the training of large-scale deep learning models without sacrificing model performance.
no code implementations • 7 Jul 2022 • Debabrata Mahapatra, Chaosheng Dong, Yetian Chen, Deqiang Meng, Michinari Momma
Moreover, it formulates multiple goals that may be conflicting yet important to optimize for simultaneously, e. g., in product search, a ranking model can be trained based on product quality and purchase likelihood to increase revenue.
no code implementations • ICLR 2022 • Tianchen Zhou, Jia Liu, Chaosheng Dong, Yi Sun
We show that the delay impacts in both cases can still be upper bounded by an additive penalty on both the regret and total incentive costs.
no code implementations • 19 May 2021 • Tianchen Zhou, Jia Liu, Chaosheng Dong, Jingyuan Deng
In this paper, we investigate a new multi-armed bandit (MAB) online learning model that considers real-world phenomena in many recommender systems: (i) the learning agent cannot pull the arms by itself and thus has to offer rewards to users to incentivize arm-pulling indirectly; and (ii) if users with specific arm preferences are well rewarded, they induce a "self-reinforcing" effect in the sense that they will attract more users of similar arm preferences.
no code implementations • 27 Apr 2021 • Chaosheng Dong, Xiaojie Jin, Weihao Gao, Yijia Wang, Hongyi Zhang, Xiang Wu, Jianchao Yang, Xiaobing Liu
Deep learning models in large-scale machine learning systems are often continuously trained with enormous data from production environments.
no code implementations • 12 Oct 2020 • Chaosheng Dong, Yijia Wang, Bo Zeng
We study the problem of learning the objective functions or constraints of a multiobjective decision making model, based on a set of sequentially arrived decisions.
no code implementations • 4 Oct 2020 • Shi Yu, Haoran Wang, Chaosheng Dong
Our approach allows the learner to continuously estimate real-time risk preferences using concurrent observed portfolios and market price data.
no code implementations • 30 Sep 2020 • Chaosheng Dong, Bo Zeng
To hedge against the uncertainties in the hypothetical DMP, the data, and the parameter space, we investigate in this paper the distributionally robust approach for inverse multiobjective optimization.
no code implementations • NeurIPS 2018 • Chaosheng Dong, Yiran Chen, Bo Zeng
Inverse optimization is a powerful paradigm for learning preferences and restrictions that explain the behavior of a decision maker, based on a set of external signal and the corresponding decision pairs.
no code implementations • 2 Aug 2018 • Chaosheng Dong, Bo Zeng
Given a set of human's decisions that are observed, inverse optimization has been developed and utilized to infer the underlying decision making problem.