no code implementations • 15 Aug 2023 • Bowei He, Xu He, Renrui Zhang, Yingxue Zhang, Ruiming Tang, Chen Ma
The high-throughput data requires the model to be updated in a timely manner for capturing the user interest dynamics, which leads to the emergence of streaming recommender systems.
1 code implementation • 21 Mar 2023 • Bowei He, Xu He, Yingxue Zhang, Ruiming Tang, Chen Ma
Personalized recommender systems have been widely studied and deployed to reduce information overload and satisfy users' diverse needs.
1 code implementation • IEEE Access 2022 • Zhangli Lan, Songbai Cai, Xu He, Xinpeng Wen
The group convolution was used to avoid model underfitting in the capsule layer.
Ranked #1 on
Lesion Classification
on HAM10000
1 code implementation • 27 Dec 2021 • Xiaofeng Pan, Yibin Shen, Jing Zhang, Xu He, Yang Huang, Hong Wen, Chengjun Mao, Bo Cao
In this paper, we propose a novel CTR model named MOEF for recommendations under frequent changes of occasions.
no code implementations • 15 Dec 2021 • Shuo Sun, Wanqi Xue, Rundong Wang, Xu He, Junlei Zhu, Jian Li, Bo An
Reinforcement learning (RL) techniques have shown great success in many challenging quantitative trading tasks, such as portfolio management and algorithmic trading.
no code implementations • NeurIPS 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Rundong Wang, Xu He, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).
Multi-agent Reinforcement Learning
reinforcement-learning
+3
no code implementations • 29 Sep 2021 • Zhuoyi Lin, Biao Ye, Xu He, Shuo Sun, Rundong Wang, Rui Yin, Xu Chi, Chee Keong Kwoh
A machine learning system is typically composed of model and data.
no code implementations • 16 Feb 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).
Multi-agent Reinforcement Learning
reinforcement-learning
+3
no code implementations • 1 Jan 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Centralized training with decentralized execution (CTDE) has become an important paradigm in multi-agent reinforcement learning (MARL).
Multi-agent Reinforcement Learning
reinforcement-learning
+3
no code implementations • 22 Dec 2020 • Runsheng Yu, Yu Gong, Xu He, Bo An, Yu Zhu, Qingwen Liu, Wenwu Ou
Recently, many existing studies regard the cold-start personalized preference prediction as a few-shot learning problem, where each user is the task and recommended items are the classes, and the gradient-based meta learning method (MAML) is leveraged to address this challenge.
1 code implementation • 3 Sep 2020 • Xu He, Nicole Helbig, Matthieu J. Verstraete, Eric Bousquet
We present TB2J, a Python package for the automatic computation of magnetic interactions, including exchange and Dzyaloshinskii-Moriya interactions, between atoms of magnetic crystals from the results of density functional calculations.
Materials Science
no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang
First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.
no code implementations • 21 Aug 2020 • Xu He, Bo An, Yanghua Li, Haikai Chen, Rundong Wang, Xinrun Wang, Runsheng Yu, Xin Li, Zhirong Wang
Thus, the global policy of the whole page could be sub-optimal.
Multi-agent Reinforcement Learning
Reinforcement Learning (RL)
no code implementations • ICML Workshop LifelongML 2020 • Xu He, Min Lin
We compare these approaches in terms of both compression and forgetting and empirically study the reasons that limit the performance of continual learning methods based on variational posterior approximation.
1 code implementation • 7 Jun 2020 • Xu He, Haipeng Chen, Bo An
However, previous works rarely consider the uncertainty when humans provide feedback, especially in cases that the optimal actions are not obvious to the trainers.
no code implementations • 23 May 2020 • Donghao Xu, Zhezhang Ding, Xu He, Huijing Zhao, Mathieu Moze, François Aioun, Franck Guillemard
In this study, a method of learning cost parameters of a motion planner from naturalistic driving data is proposed.
no code implementations • ICML 2020 • Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich
Under the limited bandwidth constraint, a communication protocol is required to generate informative messages.
1 code implementation • 26 Jun 2019 • Uthpala Herath, Pedram Tavadze, Xu He, Eric Bousquet, Sobhit Singh, Francisco Muñoz, Aldo H. Romero
A file with a specific property evaluated for each $k$-point in a $k-$mesh and for each band can be used to project other properties such as electron-phonon mean path, Fermi velocity, electron effective mass, etc.
Materials Science
no code implementations • ICML Workshop LifelongML 2020 • Xu He, Jakub Sygnowski, Alexandre Galashov, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu
One particular formalism that studies learning under non-stationary distribution is provided by continual learning, where the non-stationarity is imposed by a sequence of distinct tasks.
1 code implementation • 7 Oct 2018 • Fei Wang, Jinsong Han, Shiyuan Zhang, Xu He, Dong Huang
We build CSI-Net, a unified Deep Neural Network~(DNN), to learn the representation of WiFi signals.
no code implementations • ICLR 2018 • Xu He, Herbert Jaeger
Catastrophic interference has been a major roadblock in the research of continual learning.
no code implementations • 16 Jul 2017 • Xu He, Herbert Jaeger
Catastrophic interference has been a major roadblock in the research of continual learning.