no code implementations • 23 Aug 2024 • Ruiyang Xu, Jialun Cao, Yaojie Lu, Hongyu Lin, Xianpei Han, Ben He, Shing-Chi Cheung, Le Sun
However, there is an unignorable programming language bias in existing code benchmarks -- over 95% code generation benchmarks are dominated by Python, leaving the LLMs' capabilities in other programming languages such as Java and C/C++ unknown.
1 code implementation • 6 Dec 2023 • Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu
Reinforcement learning (RL) is a versatile framework for optimizing long-term goals.
no code implementations • 23 May 2023 • Ruiyang Xu, Jalaj Bhandari, Dmytro Korenkevych, Fan Liu, Yuchen He, Alex Nikulkov, Zheqing Zhu
Auction-based recommender systems are prevalent in online advertising platforms, but they are typically optimized to allocate recommendation slots based on immediate expected return metrics, neglecting the downstream effects of recommendations on user behavior.
1 code implementation • 13 Mar 2023 • Yisheng Xiao, Ruiyang Xu, Lijun Wu, Juntao Li, Tao Qin, Yan-Tie Liu, Min Zhang
Experiments on \textbf{3} different tasks (neural machine translation, summarization, and code generation) with \textbf{15} datasets in total confirm that our proposed simple method achieves significant performance improvement over the strong CMLM model.
no code implementations • 25 Feb 2023 • Ruiyang Xu, Di wu, Xin Luo
Traditional feature selections need to know the feature space before learning, and online streaming feature selection (OSFS) is proposed to process streaming features on the fly.
no code implementations • 10 Dec 2021 • Ruiyang Xu, Zhengxing Chen
Reinforcement learning (RL) has gained increasing attraction in the academia and tech industry with launches to a variety of impactful applications and products.
no code implementations • 25 Jun 2021 • Ruiyang Xu, Ayush Singh
Natural interface to database (NLIDB) has been researched a lot during the past decades.
no code implementations • 21 Mar 2021 • Prashank Kadam, Ruiyang Xu, Karl Lieberherr
This technique is applicable to any MCTS based algorithm to reduce the number of updates to the tree.
no code implementations • 17 Jan 2021 • Ruiyang Xu, Karl Lieberherr
After training, an off-the-shelf QSAT solver is used to evaluate the performance of the algorithm.
no code implementations • 11 Jan 2021 • Ruiyang Xu, Prashank Kadam, Karl Lieberherr
We propose a general framework, Persephone, to map the FOL description of a combinatorial problem to a semantic game so that it can be solved through a neural MCTS based reinforcement learning algorithm.
no code implementations • 8 Mar 2019 • Ruiyang Xu, Karl Lieberherr
Recent progress in reinforcement learning (RL) using self-game-play has shown remarkable performance on several board games (e. g., Chess and Go) as well as video games (e. g., Atari games and Dota2).