Search Results for author: Rui Lu

Found 11 papers, 5 papers with code

Augmenting Unsupervised Reinforcement Learning with Self-Reference

no code implementations16 Nov 2023 Andrew Zhao, Erle Zhu, Rui Lu, Matthieu Lin, Yong-Jin Liu, Gao Huang

Our approach achieves state-of-the-art results in terms of Interquartile Mean (IQM) performance and Optimality Gap reduction on the Unsupervised Reinforcement Learning Benchmark for model-free methods, recording an 86% IQM and a 16% Optimality Gap.

Attribute reinforcement-learning +1

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

no code implementations29 Oct 2023 Nan He, Hanyu Lai, Chenyang Zhao, Zirui Cheng, Junting Pan, Ruoyu Qin, Ruofan Lu, Rui Lu, Yunchen Zhang, Gangming Zhao, Zhaohui Hou, Zhiyuan Huang, Shaoqing Lu, Ding Liang, Mingjie Zhan

Based on TeacherLM-7. 1B, we augmented 58 NLP datasets and taught various student models with different parameters from OPT and BLOOM series in a multi-task setting.

Data Augmentation Language Modelling

AgentTuning: Enabling Generalized Agent Abilities for LLMs

1 code implementation19 Oct 2023 Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang

Though many prompting methods have been proposed to complete particular agent tasks, there is lack of research focusing on improving the agent capabilities of LLMs themselves without compromising their general abilities.

Memorization

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

2 code implementations NeurIPS 2023 Yang Yue, Rui Lu, Bingyi Kang, Shiji Song, Gao Huang

We first identify a fundamental pattern, self-excitation, as the primary cause of Q-value estimation divergence in offline RL.

Attribute Offline RL

Provable General Function Class Representation Learning in Multitask Bandits and MDPs

no code implementations31 May 2022 Rui Lu, Andrew Zhao, Simon S. Du, Gao Huang

While multitask representation learning has become a popular approach in reinforcement learning (RL) to boost the sample efficiency, the theoretical understanding of why and how it works is still limited.

Multi-Armed Bandits Reinforcement Learning (RL) +1

On the Integration of Self-Attention and Convolution

2 code implementations CVPR 2022 Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang

In this paper, we show that there exists a strong underlying relation between them, in the sense that the bulk of computations of these two paradigms are in fact done with the same operation.

Representation Learning

On the Power of Multitask Representation Learning in Linear MDP

no code implementations15 Jun 2021 Rui Lu, Gao Huang, Simon S. Du

We first discover a \emph{Least-Activated-Feature-Abundance} (LAFA) criterion, denoted as $\kappa$, with which we prove that a straightforward least-square algorithm learns a policy which is $\tilde{O}(H^2\sqrt{\frac{\mathcal{C}(\Phi)^2 \kappa d}{NT}+\frac{\kappa d}{n}})$ sub-optimal.

Reinforcement Learning (RL) Representation Learning

A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

no code implementations20 Aug 2020 Liyi Guo, Rui Lu, Haoqi Zhang, Junqi Jin, Zhenzhe Zheng, Fan Wu, Jin Li, Haiyang Xu, Han Li, Wenkai Lu, Jian Xu, Kun Gai

For e-commerce platforms such as Taobao and Amazon, advertisers play an important role in the entire digital ecosystem: their behaviors explicitly influence users' browsing and shopping experience; more importantly, advertiser's expenditure on advertising constitutes a primary source of platform revenue.

Marketing

Occlusion-shared and Feature-separated Network for Occlusion Relationship Reasoning

1 code implementation ICCV 2019 Rui Lu, Feng Xue, Menghan Zhou, Anlong Ming, Yu Zhou

On one hand, considering the relevance between edge and orientation, two sub-networks are designed to share the occlusion cue.

Context-Constrained Accurate Contour Extraction for Occlusion Edge Detection

no code implementations21 Mar 2019 Rui Lu, Menghan Zhou, Anlong Ming, Yu Zhou

Occlusion edge detection requires both accurate locations and context constraints of the contour.

Edge Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.