Search Results for author: Yewen Li

Found 9 papers, 5 papers with code

Generative Auto-Bidding with Value-Guided Explorations

no code implementations20 Apr 2025 Jingtong Gao, Yewen Li, Shuai Mao, Nan Jiang, Yejing Wang, Qingpeng Cai, Fei Pan, Peng Jiang, Kun Gai, Bo An, Xiangyu Zhao

Auto-bidding, with its strong capability to optimize bidding decisions within dynamic and competitive online environments, has become a pivotal strategy for advertising platforms.

Reinforcement Learning (RL)

GAS: Generative Auto-bidding with Post-training Search

no code implementations22 Dec 2024 Yewen Li, Shuai Mao, Jingtong Gao, Nan Jiang, Yunjian Xu, Qingpeng Cai, Fei Pan, Peng Jiang, Bo An

We use weak-to-strong search alignment by training small critics for different preferences and an MCTS-inspired search to refine the model's output.

Computational Efficiency Sequential Decision Making

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

no code implementations5 Sep 2024 Yewen Li, Chaojie Wang, Xiaobo Xia, Xu He, Ruyi An, Dong Li, Tongliang Liu, Bo An, Xinrun Wang

Therefore, we appeal for more attention to incremental effectiveness on likelihood, i. e., whether a method could always surpass or at least match the performance of likelihood in U-OOD detection.

Out-of-Distribution Detection

Cradle: Empowering Foundation Agents Towards General Computer Control

1 code implementation5 Mar 2024 Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson, Bo An, Shuicheng Yan, Zongqing Lu

To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through the most unified and standardized interface, i. e., using screenshots as input and keyboard and mouse actions as output.

Efficient Exploration

Improving Unsupervised Hierarchical Representation with Reinforcement Learning

1 code implementation CVPR 2024 Ruyi An, Yewen Li, Xu He, Pengjie Gu, Mengchen Zhao, Dong Li, Jianye Hao, Chaojie Wang, Bo An, Mingyuan Zhou

To address this issue we first analyze the shortcomings of existing methods for mitigating the "posterior collapse" from an information theory perspective then highlight the necessity of regularization for explicitly propagating data information to higher-level latent variables while maintaining the dependency between different levels.

reinforcement-learning Reinforcement Learning +1

Pluralistic Image Completion with Probabilistic Mixture-of-Experts

no code implementations18 May 2022 Xiaobo Xia, Wenhao Yang, Jie Ren, Yewen Li, Yibing Zhan, Bo Han, Tongliang Liu

Second, the constraints for diversity are designed to be task-agnostic, which causes the constraints to not work well.

Diversity Mixture-of-Experts

Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network

1 code implementation30 Jun 2021 Zhibin Duan, Dongsheng Wang, Bo Chen, Chaojie Wang, Wenchao Chen, Yewen Li, Jie Ren, Mingyuan Zhou

However, they often assume in the prior that the topics at each layer are independently drawn from the Dirichlet distribution, ignoring the dependencies between the topics both at the same layer and across different layers.

Topic Models Variational Inference

Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning

1 code implementation19 Apr 2021 Jie Ren, Yewen Li, Zihan Ding, Wei Pan, Hao Dong

However, grasping distinguishable skills for some tasks with non-unique optima can be essential for further improving its learning efficiency and performance, which may lead to a multimodal policy represented as a mixture-of-experts (MOE).

Deep Reinforcement Learning Mixture-of-Experts +3

Cannot find the paper you are looking for? You can Submit a new open access paper.