Search Results for author: Shenzhi Wang

Found 7 papers, 1 papers with code

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

no code implementations • 29 May 2024 • Andrew Zhao, Quentin Xu, Matthieu Lin, Shenzhi Wang, Yong-Jin Liu, Zilong Zheng, Gao Huang

Recent advances in large language models (LLMs) have made them indispensable, raising significant concerns over managing their safety.

Paper
Add Code

LLM Agents for Psychology: A Study on Gamified Assessments

no code implementations • 19 Feb 2024 • Qisen Yang, Zekun Wang, Honghui Chen, Shenzhi Wang, Yifan Pu, Xin Gao, Wenhao Huang, Shiji Song, Gao Huang

Psychological measurement is essential for mental health, self-understanding, and personal development.

Paper
Add Code

Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

1 code implementation • NeurIPS 2023 • Shenzhi Wang, Qisen Yang, Jiawei Gao, Matthieu Gaetan Lin, Hao Chen, Liwei Wu, Ning Jia, Shiji Song, Gao Huang

Existing solutions tackle this problem by imposing a policy constraint on the policy improvement objective in both offline and online learning.

D4RL Reinforcement Learning (RL)

Paper
Code

Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation

no code implementations • 2 Oct 2023 • Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang

This study utilizes the intricate Avalon game as a testbed to explore LLMs' potential in deceptive environments.

Misinformation

Paper
Add Code

Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance

no code implementations • 4 Sep 2023 • Qisen Yang, Shenzhi Wang, Qihang Zhang, Gao Huang, Shiji Song

Offline reinforcement learning (RL) optimizes the policy on a previously collected dataset without any interactions with the environment, yet usually suffers from the distributional shift problem.

Offline RL reinforcement-learning +1

Paper
Add Code

Boosting Offline Reinforcement Learning with Action Preference Query

no code implementations • 6 Jun 2023 • Qisen Yang, Shenzhi Wang, Matthieu Gaetan Lin, Shiji Song, Gao Huang

In particular, online fine-tuning has become a commonly used method to correct the erroneous estimates of out-of-distribution data learned in the offline training phase.

Autonomous Driving D4RL +2

Paper
Add Code

Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison

no code implementations • CVPR 2021 • Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen

More concretely, we employ a Local-Net and Global-Net to extract features from any individual patch and its surrounding respectively.

Anomaly Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.