Search Results for author: Wenzhe Li

Found 12 papers, 4 papers with code

Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games

no code implementations6 Jun 2024 Jiawei Ge, Yuanhao Wang, Wenzhe Li, Chi Jin

Our experimental results highlight worst-case scenarios where meta-algorithms from prior state-of-the-art systems for multiplayer games fail to secure an equal share, while our algorithm succeeds, demonstrating the effectiveness of our approach.

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

no code implementations4 Jun 2024 Wenzhe Li, Zihan Ding, Seth Karten, Chi Jin

Recent advances in reinforcement learning (RL) heavily rely on a variety of well-designed benchmarks, which provide environmental platforms and consistent criteria to evaluate existing and novel algorithms.

Multi-agent Reinforcement Learning reinforcement-learning +2

A Survey on Transformers in Reinforcement Learning

no code implementations8 Jan 2023 Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye

Transformer has been considered the dominating neural architecture in NLP and CV, mostly under supervised settings.

reinforcement-learning Reinforcement Learning +2

Improving Graph-Based Text Representations with Character and Word Level N-grams

no code implementations12 Oct 2022 Wenzhe Li, Nikolaos Aletras

Graph-based text representation focuses on how text documents are represented as graphs for exploiting dependency information between tokens and documents within a corpus.

Graph Representation Learning text-classification +2

Latent-Variable Advantage-Weighted Policy Optimization for Offline RL

1 code implementation16 Mar 2022 Xi Chen, Ali Ghadirzadeh, Tianhe Yu, Yuan Gao, Jianhao Wang, Wenzhe Li, Bin Liang, Chelsea Finn, Chongjie Zhang

Offline reinforcement learning methods hold the promise of learning policies from pre-collected datasets without the need to query the environment for new transitions.

continuous-control Continuous Control +4

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

1 code implementation ICLR 2022 Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang

In this paper, we revisit the theoretical property of GCSL -- optimizing a lower bound of the goal reaching objective, and extend GCSL as a novel offline goal-conditioned RL algorithm.

Offline RL Reinforcement Learning (RL) +1

Estimating High Order Gradients of the Data Distribution by Denoising

no code implementations NeurIPS 2021 Chenlin Meng, Yang song, Wenzhe Li, Stefano Ermon

By leveraging Tweedie's formula on higher order moments, we generalize denoising score matching to estimate higher order derivatives.

Audio Synthesis Denoising +2

Offline Reinforcement Learning with Reverse Model-based Imagination

1 code implementation NeurIPS 2021 Jianhao Wang, Wenzhe Li, Haozhe Jiang, Guangxiang Zhu, Siyuan Li, Chongjie Zhang

These reverse imaginations provide informed data augmentation for model-free policy learning and enable conservative generalization beyond the offline dataset.

Data Augmentation Offline RL +3

Tractable Computation of Expected Kernels

1 code implementation21 Feb 2021 Wenzhe Li, Zhe Zeng, Antonio Vergari, Guy Van Den Broeck

Computing the expectation of kernel functions is a ubiquitous task in machine learning, with applications from classical support vector machines to exploiting kernel embeddings of distributions in probabilistic modeling, statistical inference, causal discovery, and deep learning.

Causal Discovery

Unifying Local and Global Change Detection in Dynamic Networks

no code implementations9 Oct 2017 Wenzhe Li, Dong Guo, Greg Ver Steeg, Aram Galstyan

Many real-world networks are complex dynamical systems, where both local (e. g., changing node attributes) and global (e. g., changing network topology) processes unfold over time.

Change Detection

Scalable MCMC for Mixed Membership Stochastic Blockmodels

no code implementations16 Oct 2015 Wenzhe Li, Sungjin Ahn, Max Welling

We propose a stochastic gradient Markov chain Monte Carlo (SG-MCMC) algorithm for scalable inference in mixed-membership stochastic blockmodels (MMSB).

Variational Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.