Search Results for author: Jialian Li

Found 11 papers, 1 papers with code

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

no code implementations • CVPR 2022 • Jialian Li, Jingyi Zhang, Zhiyong Wang, Siqi Shen, Chenglu Wen, Yuexin Ma, Lan Xu, Jingyi Yu, Cheng Wang

Quantitative and qualitative experiments show that our method outperforms the techniques based only on RGB images.

Ranked #3 on 3D Human Pose Estimation on SLOPER4D (using extra training data)

3D Human Pose Estimation

Paper
Add Code

Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

no code implementations • 13 Mar 2022 • Jialian Li, Tongzheng Ren, Dong Yan, Hang Su, Jun Zhu

Our goal is to identify a near-optimal robust policy for the perturbed testing environment, which introduces additional technical difficulties as we need to simultaneously estimate the training environment uncertainty from samples and find the worst-case perturbation for testing.

Paper
Add Code

Nearly Horizon-Free Offline Reinforcement Learning

no code implementations • NeurIPS 2021 • Tongzheng Ren, Jialian Li, Bo Dai, Simon S. Du, Sujay Sanghavi

To the best of our knowledge, these are the \emph{first} set of nearly horizon-free bounds for episodic time-homogeneous offline tabular MDP and linear MDP with anchor points.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information

no code implementations • ICLR 2020 • Yichi Zhou, Jialian Li, Jun Zhu

Posterior sampling for reinforcement learning (PSRL) is a useful framework for making decisions in an unknown environment.

counterfactual Multi-agent Reinforcement Learning +2

Paper
Add Code

Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information

no code implementations • ICLR 2020 • Yichi Zhou, Tongzheng Ren, Jialian Li, Dong Yan, Jun Zhu

In this paper, we present Lazy-CFR, a CFR algorithm that adopts a lazy update strategy to avoid traversing the whole game tree in each round.

counterfactual

Paper
Add Code

Fast Regularity-Constrained Plane Reconstruction

no code implementations • 20 May 2019 • Yangbin Lin, Jialian Li, Cheng Wang, Zhonggui Chen, Zongyue Wang, Jonathan Li

Man-made environments typically comprise planar structures that exhibit numerous geometric relationships, such as parallelism, coplanarity, and orthogonality.

Paper
Add Code

Sample-efficient policy learning in multi-agent Reinforcement Learning via meta-learning

no code implementations • ICLR 2019 • Jialian Li, Hang Su, Jun Zhu

We can solve these tasks by first building models for other agents and then finding the optimal policy with these models.

Meta-Learning Multi-agent Reinforcement Learning +2

Paper
Add Code

Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

no code implementations • 10 Oct 2018 • Yichi Zhou, Tongzheng Ren, Jialian Li, Dong Yan, Jun Zhu

In this paper, we present a novel technique, lazy update, which can avoid traversing the whole game tree in CFR, as well as a novel analysis on the regret of CFR with lazy update.

counterfactual

Paper
Add Code

Identify the Nash Equilibrium in Static Games with Random Payoffs

no code implementations • ICML 2017 • Yichi Zhou, Jialian Li, Jun Zhu

We study the problem on how to learn the pure Nash Equilibrium of a two-player zero-sum static game with random payoffs under unknown distributions via efficient payoff queries.

Paper
Add Code

The YouTube-8M Kaggle Competition: Challenges and Methods

1 code implementation • 28 Jun 2017 • Haosheng Zou, Kun Xu, Jialian Li, Jun Zhu

We took part in the YouTube-8M Video Understanding Challenge hosted on Kaggle, and achieved the 10th place within less than one month's time.

General Classification Video Classification +1

Paper
Code

Conditional Generative Moment-Matching Networks

no code implementations • NeurIPS 2016 • Yong Ren, Jialian Li, Yucen Luo, Jun Zhu

Maximum mean discrepancy (MMD) has been successfully applied to learn deep generative models for characterizing a joint distribution of variables via kernel mean embedding.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.