Search Results for author: Mingqi Yuan

Found 11 papers, 3 papers with code

Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives

no code implementations29 May 2024 Mingqi Yuan, Huijiang Wang, Kai-Fung Chu, Fumiya Iida, Bo Li, Wenjun Zeng

These challenges arise from the need for accurate real-time perception of human actions, adaptive control algorithms for robots, and the effective coordination between human and robotic movements.

Hand Pose Estimation

RLLTE: Long-Term Evolution Project of Reinforcement Learning

2 code implementations28 Sep 2023 Mingqi Yuan, Zequn Zhang, Yang Xu, Shihao Luo, Bo Li, Xin Jin, Wenjun Zeng

We present RLLTE: a long-term evolution, extremely modular, and open-source framework for reinforcement learning (RL) research and application.

Language Modelling Large Language Model +2

Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning

1 code implementation26 Jan 2023 Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng

We present AIRS: Automatic Intrinsic Reward Shaping that intelligently and adaptively provides high-quality intrinsic rewards to enhance exploration in reinforcement learning (RL).

Benchmarking reinforcement-learning +1

Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning

no code implementations19 Sep 2022 Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng

Exploration is critical for deep reinforcement learning in complex environments with high-dimensional observations and sparse rewards.

Atari Games Benchmarking +3

Rényi State Entropy for Exploration Acceleration in Reinforcement Learning

no code implementations8 Mar 2022 Mingqi Yuan, Man-on Pun, Dong Wang

One of the most critical challenges in deep reinforcement learning is to maintain the long-term exploration capability of the agent.

reinforcement-learning Reinforcement Learning (RL)

Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning

no code implementations19 Jul 2021 Mingqi Yuan, Mon-on Pun, Dong Wang, Yi Chen, Haojun Li

Furthermore, we leverage a variational auto-encoder (VAE) model to capture the life-long novelty of states, which is combined with the global JFI score to form multimodal intrinsic rewards.

Efficient Exploration Fairness +2

Hybrid Adversarial Imitation Learning

no code implementations4 Feb 2021 Mingqi Yuan

Extrapolating beyond-demonstrator (BD) performance through the imitation learning (IL) algorithm aims to learn from and outperform the demonstrator.

Imitation Learning Reinforcement Learning (RL)

Fairness-Oriented User Scheduling for Bursty Downlink Transmission Using Multi-Agent Reinforcement Learning

no code implementations30 Dec 2020 Mingqi Yuan, Qi Cao, Man-on Pun, Yi Chen

In this work, we develop practical user scheduling algorithms for downlink bursty traffic with emphasis on user fairness.

Distributed Optimization Fairness +3

Cannot find the paper you are looking for? You can Submit a new open access paper.