Search Results for author: Xuguang Lan

Found 18 papers, 4 papers with code

Human Action Recognition Based on Spatial-Temporal Attention

no code implementations ICLR 2019 Wensong Chan, Zhiqiang Tian, Xuguang Lan

Many state-of-the-art methods of recognizing human action are based on attention mechanism, which shows the importance of attention mechanism in action recognition.

Action Recognition Temporal Action Localization

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

no code implementations28 Feb 2024 Zeyang Liu, Lipeng Wan, Xinrui Yang, Zhuoran Chen, Xingyu Chen, Xuguang Lan

To address this limitation, we propose Imagine, Initialize, and Explore (IIE), a novel method that offers a promising solution for efficient multi-agent exploration in complex scenarios.

Action Generation SMAC+ +1

ESMC: Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint

no code implementations18 Jul 2023 Zhenhao Jiang, Biao Zeng, Hao Feng, Jin Liu, Jicong Fan, Jie Zhang, Jia Jia, Ning Hu, Xingyu Chen, Xuguang Lan

We propose a novel Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint (ESMC) and two alternatives: Entire Space Multi-Task Model with Siamese Network (ESMS) and Entire Space Multi-Task Model in Global Domain (ESMG) to address the PSC issue.

Decision Making Recommendation Systems +1

MMRDN: Consistent Representation for Multi-View Manipulation Relationship Detection in Object-Stacked Scenes

no code implementations25 Apr 2023 Han Wang, Jiayuan Zhang, Lipeng Wan, Xingyu Chen, Xuguang Lan, Nanning Zheng

Manipulation relationship detection (MRD) aims to guide the robot to grasp objects in the right order, which is important to ensure the safety and reliability of grasping in object stacked scenes.

Position Relationship Detection

Greedy-based Value Representation for Efficient Coordination in Multi-agent Reinforcement Learning

no code implementations29 Sep 2021 Lipeng Wan, Zeyang Liu, Xingyu Chen, Han Wang, Xuguang Lan

Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning (MARL) methods with linear or monotonic value decomposition can not ensure the optimal consistency (i. e. the correspondence between the individual greedy actions and the maximal true Q value), leading to instability and poor coordination.

Multi-agent Reinforcement Learning reinforcement-learning +1

INVIGORATE: Interactive Visual Grounding and Grasping in Clutter

no code implementations25 Aug 2021 Hanbo Zhang, Yunfan Lu, Cunjun Yu, David Hsu, Xuguang Lan, Nanning Zheng

This paper presents INVIGORATE, a robot system that interacts with human through natural language and grasps a specified object in clutter.

Blocking Object +5

Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation

no code implementations7 Dec 2020 Lipeng Wan, Xuwei Song, Xuguang Lan, Nanning Zheng

General methods for policy based multi-agent reinforcement learning to solve the challenge introduce differentiate value functions or advantage functions for individual agents.

Multi-agent Reinforcement Learning Starcraft

A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning

2 code implementations ECCV 2020 Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng

Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.

Generalized Zero-Shot Learning

REGNet: REgion-based Grasp Network for End-to-end Grasp Detection in Point Clouds

1 code implementation28 Feb 2020 Binglei Zhao, Hanbo Zhang, Xuguang Lan, Haoyu Wang, Zhiqiang Tian, Nanning Zheng

Reliable robotic grasping in unstructured environments is a crucial but challenging task.

Robotics

Hindsight Trust Region Policy Optimization

1 code implementation29 Jul 2019 Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng

We propose \emph{Hindsight Trust Region Policy Optimization}(HTRPO), a new RL algorithm that extends the highly successful TRPO algorithm with \emph{hindsight} to tackle the challenge of sparse rewards.

Atari Games Policy Gradient Methods +1

A Real-time Robotic Grasp Approach with Oriented Anchor Box

no code implementations8 Sep 2018 Hanbo Zhang, Xinwen Zhou, Xuguang Lan, Jin Li, Zhiqiang Tian, Nanning Zheng

The main component of our approach is a grasp detection network with oriented anchor boxes as detection priors.

Robotics

ROI-based Robotic Grasp Detection for Object Overlapping Scenes

no code implementations30 Aug 2018 Hanbo Zhang, Xuguang Lan, Site Bai, Xinwen Zhou, Zhiqiang Tian, Nanning Zheng

Experimental results demonstrate that ROI-GD performs much better in object overlapping scenes and at the meantime, remains comparable with state-of-the-art grasp detection algorithms on Cornell Grasp Dataset and Jacquard Dataset.

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.