no code implementations • 10 Jan 2022 • Guowei Cui, Xiaoping Chen
This paper shows how to introduce virtual action to extend action models to make the graph to be connected: i) explicitly defines static predicate (type, permanent properties, etc) or dynamic predicate (state); ii) constructs a full virtual action or a semi-virtual action for each state; iii) finds the cause of the planning failure through a progressive planning approach.
no code implementations • 2 Nov 2020 • Nan Lin, YuXuan Li, Yujun Zhu, Ruolin Wang, Xiayu Zhang, Jianmin Ji, Keke Tang, Xiaoping Chen, Xinming Zhang
Our meta policy tries to manipulate the next optimal state and actual action is produced by the inverse dynamics model.
no code implementations • SIGDIAL (ACL) 2020 • Keting Lu, Shiqi Zhang, Peter Stone, Xiaoping Chen
More interestingly, the robot was able to learn from navigation tasks to improve its dialog strategies.
no code implementations • SIGDIAL (ACL) 2020 • Yan Cao, Keting Lu, Xiaoping Chen, Shiqi Zhang
Reinforcement learning methods have been used to compute dialog policies from language-based interaction experiences.
no code implementations • 22 Apr 2020 • Keting Lu, Shiqi Zhang, Xiaoping Chen
First, we develop an algorithm, called Experience Grafting (EG), to enable RL agents to reorganize segments of the few high-quality trajectories from the experience pool to generate many synthetic trajectories while retaining the quality.
no code implementations • 5 Apr 2020 • Shi Yin, Shangfei Wang, Xiaoping Chen, Enhong Chen
These 1D heatmaps reduce spatial complexity significantly compared to current heatmap regression methods, which use 2D heatmaps to represent the joint distributions of x and y coordinates.
no code implementations • 28 Sep 2018 • Keting Lu, Shiqi Zhang, Peter Stone, Xiaoping Chen
In this work, we integrate logical-probabilistic KRR with model-based RL, enabling agents to simultaneously reason with declarative knowledge and learn from interaction experiences.
no code implementations • 28 Aug 2018 • Shi Yin, Yi Zhou, Chenguang Li, Shangfei Wang, Jianmin Ji, Xiaoping Chen, Ruili Wang
We propose KDSL, a new word sense disambiguation (WSD) framework that utilizes knowledge to automatically generate sense-labeled data for supervised learning.
no code implementations • 20 Aug 2018 • Keting Lu, Shiqi Zhang, Xiaoping Chen
Reinforcement learning methods have been used for learning dialogue policies.
no code implementations • 28 Apr 2018 • Ce Qi, Xiaoping Chen, Pingyu Wang, Fei Su
The proposed training strategy uses the anchors with IoUs between the first and second threshold, which can consistently improve the performance of face detection.
no code implementations • 26 Dec 2016 • Keke Tang, Peng Song, Xiaoping Chen
Depth scans acquired from different views may contain nuisances such as noise, occlusion, and varying point density.
no code implementations • 9 Jun 2016 • Dongcai Lu, Feng Wu, Xiaoping Chen
Understanding user instructions in natural language is an active research topic in AI and robotics.
no code implementations • NeurIPS 2013 • Aijun Bai, Feng Wu, Xiaoping Chen
Monte-Carlo tree search is drawing great interest in the domain of planning under uncertainty, particularly when little or no domain knowledge is available.