1 code implementation • 31 Jul 2023 • Tao Huang, Kai Chen, Wang Wei, Jianan Li, Yonghao Long, Qi Dou
Based on this value function, a chaining policy is learned to instruct subtask policies to terminate at the state with the highest value so that all subsequent policies are more likely to be connected for accomplishing the task.
1 code implementation • 1 Jan 2023 • Yonghao Long, Wang Wei, Tao Huang, Yuehao Wang, Qi Dou
We showcase the improvement of our simulation environment with the designed new features, and validate effectiveness of incorporating human factors in embodied intelligence through the use of human demonstrations and reinforcement learning as a representative example.
no code implementations • 28 Mar 2022 • Tang Xinyao, Wang Wei, Song Huansheng, Zhao Chunhui
In this paper, we propose a 3D vehicle localization network CenterLoc3D for roadside monocular cameras, which directly predicts centroid and eight vertexes in image space, and the dimension of 3D bounding boxes without 2D detectors.
no code implementations • 29 Nov 2015 • Zhang Yi, Xiao Yanghua, Hwang Seung-won, Wang Wei
However, as such increase of recall often invites false positives and decreases precision in return, we propose the following two techniques: First, we identify concepts with different relatedness to generate linear orderings and pairwise ordering constraints.