2 code implementations • 16 Feb 2024 • Shengjie Qiu, Junhao Zheng, Zhen Liu, Yicheng Luo, Qianli Ma
As for the E2O problem, we use knowledge distillation to maintain the model's discriminative ability for old entities.
no code implementations • 22 Dec 2023 • Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang
This paper presents a general framework model for integrating and learning structured reasoning into AI agents' policies.
no code implementations • 5 Dec 2023 • Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian
However, the extensive collection of human motion-captured data and the derived datasets of humanoid trajectories, such as MoCapAct, paves the way to tackle these challenges.
1 code implementation • NeurIPS 2023 • Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang
Thus, we propose ChessGPT, a GPT model bridging policy learning and language modeling by integrating data from these two sources in Chess games.
no code implementations • 30 Mar 2023 • Yicheng Luo, Jackie Kay, Edward Grefenstette, Marc Peter Deisenroth
While offline RL algorithms can in principle be used for finetuning, in practice, their online performance improves slowly.
1 code implementation • 24 Mar 2023 • Yicheng Luo, Zhengyao Jiang, samuel cohen, Edward Grefenstette, Marc Peter Deisenroth
In this paper, we introduce Optimal Transport Reward labeling (OTR), an algorithm that assigns rewards to offline trajectories, with a few high-quality demonstrations.
1 code implementation • 25 Aug 2022 • Yicheng Luo, Jing Ren, Xuefei Zhe, Di Kang, Yajing Xu, Peter Wonka, Linchao Bao
The network takes a line cloud as input , i. e., a nonstructural and unordered set of 3D line segments extracted from multi-view images, and outputs a 3D wireframe of the underlying building, which consists of a sparse set of 3D junctions connected by line segments.
no code implementations • ICCV 2021 • Haotian Zhang, Yicheng Luo, Fangbo Qin, Yijia He, Xiao Liu
The line description ability of ELSD also outperforms the previous works on the line matching task.
Ranked #1 on Line Segment Detection on wireframe dataset
1 code implementation • 10 Oct 2020 • Yicheng Luo, Antonio Filieri, Yuan Zhou
Probabilistic software analysis aims at quantifying the probability of a target event occurring during the execution of a program processing uncertain incoming data or written itself using probabilistic programming constructs.