no code implementations • 13 Apr 2024 • Zhihao Cao, Zidong Wang, Siwen Xie, Anji Liu, Lifeng Fan
Our findings illustrate the potential of AI-imbued assistive robots in improving the well-being of vulnerable groups.
no code implementations • 8 Mar 2024 • ZiHao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang
We explore how iterative revising a chain of thoughts with the help of information retrieval significantly improves large language models' reasoning and generation ability in long-horizon generation tasks, while hugely mitigating hallucination.
no code implementations • 28 Nov 2023 • Anji Liu, Mathias Niepert, Guy Van Den Broeck
In addition to proposing a new framework for constrained image generation, this paper highlights the benefit of more tractable models and motivates the development of expressive TPMs.
no code implementations • 10 Nov 2023 • ZiHao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
Achieving human-like planning and control with multimodal observations in an open world is a key milestone for more functional generalist agents.
no code implementations • 31 Oct 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
A popular paradigm for offline Reinforcement Learning (RL) tasks is to first fit the offline trajectories to a sequence model, and then prompt the model for actions that lead to high expected return.
no code implementations • 12 Oct 2023 • Shaofei Cai, Bowei Zhang, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We propose to follow reference videos as instructions, which offer expressive goal specifications while eliminating the need for expensive text-gameplay annotations.
no code implementations • 22 Aug 2023 • Ceyao Zhang, Kaijie Yang, Siyi Hu, ZiHao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in the realm of multi-agent systems.
no code implementations • 16 Feb 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
In this paper, we theoretically and empirically discover that the performance of a PC can exceed that of its teacher model.
1 code implementation • 3 Feb 2023 • ZiHao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
We investigate the challenge of task planning for multi-task embodied agents in open-world environments.
2 code implementations • CVPR 2023 • Shaofei Cai, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents.
1 code implementation • 22 Nov 2022 • Meihua Dang, Anji Liu, Guy Van Den Broeck
The growing operation increases model capacity by increasing the size of the latent space.
1 code implementation • 20 Nov 2022 • Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma
To bridge this gap, we study the problem of few-shot adaptation in the context of human-in-the-loop reinforcement learning.
no code implementations • 10 Oct 2022 • Anji Liu, Honghua Zhang, Guy Van Den Broeck
We propose to overcome such bottleneck by latent variable distillation: we leverage the less tractable but more expressive deep generative models to provide extra supervision over the latent variables of PCs.
1 code implementation • ICLR 2022 • Anji Liu, Stephan Mandt, Guy Van Den Broeck
To overcome such problems, we establish a new class of tractable lossless compression models that permit efficient encoding and decoding: Probabilistic Circuits (PCs).
no code implementations • NeurIPS 2021 • Anji Liu, Guy Van Den Broeck
Instead, we re-think regularization for PCs and propose two intuitive techniques, data softening and entropy regularization, that both take advantage of PCs' tractability and still have an efficient implementation as a computation graph.
1 code implementation • NeurIPS 2021 • Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van Den Broeck
Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models.
no code implementations • NeurIPS 2021 • Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van Den Broeck
Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models.
no code implementations • 15 Jun 2020 • Anji Liu, Yitao Liang, Ji Liu, Guy Van Den Broeck, Jianshu Chen
Second, and more importantly, we demonstrate how the proposed necessary conditions can be adopted to design more effective parallel MCTS algorithms.
1 code implementation • 25 Feb 2020 • Anji Liu, Yitao Liang, Guy Van Den Broeck
Off-policy reinforcement learning (RL) is concerned with learning a rewarding policy by executing another policy that gathers samples of experience.
4 code implementations • ICLR 2020 • Anji Liu, Jianshu Chen, Mingze Yu, Yu Zhai, Xuewen Zhou, Ji Liu
Monte Carlo Tree Search (MCTS) algorithms have achieved great success on many challenging benchmarks (e. g., Computer Go).