1 code implementation • 24 Dec 2024 • Shaofei Cai, Zhancun Mu, Kaichen He, Bowei Zhang, Xinyue Zheng, Anji Liu, Yitao Liang
Minecraft has emerged as a valuable testbed for embodied intelligence and sequential decision-making research, yet the development and validation of novel agents remains hindered by significant engineering challenges.
no code implementations • 7 Dec 2024 • Shaofei Cai, Bowei Zhang, ZiHao Wang, Haowei Lin, Xiaojian Ma, Anji Liu, Yitao Liang
Developing agents that can follow multimodal instructions remains a fundamental challenge in robotics and AI.
1 code implementation • 23 Oct 2024 • Shaofei Cai, ZiHao Wang, Kewei Lian, Zhancun Mu, Xiaojian Ma, Anji Liu, Yitao Liang
Using this approach, we train ROCKET-1, a low-level policy that predicts actions based on concatenated visual observations and segmentation masks, supported by real-time object tracking from SAM-2.
no code implementations • 2 Oct 2024 • Anji Liu, Oliver Broadrick, Mathias Niepert, Guy Van Den Broeck
When we apply this approach to autoregressive copula models, the combined model outperforms both models individually in unconditional and conditional text generation.
no code implementations • 27 Jun 2024 • ZiHao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li, Anji Liu, Xiaojian Ma, Yitao Liang
First, we introduce a self-supervised approach to learn a behavior encoder that produces discretized tokens for behavior trajectories $\tau = \{o_0, a_0, \dots\}$ and an imitation learning policy decoder conditioned on these tokens.
no code implementations • 24 Jun 2024 • Zixia Jia, Junpeng Li, Shichuan Zhang, Anji Liu, Zilong Zheng
Traditional supervised learning heavily relies on human-annotated datasets, especially in data-hungry neural approaches.
Document-level Relation Extraction Multi-Label Classification +3
no code implementations • 12 Jun 2024 • Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng
To address these issues, we propose Hierarchical Opponent modeling and Planning (HOP), a novel multi-agent decision-making algorithm that enables few-shot adaptation to unseen policies in mixed-motive environments.
1 code implementation • 24 May 2024 • Vinh Tong, Trung-Dung Hoang, Anji Liu, Guy Van Den Broeck, Mathias Niepert
We achieve FIDs of 2. 38 (10 NFE), and 2. 27 (10 NFE) on unconditional CIFAR10 and AFHQv2 in 5-10 minutes of training.
no code implementations • CVPR 2024 • Zhihao Cao, Zidong Wang, Siwen Xie, Anji Liu, Lifeng Fan
Our findings illustrate the potential of AI-imbued assistive robots in improving the well-being of vulnerable groups.
1 code implementation • 8 Mar 2024 • ZiHao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang
We explore how iterative revising a chain of thoughts with the help of information retrieval significantly improves large language models' reasoning and generation ability in long-horizon generation tasks, while hugely mitigating hallucination.
1 code implementation • 28 Nov 2023 • Anji Liu, Mathias Niepert, Guy Van Den Broeck
Further, with the help of an image encoder and decoder, our method can readily accept semantic constraints on specific regions of the image, which opens up the potential for more controlled image generation tasks.
1 code implementation • 10 Nov 2023 • ZiHao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
Achieving human-like planning and control with multimodal observations in an open world is a key milestone for more functional generalist agents.
no code implementations • 31 Oct 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
A popular paradigm for offline Reinforcement Learning (RL) tasks is to first fit the offline trajectories to a sequence model, and then prompt the model for actions that lead to high expected return.
no code implementations • 12 Oct 2023 • Shaofei Cai, Bowei Zhang, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We propose to follow reference videos as instructions, which offer expressive goal specifications while eliminating the need for expensive text-gameplay annotations.
no code implementations • 22 Aug 2023 • Ceyao Zhang, Kaijie Yang, Siyi Hu, ZiHao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in the realm of multi-agent systems.
no code implementations • 16 Feb 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
In this paper, we theoretically and empirically discover that the performance of a PC can exceed that of its teacher model.
1 code implementation • 3 Feb 2023 • ZiHao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
We investigate the challenge of task planning for multi-task embodied agents in open-world environments.
2 code implementations • CVPR 2023 • Shaofei Cai, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents.
1 code implementation • 22 Nov 2022 • Meihua Dang, Anji Liu, Guy Van Den Broeck
The growing operation increases model capacity by increasing the size of the latent space.
1 code implementation • 20 Nov 2022 • Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma
To bridge this gap, we study the problem of few-shot adaptation in the context of human-in-the-loop reinforcement learning.
no code implementations • 10 Oct 2022 • Anji Liu, Honghua Zhang, Guy Van Den Broeck
We propose to overcome such bottleneck by latent variable distillation: we leverage the less tractable but more expressive deep generative models to provide extra supervision over the latent variables of PCs.
1 code implementation • ICLR 2022 • Anji Liu, Stephan Mandt, Guy Van Den Broeck
To overcome such problems, we establish a new class of tractable lossless compression models that permit efficient encoding and decoding: Probabilistic Circuits (PCs).
no code implementations • NeurIPS 2021 • Anji Liu, Guy Van Den Broeck
Instead, we re-think regularization for PCs and propose two intuitive techniques, data softening and entropy regularization, that both take advantage of PCs' tractability and still have an efficient implementation as a computation graph.
1 code implementation • NeurIPS 2021 • Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van Den Broeck
Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models.
no code implementations • NeurIPS 2021 • Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van Den Broeck
Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models.
no code implementations • 15 Jun 2020 • Anji Liu, Yitao Liang, Ji Liu, Guy Van Den Broeck, Jianshu Chen
Second, and more importantly, we demonstrate how the proposed necessary conditions can be adopted to design more effective parallel MCTS algorithms.
1 code implementation • 25 Feb 2020 • Anji Liu, Yitao Liang, Guy Van Den Broeck
Off-policy reinforcement learning (RL) is concerned with learning a rewarding policy by executing another policy that gathers samples of experience.
4 code implementations • ICLR 2020 • Anji Liu, Jianshu Chen, Mingze Yu, Yu Zhai, Xuewen Zhou, Ji Liu
Monte Carlo Tree Search (MCTS) algorithms have achieved great success on many challenging benchmarks (e. g., Computer Go).