no code implementations • 5 Mar 2024 • Bo wang, Tianxiang Sun, Hang Yan, Siyin Wang, Qingyuan Cheng, Xipeng Qiu
The exploration of whether agents can align with their environment without relying on human-labeled data presents an intriguing research topic.