no code implementations • 16 Mar 2024 • Zhehui Huang, Guangyao Shi, Gaurav S. Sukhatme
The success of LLMs in these tasks leads us to wonder: What is the ability of LLMs to solve vehicle routing problems (VRPs) with natural language task descriptions?
no code implementations • 8 Dec 2023 • K. R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme
Then, we show that introducing a "fixup" phase is sufficient to guarantee a trust region is enforced on every policy update while adding fewer than 5% additional gradient steps in practice.
no code implementations • 28 Sep 2023 • Shashank Hegde, Zhehui Huang, Gaurav S. Sukhatme
We demonstrate that the neural policies estimated by HyperPPO are capable of decentralized control of a Crazyflie2. 1 quadrotor.
no code implementations • 23 Sep 2023 • Zhehui Huang, Zhaojing Yang, Rahul Krupani, Baskın Şenbaşlar, Sumeet Batra, Gaurav S. Sukhatme
In this work, we propose an end-to-end DRL approach to control quadrotor swarms in environments with obstacles.
1 code implementation • 15 Jun 2023 • Zhehui Huang, Sumeet Batra, Tao Chen, Rahul Krupani, Tushar Kumar, Artem Molchanov, Aleksei Petrenko, James A. Preiss, Zhaojing Yang, Gaurav S. Sukhatme
In addition to speed, such simulators need to model the physics of the robots and their interaction with the environment to a level acceptable for transferring policies learned in simulation to reality.
4 code implementations • ICML 2020 • Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun
In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.