Search Results for author: Zhehui Huang

Found 6 papers, 2 papers with code

From Words to Routes: Applying Large Language Models to Vehicle Routing

no code implementations • 16 Mar 2024 • Zhehui Huang, Guangyao Shi, Gaurav S. Sukhatme

The success of LLMs in these tasks leads us to wonder: What is the ability of LLMs to solve vehicle routing problems (VRPs) with natural language task descriptions?

Code Generation, Text-to-Code Generation

Guaranteed Trust Region Optimization via Two-Phase KL Penalization

no code implementations • 8 Dec 2023 • K. R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme

Then, we show that introducing a "fixup" phase is sufficient to guarantee a trust region is enforced on every policy update while adding fewer than 5% additional gradient steps in practice.

Computational Efficiency, Reinforcement Learning (RL)

HyperPPO: A scalable method for finding small policies for robotic control

no code implementations • 28 Sep 2023 • Shashank Hegde, Zhehui Huang, Gaurav S. Sukhatme

We demonstrate that the neural policies estimated by HyperPPO are capable of decentralized control of a Crazyflie2.1 quadrotor.

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

1 code implementation • 15 Jun 2023 • Zhehui Huang, Sumeet Batra, Tao Chen, Rahul Krupani, Tushar Kumar, Artem Molchanov, Aleksei Petrenko, James A. Preiss, Zhaojing Yang, Gaurav S. Sukhatme

In addition to speed, such simulators need to model the physics of the robots and their interaction with the environment to a level acceptable for transferring policies learned in simulation to reality.

Reinforcement Learning (RL)

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

4 code implementations • ICML 2020 • Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun

In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.

FPS Games, General Reinforcement Learning, +3
