Search Results for author: Zhehui Huang

Found 6 papers, 2 papers with code

From Words to Routes: Applying Large Language Models to Vehicle Routing

no code implementations • 16 Mar 2024 • Zhehui Huang, Guangyao Shi, Gaurav S. Sukhatme

The success of LLMs in these tasks leads us to wonder: What is the ability of LLMs to solve vehicle routing problems (VRPs) with natural language task descriptions?

Code Generation Text-to-Code Generation

Paper
Add Code

Guaranteed Trust Region Optimization via Two-Phase KL Penalization

no code implementations • 8 Dec 2023 • K. R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme

Then, we show that introducing a "fixup" phase is sufficient to guarantee a trust region is enforced on every policy update while adding fewer than 5% additional gradient steps in practice.

Computational Efficiency Reinforcement Learning (RL)

Paper
Add Code

HyperPPO: A scalable method for finding small policies for robotic control

no code implementations • 28 Sep 2023 • Shashank Hegde, Zhehui Huang, Gaurav S. Sukhatme

We demonstrate that the neural policies estimated by HyperPPO are capable of decentralized control of a Crazyflie2. 1 quadrotor.

Paper
Add Code

Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning

no code implementations • 23 Sep 2023 • Zhehui Huang, Zhaojing Yang, Rahul Krupani, Baskın Şenbaşlar, Sumeet Batra, Gaurav S. Sukhatme

In this work, we propose an end-to-end DRL approach to control quadrotor swarms in environments with obstacles.

Collision Avoidance

Paper
Add Code

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

1 code implementation • 15 Jun 2023 • Zhehui Huang, Sumeet Batra, Tao Chen, Rahul Krupani, Tushar Kumar, Artem Molchanov, Aleksei Petrenko, James A. Preiss, Zhaojing Yang, Gaurav S. Sukhatme

In addition to speed, such simulators need to model the physics of the robots and their interaction with the environment to a level acceptable for transferring policies learned in simulation to reality.

Reinforcement Learning (RL)

Paper
Code

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

4 code implementations • ICML 2020 • Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun

In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.

FPS Games General Reinforcement Learning +3

2,539

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.