Search Results for author: Zhehui Huang

Found 9 papers, 4 papers with code

Benchmark Real-time Adaptation and Communication Capabilities of Embodied Agent in Collaborative Scenarios

no code implementations30 Nov 2024 Shipeng Liu, Boshen Zhang, Zhehui Huang

To address this gap, we propose a novel benchmark that assesses agents' reactive adaptability and instantaneous communication capabilities at every step.

Effect of Adaptive Communication Support on Human-AI Collaboration

no code implementations26 Nov 2024 Shipeng Liu, FNU Shrutika, Boshen Zhang, Zhehui Huang, Feifei Qian

Leveraging the strong communication capabilities of Large Language Models (LLMs), we propose a Human-Robot Teaming Framework with Multi-Modal Language feedback (HRT-ML), a framework designed to enhance human-robot interaction by adjusting the frequency and content of language-based feedback.

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

1 code implementation12 Sep 2024 Liqiang Jing, Zhehui Huang, Xiaoyang Wang, Wenlin Yao, Wenhao Yu, Kaixin Ma, Hongming Zhang, Xinya Du, Dong Yu

To bridge this gap, we introduce DSBench, a comprehensive benchmark designed to evaluate data science agents with realistic tasks.

Can Large Language Models Solve Robot Routing?

1 code implementation16 Mar 2024 Zhehui Huang, Guangyao Shi, Gaurav S. Sukhatme

We systematically investigate the performance of LLMs in robot routing by constructing a dataset with 80 unique robot routing problems across 8 variants in both single and multi-robot settings.

Code Generation Text-to-Code Generation +1

Guaranteed Trust Region Optimization via Two-Phase KL Penalization

no code implementations8 Dec 2023 K. R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme

Then, we show that introducing a "fixup" phase is sufficient to guarantee a trust region is enforced on every policy update while adding fewer than 5% additional gradient steps in practice.

Computational Efficiency Reinforcement Learning (RL)

HyperPPO: A scalable method for finding small policies for robotic control

no code implementations28 Sep 2023 Shashank Hegde, Zhehui Huang, Gaurav S. Sukhatme

We demonstrate that the neural policies estimated by HyperPPO are capable of decentralized control of a Crazyflie2. 1 quadrotor.

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

1 code implementation15 Jun 2023 Zhehui Huang, Sumeet Batra, Tao Chen, Rahul Krupani, Tushar Kumar, Artem Molchanov, Aleksei Petrenko, James A. Preiss, Zhaojing Yang, Gaurav S. Sukhatme

In addition to speed, such simulators need to model the physics of the robots and their interaction with the environment to a level acceptable for transferring policies learned in simulation to reality.

Deep Reinforcement Learning Reinforcement Learning (RL)

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

4 code implementations ICML 2020 Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun

In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.

FPS Games General Reinforcement Learning +4

Cannot find the paper you are looking for? You can Submit a new open access paper.