Search Results for author: Satvik Sharma

Found 6 papers, 2 papers with code

Learning to Efficiently Plan Robust Frictional Multi-Object Grasps

no code implementations13 Oct 2022 Wisdom C. Agboh, Satvik Sharma, Kishore Srinivas, Mallika Parulekar, Gaurav Datta, Tianshuang Qiu, Jeffrey Ichnowski, Eugen Solowjow, Mehmet Dogar, Ken Goldberg

In physical experiments, we find an 11. 7% increase in success rates, a 1. 7x increase in picks per hour, and an 8. 2x decrease in grasp planning time compared to prior work on multi-object grasping.


DEFT: Diverse Ensembles for Fast Transfer in Reinforcement Learning

no code implementations26 Sep 2022 Simeon Adebola, Satvik Sharma, Kaushik Shivakumar

In this work, we present Diverse Ensembles for Fast Transfer in RL (DEFT), a new ensemble-based method for reinforcement learning in highly multimodal environments and improved transfer to unseen environments.

Ensemble Learning reinforcement-learning +1

Automated Pruning of Polyculture Plants

no code implementations22 Aug 2022 Mark Presten, Rishi Parikh, Shrey Aeron, Sandeep Mukherjee, Simeon Adebola, Satvik Sharma, Mark Theis, Walter Teitelbaum, Ken Goldberg

Using an overhead camera to collect data from a physical scale garden testbed, the autonomous system utilizes a learned Plant Phenotyping convolutional neural network and a Bounding Disk Tracking algorithm to evaluate the individual plant distribution and estimate the state of the garden each day.

Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision

1 code implementation29 Jun 2022 Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg

With continual learning, interventions from the remote pool of humans can also be used to improve the robot fleet control policy over time.

Continual Learning

AlphaGarden: Learning to Autonomously Tend a Polyculture Garden

1 code implementation11 Nov 2021 Mark Presten, Yahav Avigal, Mark Theis, Satvik Sharma, Rishi Parikh, Shrey Aeron, Sandeep Mukherjee, Sebastian Oehme, Simeon Adebola, Walter Teitelbaum, Varun Kamat, Ken Goldberg

This paper presents AlphaGarden: an autonomous polyculture garden that prunes and irrigates living plants in a 1. 5m x 3. 0m physical testbed.

Policy Gradient Bayesian Robust Optimization for Imitation Learning

no code implementations11 Jun 2021 Zaynah Javed, Daniel S. Brown, Satvik Sharma, Jerry Zhu, Ashwin Balakrishna, Marek Petrik, Anca D. Dragan, Ken Goldberg

Results suggest that PG-BROIL can produce a family of behaviors ranging from risk-neutral to risk-averse and outperforms state-of-the-art imitation learning algorithms when learning from ambiguous demonstrations by hedging against uncertainty, rather than seeking to uniquely identify the demonstrator's reward function.

Imitation Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.