Search Results for author: Russ Tedrake

Found 35 papers, 17 papers with code

Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation for Efficient Synthesis and Verification

1 code implementation11 Apr 2024 Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, huan zhang

The flexibility and efficiency of our framework allow us to demonstrate Lyapunov-stable output feedback control with synthesized NN-based controllers and NN-based observers with formal stability guarantees, for the first time in literature.

PoCo: Policy Composition from and for Heterogeneous Robot Learning

no code implementations4 Feb 2024 Lirui Wang, Jialiang Zhao, Yilun Du, Edward H. Adelson, Russ Tedrake

Training general robotic policies from heterogeneous data for different tasks is a significant challenge.

Robot Fleet Learning via Policy Merging

1 code implementation2 Oct 2023 Lirui Wang, Kaiqing Zhang, Allan Zhou, Max Simchowitz, Russ Tedrake

We show that FLEET-MERGE consolidates the behavior of policies trained on 50 tasks in the Meta-World environment, with good performance on nearly all training tasks at test time.

Robot Manipulation

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

no code implementations24 Jun 2023 H. J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake

However, in order to apply them effectively in offline optimization paradigms such as offline Reinforcement Learning (RL) or Imitation Learning (IL), we require a more careful consideration of how uncertainty estimation interplays with first-order methods that attempt to minimize them.

Imitation Learning Offline RL +2

Synthesizing Stable Reduced-Order Visuomotor Policies for Nonlinear Systems via Sums-of-Squares Optimization

no code implementations24 Apr 2023 Glen Chou, Russ Tedrake

To solve this problem approximately, we propose two approaches: the first solves a sequence of sum-of-squares optimization problems to iteratively improve a policy which is provably-stable by construction, while the second directly performs gradient-based optimization on the parameters of the polynomial policy, and its closed-loop stability is verified a posteriori.

Smoothed Online Learning for Prediction in Piecewise Affine Systems

no code implementations NeurIPS 2023 Adam Block, Max Simchowitz, Russ Tedrake

The problem of piecewise affine (PWA) regression and planning is of foundational importance to the study of online learning, control, and robotics, where it provides a theoretically and empirically tractable setting to study systems undergoing sharp changes in the dynamics.

Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?

no code implementations30 Dec 2022 Yi Tian, Kaiqing Zhang, Russ Tedrake, Suvrit Sra

We study the task of learning state representations from potentially high-dimensional observations, with the goal of controlling an unknown partially observable system.

Representation Learning

Does Learning from Decentralized Non-IID Unlabeled Data Benefit from Self Supervision?

1 code implementation20 Oct 2022 Lirui Wang, Kaiqing Zhang, Yunzhu Li, Yonglong Tian, Russ Tedrake

Decentralized learning has been advocated and widely deployed to make efficient use of distributed datasets, with an extensive focus on supervised learning (SL) problems.

Contrastive Learning Representation Learning +1

Elliptical Slice Sampling for Probabilistic Verification of Stochastic Systems with Signal Temporal Logic Specifications

no code implementations28 Feb 2022 Guy Scher, Sadra Sadraddini, Russ Tedrake, Hadas Kress-Gazit

Central to our approach is a method for efficient and rejection-free sampling of signals from a Gaussian distribution such that satisfy or violate a given STL formula.

Decision Making Motion Planning

Learning Multi-Object Dynamics with Compositional Neural Radiance Fields

no code implementations24 Feb 2022 Danny Driess, Zhiao Huang, Yunzhu Li, Russ Tedrake, Marc Toussaint

We present a method to learn compositional multi-object dynamics models from image observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and graph neural networks.

Object

Globally Convergent Policy Search over Dynamic Filters for Output Estimation

no code implementations23 Feb 2022 Jack Umenberger, Max Simchowitz, Juan C. Perdomo, Kaiqing Zhang, Russ Tedrake

In this paper, we provide a new perspective on this challenging problem based on the notion of $\textit{informativity}$, which intuitively requires that all components of a filter's internal state are representative of the true state of the underlying dynamical system.

Do Differentiable Simulators Give Better Policy Gradients?

no code implementations2 Feb 2022 H. J. Terry Suh, Max Simchowitz, Kaiqing Zhang, Russ Tedrake

Differentiable simulators promise faster computation time for reinforcement learning by replacing zeroth-order gradient estimates of a stochastic objective with an estimate based on first-order gradients.

Learning Models as Functionals of Signed-Distance Fields for Manipulation Planning

no code implementations2 Oct 2021 Danny Driess, Jung-Su Ha, Marc Toussaint, Russ Tedrake

We show that representing objects as signed-distance fields not only enables to learn and represent a variety of models with higher accuracy compared to point-cloud and occupancy measure representations, but also that SDF-based models are suitable for optimization-based planning.

Shortest Paths in Graphs of Convex Sets

1 code implementation27 Jan 2021 Tobia Marcucci, Jack Umenberger, Pablo A. Parrilo, Russ Tedrake

Given a graph, the shortest-path problem requires finding a sequence of edges with minimum cumulative length that connects a source vertex to a target vertex.

Robot Navigation Discrete Mathematics Optimization and Control

Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems

no code implementations NeurIPS 2020 Aman Sinha, Matthew O'Kelly, Russ Tedrake, John Duchi

Learning-based methodologies increasingly find applications in safety-critical domains like autonomous driving and medical robotics.

Autonomous Driving Computational Efficiency

The Surprising Effectiveness of Linear Models for Visual Foresight in Object Pile Manipulation

no code implementations21 Feb 2020 H. J. Terry Suh, Russ Tedrake

In this paper, we tackle the problem of pushing piles of small objects into a desired target set using visual feedback.

Robotics

Self-Supervised Correspondence in Visuomotor Policy Learning

1 code implementation16 Sep 2019 Peter Florence, Lucas Manuelli, Russ Tedrake

In this paper we explore using self-supervised correspondence for improving the generalization performance and sample efficiency of visuomotor policy learning.

Imitation Learning

kPAM-SC: Generalizable Manipulation Planning using KeyPoint Affordance and Shape Completion

no code implementations16 Sep 2019 Wei Gao, Russ Tedrake

Using the proposed hybrid object representation, we formulate the manipulation task as a motion planning problem which encodes both the object target configuration and physical feasibility for a category of objects.

Keypoint Detection Motion Planning +1

Connecting Touch and Vision via Cross-Modal Prediction

1 code implementation CVPR 2019 Yunzhu Li, Jun-Yan Zhu, Russ Tedrake, Antonio Torralba

To connect vision and touch, we introduce new tasks of synthesizing plausible tactile signals from visual inputs as well as imagining how we interact with objects given tactile data as input.

SurfelWarp: Efficient Non-Volumetric Single View Dynamic Reconstruction

1 code implementation30 Apr 2019 Wei Gao, Russ Tedrake

We contribute a dense SLAM system that takes a live stream of depth images as input and reconstructs non-rigid deforming scenes in real time, without templates or prior models.

Dynamic Reconstruction

kPAM: KeyPoint Affordances for Category-Level Robotic Manipulation

no code implementations15 Mar 2019 Lucas Manuelli, Wei Gao, Peter Florence, Russ Tedrake

However, representing an object with a parameterized transformation defined on a fixed template cannot capture large intra-category shape variation, and specifying a target pose at a category level can be physically infeasible or fail to accomplish the task -- e. g. knowing the pose and size of a coffee mug relative to some canonical mug is not sufficient to successfully hang it on a rack by its handle.

Robotics

Linear Encodings for Polytope Containment Problems

1 code implementation12 Mar 2019 Sadra Sadraddini, Russ Tedrake

This problem is rooted in computational convexity, and arises in applications such as verification and control of dynamical systems.

Optimization and Control Computational Geometry

FilterReg: Robust and Efficient Probabilistic Point-Set Registration using Gaussian Filter and Twist Parameterization

1 code implementation CVPR 2019 Wei Gao, Russ Tedrake

Additionally, we present a simple and efficient twist parameterization that generalizes our method to the registration of articulated and deformable objects.

Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation

1 code implementation NeurIPS 2018 Matthew O'Kelly, Aman Sinha, Hongseok Namkoong, John Duchi, Russ Tedrake

While recent developments in autonomous vehicle (AV) technology highlight substantial progress, we lack tools for rigorous and scalable testing.

Autonomous Driving

Propagation Networks for Model-Based Control Under Partial Observation

1 code implementation28 Sep 2018 Yunzhu Li, Jiajun Wu, Jun-Yan Zhu, Joshua B. Tenenbaum, Antonio Torralba, Russ Tedrake

There has been an increasing interest in learning dynamics simulators for model-based control.

Sampling-based Polytopic Trees for Approximate Optimal Control of Piecewise Affine Systems

1 code implementation25 Sep 2018 Sadra Sadraddini, Russ Tedrake

Piecewise affine (PWA) systems are widely used to model highly nonlinear behaviors such as contact dynamics in robot locomotion and manipulation.

Systems and Control Robotics Optimization and Control

LVIS: Learning from Value Function Intervals for Contact-Aware Robot Controllers

1 code implementation16 Sep 2018 Robin Deits, Twan Koolen, Russ Tedrake

Guided policy search is a popular approach for training controllers for high-dimensional systems, but it has a number of pitfalls.

Robotics

Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation

3 code implementations22 Jun 2018 Peter R. Florence, Lucas Manuelli, Russ Tedrake

In this paper we present Dense Object Nets, which build on recent developments in self-supervised dense descriptor learning, as a consistent object representation for visual understanding and manipulation.

Object

Evaluating Robustness of Neural Networks with Mixed Integer Programming

6 code implementations ICLR 2019 Vincent Tjeng, Kai Xiao, Russ Tedrake

The computational speedup allows us to verify properties on convolutional networks with an order of magnitude more ReLUs than networks previously verified by any complete verifier.

LabelFusion: A Pipeline for Generating Ground Truth Labels for Real RGBD Data of Cluttered Scenes

1 code implementation15 Jul 2017 Pat Marion, Peter R. Florence, Lucas Manuelli, Russ Tedrake

We use an RGBD camera to collect video of a scene from multiple viewpoints and leverage existing reconstruction techniques to produce a 3D dense reconstruction.

3D Reconstruction Object +2

Funnel Libraries for Real-Time Robust Feedback Motion Planning

no code implementations15 Jan 2016 Anirudha Majumdar, Russ Tedrake

We consider the problem of generating motion plans for a robot that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, and disturbances.

Motion Planning

Pushbroom Stereo for High-Speed Navigation in Cluttered Environments

no code implementations26 Jul 2014 Andrew J. Barry, Russ Tedrake

We present a novel stereo vision algorithm that is capable of obstacle detection on a mobile-CPU processor at 120 frames per second.

Position Vocal Bursts Intensity Prediction

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

no code implementations NeurIPS 2008 John W. Roberts, Russ Tedrake

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the estimate of the gradient.

Cannot find the paper you are looking for? You can Submit a new open access paper.