Search Results for author: Stephen Giguere

Found 6 papers, 3 papers with code

Projected Natural Actor-Critic

no code implementations NeurIPS 2013 Philip S. Thomas, William C. Dabney, Stephen Giguere, Sridhar Mahadevan

Natural actor-critics are a popular class of policy search algorithms for finding locally optimal policies for Markov decision processes.

reinforcement-learning Reinforcement Learning (RL)

A Manifold Approach to Learning Mutually Orthogonal Subspaces

no code implementations 8 Mar 2017 Stephen Giguere, Francisco Garcia, Sridhar Mahadevan

Although many machine learning algorithms involve learning subspaces with particular characteristics, optimizing a parameter matrix that is constrained to represent a subspace can be challenging.

Domain Adaptation Riemannian optimization

Distributional Depth-Based Estimation of Object Articulation Models

1 code implementation 12 Aug 2021 Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum

Our core contributions include a novel representation for distributions over rigid body transformations and articulation model parameters based on screw theory, von Mises-Fisher distributions, and Stiefel manifolds.

Benchmarking Object

Fairness Guarantees under Demographic Shift

no code implementations ICLR 2022 Stephen Giguere, Blossom Metevier, Yuriy Brun, Philip S. Thomas, Scott Niekum, Bruno Castro da Silva

Recent studies have demonstrated that using machine learning for social applications can lead to injustice in the form of racist, sexist, and otherwise unfair and discriminatory outcomes.

Fairness

SOPE: Spectrum of Off-Policy Estimators

1 code implementation NeurIPS 2021 Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum

In this paper, we present a new perspective on this bias-variance trade-off and show the existence of a spectrum of estimators whose endpoints are SIS and IS.

Decision Making Off-policy evaluation
