Search Results for author: Firdaus Janoos

Found 6 papers, 2 papers with code

Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO

2 code implementations • 25 May 2020 • Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry

We study the roots of algorithmic progress in deep policy gradient algorithms through a case study on two popular algorithms: Proximal Policy Optimization (PPO) and Trust Region Policy Optimization (TRPO).

Reinforcement Learning (RL)

Implementation Matters in Deep RL: A Case Study on PPO and TRPO

2 code implementations • ICLR 2020 • Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry

We study the roots of algorithmic progress in deep policy gradient algorithms through a case study on two popular algorithms, Proximal Policy Optimization and Trust Region Policy Optimization.

Reinforcement Learning (RL)

A Closer Look at Deep Policy Gradients

no code implementations • ICLR 2020 • Andrew Ilyas, Logan Engstrom, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry

We study how the behavior of deep policy gradient algorithms reflects the conceptual framework motivating their development.

Value prediction

Active Mean Fields for Probabilistic Image Segmentation: Connections with Chan-Vese and Rudin-Osher-Fatemi Models

no code implementations • 22 Jan 2015 • Marc Niethammer, Kilian M. Pohl, Firdaus Janoos, William M. Wells III

A specific implementation of that model is the Chan-Vese segmentation model (CV), in which the binary segmentation task is defined by a Gaussian likelihood and a prior regularizing the length of the segmentation boundary.

Image Denoising Image Segmentation +2

Multi-scale Graphical Models for Spatio-Temporal Processes

no code implementations • NeurIPS 2014 • Firdaus Janoos, Huseyin Denli, Niranjan Subrahmanya

Learning the dependency structure between spatially distributed observations of a spatio-temporal process is an important problem in many fields, such as geology, geophysics, atmospheric sciences, and oceanography.

Geophysics Time Series +1

Identification of Recurrent Patterns in the Activation of Brain Networks

no code implementations • NeurIPS 2012 • Firdaus Janoos, Weichang Li, Niranjan Subrahmanya, Istvan Morocz, William Wells

Identifying patterns from the neuroimaging recordings of brain activity related to the unobservable psychological or mental state of an individual can be treated as an unsupervised pattern recognition problem.

Clustering Time Series Analysis
