Search Results for author: Sinong Geng

Found 11 papers, 1 papers with code

Improving Offline RL by Blending Heuristics

no code implementations1 Jun 2023 Sinong Geng, Aldo Pacchiano, Andrey Kolobov, Ching-An Cheng

We propose Heuristic Blending (HUBL), a simple performance-improving technique for a broad class of offline RL algorithms based on value bootstrapping.

D4RL Offline RL

A Data-Driven State Aggregation Approach for Dynamic Discrete Choice Models

no code implementations11 Apr 2023 Sinong Geng, Houssam Nassif, Carlos A. Manzanares

We use these estimated Q-functions, along with a clustering algorithm, to select a subset of states that are the most pivotal for driving changes in Q-functions.

Discrete Choice Models

Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions

1 code implementation15 Jul 2020 Sinong Geng, Houssam Nassif, Carlos A. Manzanares, A. Max Reppen, Ronnie Sircar

We name our method PQR, as it sequentially estimates the Policy, the $Q$-function, and the Reward function by deep learning.

reinforcement-learning Reinforcement Learning (RL)

Stochastic Learning for Sparse Discrete Markov Random Fields with Controlled Gradient Approximation Error

no code implementations12 May 2020 Sinong Geng, Zhaobin Kuang, Jie Liu, Stephen Wright, David Page

We study the $L_1$-regularized maximum likelihood estimator/estimation (MLE) problem for discrete Markov random fields (MRFs), where efficient and scalable learning requires both sparse regularization and approximate inference.

Temporal Poisson Square Root Graphical Models

no code implementations ICML 2018 Sinong Geng, Zhaobin Kuang, Peggy Peissig, David Page

We propose temporal Poisson square root graphical models (TPSQRs), a generalization of Poisson square root graphical models (PSQRs) specifically designed for modeling longitudinal event data.

Partially Linear Additive Gaussian Graphical Models

no code implementations8 Jun 2019 Sinong Geng, Minhao Yan, Mladen Kolar, Oluwasanmi Koyejo

We propose a partially linear additive Gaussian graphical model (PLA-GGM) for the estimation of associations between random variables distorted by observed confounders.

Joint Nonparametric Precision Matrix Estimation with Confounding

no code implementations16 Oct 2018 Sinong Geng, Mladen Kolar, Oluwasanmi Koyejo

Empirical results are presented using simulated and real brain imaging data, which suggest that our approach improves precision matrix estimation, as compared to baselines, when confounding is present.

An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation

no code implementations27 Feb 2017 Sinong Geng, Zhaobin Kuang, David Page

In this way, many insights and optimization procedures for sparse logistic regression can be applied to the learning of discrete Markov networks.

regression

Cannot find the paper you are looking for? You can Submit a new open access paper.