Search Results for author: Mingzhang Yin

Found 17 papers, 8 papers with code

Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

no code implementations18 Oct 2023 Mingzhang Yin, Ruijiang Gao, Weiran Lin, Steven M. Shugan

Cross-pollinating the machine learning and experiment design, GBS is scalable to products with hundreds of attributes and can design personalized products for heterogeneous consumers.

Confounding-Robust Policy Improvement with Human-AI Teams

no code implementations13 Oct 2023 Ruijiang Gao, Mingzhang Yin

In addition, we propose a personalized deferral collaboration system to leverage the diverse expertise of different human decision-makers.

Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping

no code implementations12 Aug 2022 Russell Z. Kunes, Mingzhang Yin, Max Land, Doron Haviv, Dana Pe'er, Simon Tavaré

Gradient estimation is often necessary for fitting generative models with discrete latent variables, in contexts such as reinforcement learning and variational autoencoder (VAE) training.

Probabilistic Conformal Prediction Using Conditional Random Samples

1 code implementation14 Jun 2022 Zhendong Wang, Ruijiang Gao, Mingzhang Yin, Mingyuan Zhou, David M. Blei

This paper proposes probabilistic conformal prediction (PCP), a predictive inference algorithm that estimates a target variable by a discontinuous predictive set.

Conformal Prediction Prediction Intervals

Partial Identification with Noisy Covariates: A Robust Optimization Approach

no code implementations22 Feb 2022 Wenshuo Guo, Mingzhang Yin, Yixin Wang, Michael I. Jordan

Directly adjusting for these imperfect measurements of the covariates can lead to biased causal estimates.

Causal Inference

Probabilistic Best Subset Selection via Gradient-Based Optimization

1 code implementation11 Jun 2020 Mingzhang Yin, Nhat Ho, Bowei Yan, Xiaoning Qian, Mingyuan Zhou

This paper proposes a novel optimization method to solve the exact L0-regularized regression problem, which is also known as the best subset selection.


Pairwise Supervised Hashing with Bernoulli Variational Auto-Encoder and Self-Control Gradient Estimator

no code implementations21 May 2020 Siamak Zamani Dadaneh, Shahin Boluki, Mingzhang Yin, Mingyuan Zhou, Xiaoning Qian

Semantic hashing has become a crucial component of fast similarity search in many large-scale information retrieval systems, in particular, for text data.

Information Retrieval Retrieval

Discrete Action On-Policy Learning with Action-Value Critic

1 code implementation10 Feb 2020 Yuguang Yue, Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

Reinforcement learning (RL) in discrete action space is ubiquitous in real-world applications, but its complexity grows exponentially with the action-space dimension, making it challenging to apply existing on-policy gradient based deep RL algorithms efficiently.

OpenAI Gym Reinforcement Learning (RL)

Meta-Learning without Memorization

1 code implementation ICLR 2020 Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea Finn

If this is not done, the meta-learner can ignore the task training data and learn a single model that performs all of the meta-training tasks zero-shot, but does not adapt effectively to new image classes.

Few-Shot Image Classification Memorization +1

Semi-Implicit Generative Model

no code implementations29 May 2019 Mingzhang Yin, Mingyuan Zhou

To combine explicit and implicit generative models, we introduce semi-implicit generator (SIG) as a flexible hierarchical model that can be trained in the maximum likelihood framework.

ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables

1 code implementation4 May 2019 Mingzhang Yin, Yuguang Yue, Mingyuan Zhou

To address the challenge of backpropagating the gradient through categorical variables, we propose the augment-REINFORCE-swap-merge (ARSM) gradient estimator that is unbiased and has low variance.

Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy

no code implementations13 Mar 2019 Yunhao Tang, Mingzhang Yin, Mingyuan Zhou

Due to the high variance of policy gradients, on-policy optimization algorithms are plagued with low sample efficiency.

ARM: Augment-REINFORCE-Merge Gradient for Stochastic Binary Networks

1 code implementation ICLR 2019 Mingzhang Yin, Mingyuan Zhou

To backpropagate the gradients through stochastic binary layers, we propose the augment-REINFORCE-merge (ARM) estimator that is unbiased, exhibits low variance, and has low computational complexity.

Data Augmentation Variational Inference

Semi-Implicit Variational Inference

1 code implementation ICML 2018 Mingzhang Yin, Mingyuan Zhou

Semi-implicit variational inference (SIVI) is introduced to expand the commonly used analytic variational distribution family, by mixing the variational parameter with a flexible distribution.

Bayesian Inference Variational Inference

Convergence of Gradient EM on Multi-component Mixture of Gaussians

no code implementations NeurIPS 2017 Bowei Yan, Mingzhang Yin, Purnamrita Sarkar

In this paper, we study convergence properties of the gradient variant of Expectation-Maximization algorithm~\cite{lange1995gradient} for Gaussian Mixture Models for arbitrary number of clusters and mixing coefficients.

Learning Theory

Convergence Analysis of Gradient EM for Multi-component Gaussian Mixture

no code implementations23 May 2017 Bowei Yan, Mingzhang Yin, Purnamrita Sarkar

In this paper, we study convergence properties of the gradient Expectation-Maximization algorithm \cite{lange1995gradient} for Gaussian Mixture Models for general number of clusters and mixing coefficients.

Learning Theory

Cannot find the paper you are looking for? You can Submit a new open access paper.