Search Results for author: Ann B. Lee

Found 18 papers, 12 papers with code

Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference

no code implementations8 Feb 2024 Luca Masserano, Alex Shen, Michele Doro, Tommaso Dorigo, Rafael Izbicki, Ann B. Lee

An open scientific challenge is how to classify events with reliable measures of uncertainty, when we have a mechanistic model of the data-generating process but the distribution over both labels and latent nuisance parameters is different between train and target data.

Domain Adaptation Uncertainty Quantification +1

Detecting Distributional Differences in Labeled Sequence Data with Application to Tropical Cyclone Satellite Imagery

1 code implementation4 Feb 2022 Trey McNeely, Galen Vincent, Kimberly M. Wood, Rafael Izbicki, Ann B. Lee

We prove that type I error control is guaranteed as long as the distribution of the label series is well-estimated, which is made easier by the extensive historical data for binary TC event labels.

Time Series Time Series Analysis

Identifying Distributional Differences in Convective Evolution Prior to Rapid Intensification in Tropical Cyclones

no code implementations24 Sep 2021 Trey McNeely, Galen Vincent, Rafael Izbicki, Kimberly M. Wood, Ann B. Lee

Tropical cyclone (TC) intensity forecasts are issued by human forecasters who evaluate spatio-temporal observations (e. g., satellite imagery) and model output (e. g., numerical weather prediction, statistical models) to produce forecasts every 6 hours.

Likelihood-Free Frequentist Inference: Bridging Classical Statistics and Machine Learning for Reliable Simulator-Based Inference

2 code implementations8 Jul 2021 Niccolò Dalmasso, Luca Masserano, David Zhao, Rafael Izbicki, Ann B. Lee

In this work, we propose a unified and modular inference framework that bridges classical statistics and modern machine learning providing (i) a practical approach to the Neyman construction of confidence sets with frequentist finite-sample coverage for any value of the unknown parameters; and (ii) interpretable diagnostics that estimate the empirical coverage across the entire parameter space.

Open-Ended Question Answering valid

Wildfire Smoke and Air Quality: How Machine Learning Can Guide Forest Management

no code implementations9 Oct 2020 Lorenzo Tomaselli, Coty Jen, Ann B. Lee

Prescribed burns are currently the most effective method of reducing the risk of widespread wildfires, but a largely missing component in forest management is knowing which fuels one can safely burn to minimize exposure to toxic smoke.

BIG-bench Machine Learning Clustering +1

HECT: High-Dimensional Ensemble Consistency Testing for Climate Models

no code implementations8 Oct 2020 Niccolò Dalmasso, Galen Vincent, Dorit Hammerling, Ann B. Lee

Climate models play a crucial role in understanding the effect of environmental and man-made changes on climate to help mitigate climate risks and inform governmental decisions.

Vocal Bursts Intensity Prediction

Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting

2 code implementations ICML 2020 Niccolò Dalmasso, Rafael Izbicki, Ann B. Lee

In this paper, we present $\texttt{ACORE}$ (Approximate Computation via Odds Ratio Estimation), a frequentist approach to LFI that first formulates the classical likelihood ratio test (LRT) as a parametrized classification problem, and then uses the equivalence of tests and confidence sets to build confidence regions for parameters of interest.

Two-sample testing

Conditional Density Estimation Tools in Python and R with Applications to Photometric Redshifts and Likelihood-Free Cosmological Inference

5 code implementations30 Aug 2019 Niccolò Dalmasso, Taylor Pospisil, Ann B. Lee, Rafael Izbicki, Peter E. Freeman, Alex I. Malz

We provide sample code in $\texttt{Python}$ and $\texttt{R}$ as well as examples of applications to photometric redshift estimation and likelihood-free cosmological inference via CDE.

Astronomy Density Estimation +2

(f)RFCDE: Random Forests for Conditional Density Estimation and Functional Data

1 code implementation17 Jun 2019 Taylor Pospisil, Ann B. Lee

Furthermore, in settings with heteroskedasticity or multimodality, a regression point estimate with standard errors do not fully capture the uncertainty in our predictions.

Computation Methodology

Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations

1 code implementation27 May 2019 Niccolò Dalmasso, Ann B. Lee, Rafael Izbicki, Taylor Pospisil, Ilmun Kim, Chieh-An Lin

At the heart of our approach is a two-sample test that quantifies the quality of the fit at fixed parameter values, and a global test that assesses goodness-of-fit across simulation parameters.

ABC-CDE: Towards Approximate Bayesian Computation with Complex High-Dimensional Data and Limited Simulations

1 code implementation14 May 2018 Rafael Izbicki, Ann B. Lee, Taylor Pospisil

Approximate Bayesian Computation (ABC) is typically used when the likelihood is either unavailable or intractable but where data can be simulated under different parameter settings using a forward model.

Density Estimation

RFCDE: Random Forests for Conditional Density Estimation

1 code implementation16 Apr 2018 Taylor Pospisil, Ann B. Lee

Random forests is a common non-parametric regression technique which performs well for mixed-type data and irrelevant covariates, while being robust to monotonic variable transformations.

Density Estimation General Classification +1

Converting High-Dimensional Regression to High-Dimensional Conditional Density Estimation

1 code implementation26 Apr 2017 Rafael Izbicki, Ann B. Lee

There is a growing demand for nonparametric conditional density estimators (CDEs) in fields such as astronomy and economics.

Astronomy Density Estimation +2

A Spectral Series Approach to High-Dimensional Nonparametric Regression

no code implementations1 Feb 2016 Ann B. Lee, Rafael Izbicki

We expand the unknown regression on the data in terms of the eigenfunctions of a kernel-based operator, and we take advantage of orthogonality of the basis with respect to the underlying data distribution, P, to speed up computations and tuning of parameters.

regression Vocal Bursts Intensity Prediction

Treelets--An adaptive multi-scale basis for sparse unordered data

3 code implementations3 Jul 2007 Ann B. Lee, Boaz Nadler, Larry Wasserman

In many modern applications, including analysis of gene expression and text documents, the data are noisy, high-dimensional, and unordered--with no particular meaning to the given order of the variables.

Methodology

Cannot find the paper you are looking for? You can Submit a new open access paper.