Search Results for author: XuanLong Nguyen

Found 31 papers, 8 papers with code

Dendrogram of mixing measures: Hierarchical clustering and model selection for finite mixture models

no code implementations4 Mar 2024 Dat Do, Linh Do, Scott A. McKinley, Jonathan Terhorst, XuanLong Nguyen

The dendrogram's construction is derived from the theory of convergence of the mixing measures, and as a result, we can both consistently select the true number of mixing components and obtain the pointwise optimal convergence rate for parameter estimation from the tree, even when the model parameters are only weakly identifiable.

Clustering Model Selection

Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics

no code implementations4 Feb 2023 Jiacheng Zhu, JieLin Qiu, Aritra Guha, Zhuolin Yang, XuanLong Nguyen, Bo Li, Ding Zhao

Our work provides a new perspective of model robustness through the lens of Wasserstein geodesic-based interpolation with a practical off-the-shelf strategy that can be combined with existing robust training methods.

Data Augmentation

Strong identifiability and parameter learning in regression with heterogeneous response

no code implementations8 Dec 2022 Dat Do, Linh Do, XuanLong Nguyen

We provide simulation studies and data illustrations, which shed some light on the parameter learning behavior found in several popular regression mixture models reported in the literature.

regression

PhysioMTL: Personalizing Physiological Patterns using Optimal Transport Multi-Task Regression

1 code implementation19 Mar 2022 Jiacheng Zhu, Gregory Darnell, Agni Kumar, Ding Zhao, Bo Li, XuanLong Nguyen, Shirley You Ren

The proposed method learns an individual-specific predictive model from heterogeneous observations, and enables estimation of an optimal transport map that yields a push forward operation onto the demographic features for each task.

counterfactual Heart Rate Variability +1

Beyond Black Box Densities: Parameter Learning for the Deviated Components

no code implementations5 Feb 2022 Dat Do, Nhat Ho, XuanLong Nguyen

As we collect additional samples from a data population for which a known density function estimate may have been previously obtained by a black box method, the increased complexity of the data set may result in the true density being deviated from the known estimate by a mixture distribution.

Scalable nonparametric Bayesian learning for heterogeneous and dynamic velocity fields

no code implementations15 Feb 2021 Sunrit Chakraborty, Aritra Guha, Rayleigh Lei, XuanLong Nguyen

Analysis of heterogeneous patterns in complex spatio-temporal data finds usage across various domains in applied science and engineering, including training autonomous vehicles to navigate in complex traffic scenarios.

Autonomous Vehicles Navigate

Functional optimal transport: map estimation and domain adaptation for functional data

1 code implementation7 Feb 2021 Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao

We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator mapping a Hilbert space of functions to another.

Domain Adaptation Transfer Learning

Robust Unsupervised Learning of Temporal Dynamic Interactions

no code implementations18 Jun 2020 Aritra Guha, Rayleigh Lei, Jiacheng Zhu, XuanLong Nguyen, Ding Zhao

These distance metrics can serve as an objective for assessing the stability of an interaction learning algorithm.

Representation Learning

Rk-means: Fast Clustering for Relational Data

no code implementations11 Oct 2019 Ryan Curtin, Ben Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

When the data matrix needs to be obtained from a relational database via a feature extraction query, the computation cost can be prohibitive, as the data matrix may be (much) larger than the total input relation size.

Clustering

On Efficient Multilevel Clustering via Wasserstein Distances

1 code implementation19 Sep 2019 Viet Huynh, Nhat Ho, Nhan Dam, XuanLong Nguyen, Mikhail Yurochkin, Hung Bui, and Dinh Phung

We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data.

Clustering

Dirichlet Simplex Nest and Geometric Inference

1 code implementation27 May 2019 Mikhail Yurochkin, Aritra Guha, Yuekai Sun, XuanLong Nguyen

We propose Dirichlet Simplex Nest, a class of probabilistic models suitable for a variety of data types, and develop fast and provably accurate inference algorithms by accounting for the model's convex geometry and low dimensional simplicial structure.

Scalable inference of topic evolution via models for latent geometric structures

1 code implementation NeurIPS 2019 Mikhail Yurochkin, Zhiwei Fan, Aritra Guha, Paraschos Koutris, XuanLong Nguyen

We develop new models and algorithms for learning the temporal dynamics of the topic polytopes and related geometric objects that arise in topic model based inference.

UPS: optimizing Undirected Positive Sparse graph for neural graph filtering

no code implementations ICLR 2018 Mikhail Yurochkin, Dung Thai, Hung Hai Bui, XuanLong Nguyen

In this work we propose a novel approach for learning graph representation of the data using gradients obtained via backpropagation.

Multi-way Interacting Regression via Factorization Machines

1 code implementation NeurIPS 2017 Mikhail Yurochkin, XuanLong Nguyen, Nikolaos Vasiloglou

We propose a Bayesian regression method that accounts for multi-way interactions of arbitrary orders among the predictor variables.

regression

Multilevel Clustering via Wasserstein Means

1 code implementation ICML 2017 Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung

We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data.

Clustering

Geometric Dirichlet Means algorithm for topic inference

no code implementations NeurIPS 2016 Mikhail Yurochkin, XuanLong Nguyen

We propose a geometric algorithm for topic learning and inference that is built on the convex geometry of topics arising from the Latent Dirichlet Allocation (LDA) model and its nonparametric extensions.

Clustering Variational Inference

Singularity structures and impacts on parameter estimation in finite mixtures of distributions

no code implementations9 Sep 2016 Nhat Ho, XuanLong Nguyen

Our study makes explicit the deep links between model singularities, parameter estimation convergence rates and minimax lower bounds, and the algebraic geometry of the parameter space for mixtures of continuous distributions.

Optimal change point detection in Gaussian processes

no code implementations3 Jun 2015 Hossein Keshavarz, Clayton Scott, XuanLong Nguyen

By contrast, the standard CUSUM method, which does not account for the covariance structure, is shown to be asymptotically optimal only in the increasing domain.

Change Point Detection Gaussian Processes +2

Nonlinear Model Predictive Control of A Gasoline HCCI Engine Using Extreme Learning Machines

no code implementations16 Jan 2015 Vijay Manikandan Janakiraman, XuanLong Nguyen, Dennis Assanis

Using the ELM engine models, an MPC based control algorithm with a simplified quadratic program update is derived for real time implementation.

Model Predictive Control

Stochastic Gradient Based Extreme Learning Machines For Online Learning of Advanced Combustion Engines

no code implementations16 Jan 2015 Vijay Manikandan Janakiraman, XuanLong Nguyen, Dennis Assanis

The algorithm is applied to two case studies: An online regression learning for system identification of a Homogeneous Charge Compression Ignition (HCCI) Engine and an online classification learning (with class imbalance) for identifying the dynamic operating envelope of the HCCI Engine.

Identifiability and optimal rates of convergence for parameters of multiple types in finite mixtures

no code implementations11 Jan 2015 Nhat Ho, XuanLong Nguyen

This paper studies identifiability and convergence behaviors for parameters of multiple types in finite mixtures, and the effects of model fitting with extra mixing components.

Parallel Feature Selection Inspired by Group Testing

no code implementations NeurIPS 2014 Yingbo Zhou, Utkarsh Porwal, Ce Zhang, Hung Q. Ngo, XuanLong Nguyen, Christopher Ré, Venu Govindaraju

Superior performance of our method is demonstrated on a challenging relation extraction task from a very large data set that have both redundant features and sample size in the order of millions.

feature selection General Classification +1

Bayesian Nonparametric Multilevel Clustering with Group-Level Contexts

no code implementations9 Jan 2014 Vu Nguyen, Dinh Phung, XuanLong Nguyen, Svetha Venkatesh, Hung Hai Bui

We present a Bayesian nonparametric framework for multilevel clustering which utilizes group-level context information to simultaneously discover low-dimensional structures of the group contents and partitions groups into clusters.

Clustering

Bayesian inference as iterated random functions with applications to sequential inference in graphical models

no code implementations NeurIPS 2013 Arash A. Amini, XuanLong Nguyen

We propose a general formalism of iterated random functions with semigroup property, under which exact and approximate Bayesian posterior updates can be viewed as specific instances.

Bayesian Inference Change Point Detection

Modeling The Stable Operating Envelope For Partially Stable Combustion Engines Using Class Imbalance Learning

no code implementations24 Jun 2013 Vijay Manikandan Janakiraman, XuanLong Nguyen, Jeff Sterniak, Dennis Assanis

In this paper, a machine learning based approach is employed to identify the stable operating boundary of HCCI combustion directly from experimental data.

Borrowing strengh in hierarchical Bayes: Posterior concentration of the Dirichlet base measure

no code implementations4 Jan 2013 XuanLong Nguyen

This paper studies posterior concentration behavior of the base probability measure of a Dirichlet measure, given observations associated with the sampled Dirichlet processes, as the number of observations tends to infinity.

Posterior contraction of the population polytope in finite admixture models

no code implementations1 Jun 2012 XuanLong Nguyen

We study the posterior contraction behavior of the latent population structure that arises in admixture models as the amount of data increases.

Topic Models

Convergence of latent mixing measures in finite and infinite mixture models

no code implementations15 Sep 2011 XuanLong Nguyen

This paper studies convergence behavior of latent mixing measures that arise in finite and infinite mixture models, using transportation distances (i. e., Wasserstein metrics).

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.