no code implementations • 4 Mar 2024 • Dat Do, Linh Do, Scott A. McKinley, Jonathan Terhorst, XuanLong Nguyen
The dendrogram's construction is derived from the theory of convergence of the mixing measures, and as a result, we can both consistently select the true number of mixing components and obtain the pointwise optimal convergence rate for parameter estimation from the tree, even when the model parameters are only weakly identifiable.
no code implementations • 4 Feb 2023 • Jiacheng Zhu, JieLin Qiu, Aritra Guha, Zhuolin Yang, XuanLong Nguyen, Bo Li, Ding Zhao
Our work provides a new perspective of model robustness through the lens of Wasserstein geodesic-based interpolation with a practical off-the-shelf strategy that can be combined with existing robust training methods.
no code implementations • 8 Dec 2022 • Dat Do, Linh Do, XuanLong Nguyen
We provide simulation studies and data illustrations, which shed some light on the parameter learning behavior found in several popular regression mixture models reported in the literature.
1 code implementation • 19 Mar 2022 • Jiacheng Zhu, Gregory Darnell, Agni Kumar, Ding Zhao, Bo Li, XuanLong Nguyen, Shirley You Ren
The proposed method learns an individual-specific predictive model from heterogeneous observations, and enables estimation of an optimal transport map that yields a push forward operation onto the demographic features for each task.
no code implementations • 5 Feb 2022 • Dat Do, Nhat Ho, XuanLong Nguyen
As we collect additional samples from a data population for which a known density function estimate may have been previously obtained by a black box method, the increased complexity of the data set may result in the true density being deviated from the known estimate by a mixture distribution.
no code implementations • 15 Feb 2021 • Sunrit Chakraborty, Aritra Guha, Rayleigh Lei, XuanLong Nguyen
Analysis of heterogeneous patterns in complex spatio-temporal data finds usage across various domains in applied science and engineering, including training autonomous vehicles to navigate in complex traffic scenarios.
1 code implementation • 7 Feb 2021 • Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao
We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator mapping a Hilbert space of functions to another.
no code implementations • 18 Jun 2020 • Aritra Guha, Rayleigh Lei, Jiacheng Zhu, XuanLong Nguyen, Ding Zhao
These distance metrics can serve as an objective for assessing the stability of an interaction learning algorithm.
no code implementations • 11 Oct 2019 • Ryan Curtin, Ben Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich
When the data matrix needs to be obtained from a relational database via a feature extraction query, the computation cost can be prohibitive, as the data matrix may be (much) larger than the total input relation size.
1 code implementation • 19 Sep 2019 • Viet Huynh, Nhat Ho, Nhan Dam, XuanLong Nguyen, Mikhail Yurochkin, Hung Bui, and Dinh Phung
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data.
1 code implementation • 27 May 2019 • Mikhail Yurochkin, Aritra Guha, Yuekai Sun, XuanLong Nguyen
We propose Dirichlet Simplex Nest, a class of probabilistic models suitable for a variety of data types, and develop fast and provably accurate inference algorithms by accounting for the model's convex geometry and low dimensional simplicial structure.
no code implementations • 22 Dec 2018 • Mahmoud Abo Khamis, Ryan R. Curtin, Benjamin Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich
This new width is sandwiched between the submodular and the fractional hypertree widths.
1 code implementation • NeurIPS 2019 • Mikhail Yurochkin, Zhiwei Fan, Aritra Guha, Paraschos Koutris, XuanLong Nguyen
We develop new models and algorithms for learning the temporal dynamics of the topic polytopes and related geometric objects that arise in topic model based inference.
no code implementations • ICLR 2018 • Mikhail Yurochkin, Dung Thai, Hung Hai Bui, XuanLong Nguyen
In this work we propose a novel approach for learning graph representation of the data using gradients obtained via backpropagation.
1 code implementation • NeurIPS 2017 • Mikhail Yurochkin, Aritra Guha, XuanLong Nguyen
We propose new algorithms for topic modeling when the number of topics is unknown.
1 code implementation • NeurIPS 2017 • Mikhail Yurochkin, XuanLong Nguyen, Nikolaos Vasiloglou
We propose a Bayesian regression method that accounts for multi-way interactions of arbitrary orders among the predictor variables.
1 code implementation • ICML 2017 • Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data.
no code implementations • NeurIPS 2016 • Mikhail Yurochkin, XuanLong Nguyen
We propose a geometric algorithm for topic learning and inference that is built on the convex geometry of topics arising from the Latent Dirichlet Allocation (LDA) model and its nonparametric extensions.
no code implementations • 9 Sep 2016 • Nhat Ho, XuanLong Nguyen
Our study makes explicit the deep links between model singularities, parameter estimation convergence rates and minimax lower bounds, and the algebraic geometry of the parameter space for mixtures of continuous distributions.
no code implementations • 15 Jan 2016 • Hossein Keshavarz, Clayton Scott, XuanLong Nguyen
Gaussian random fields are a powerful tool for modeling environmental processes.
no code implementations • 3 Jun 2015 • Hossein Keshavarz, Clayton Scott, XuanLong Nguyen
By contrast, the standard CUSUM method, which does not account for the covariance structure, is shown to be asymptotically optimal only in the increasing domain.
no code implementations • 16 Jan 2015 • Vijay Manikandan Janakiraman, XuanLong Nguyen, Dennis Assanis
Using the ELM engine models, an MPC based control algorithm with a simplified quadratic program update is derived for real time implementation.
no code implementations • 16 Jan 2015 • Vijay Manikandan Janakiraman, XuanLong Nguyen, Dennis Assanis
The algorithm is applied to two case studies: An online regression learning for system identification of a Homogeneous Charge Compression Ignition (HCCI) Engine and an online classification learning (with class imbalance) for identifying the dynamic operating envelope of the HCCI Engine.
no code implementations • 11 Jan 2015 • Nhat Ho, XuanLong Nguyen
This paper studies identifiability and convergence behaviors for parameters of multiple types in finite mixtures, and the effects of model fitting with extra mixing components.
no code implementations • NeurIPS 2014 • Yingbo Zhou, Utkarsh Porwal, Ce Zhang, Hung Q. Ngo, XuanLong Nguyen, Christopher Ré, Venu Govindaraju
Superior performance of our method is demonstrated on a challenging relation extraction task from a very large data set that have both redundant features and sample size in the order of millions.
no code implementations • 9 Jan 2014 • Vu Nguyen, Dinh Phung, XuanLong Nguyen, Svetha Venkatesh, Hung Hai Bui
We present a Bayesian nonparametric framework for multilevel clustering which utilizes group-level context information to simultaneously discover low-dimensional structures of the group contents and partitions groups into clusters.
no code implementations • NeurIPS 2013 • Arash A. Amini, XuanLong Nguyen
We propose a general formalism of iterated random functions with semigroup property, under which exact and approximate Bayesian posterior updates can be viewed as specific instances.
no code implementations • 24 Jun 2013 • Vijay Manikandan Janakiraman, XuanLong Nguyen, Jeff Sterniak, Dennis Assanis
In this paper, a machine learning based approach is employed to identify the stable operating boundary of HCCI combustion directly from experimental data.
no code implementations • 4 Jan 2013 • XuanLong Nguyen
This paper studies posterior concentration behavior of the base probability measure of a Dirichlet measure, given observations associated with the sampled Dirichlet processes, as the number of observations tends to infinity.
no code implementations • 1 Jun 2012 • XuanLong Nguyen
We study the posterior contraction behavior of the latent population structure that arises in admixture models as the amount of data increases.
no code implementations • 15 Sep 2011 • XuanLong Nguyen
This paper studies convergence behavior of latent mixing measures that arise in finite and infinite mixture models, using transportation distances (i. e., Wasserstein metrics).