Search Results for author: Shuchin Aeron

Found 56 papers, 19 papers with code

Estimation of entropy-regularized optimal transport maps between non-compactly supported measures

1 code implementation • 20 Nov 2023 • Matthew Werenski, James M. Murphy, Shuchin Aeron

In the case that the target measure is compactly supported or strongly log-concave, we show that for a recently proposed in-sample estimator, the expected squared $L^2$-error decays at least as fast as $O(n^{-1/3})$ where $n$ is the sample size.

Paper
Code

On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

no code implementations • 9 Nov 2023 • Ruijie Jiang, Thuan Nguyen, Shuchin Aeron, Prakash Ishwar

For a widely-studied data model and general loss and sample-hardening functions we prove that the Supervised Contrastive Learning (SCL), Hard-SCL (HSCL), and Unsupervised Contrastive Learning (UCL) risks are minimized by representations that exhibit Neural Collapse (NC), i. e., the class means form an Equianglular Tight Frame (ETF) and data from the same class are mapped to the same representation.

Contrastive Learning

Paper
Add Code

Systematic comparison of semi-supervised and self-supervised learning for medical image classification

1 code implementation • 18 Jul 2023 • Zhe Huang, Ruijie Jiang, Shuchin Aeron, Michael C. Hughes

Yet past benchmarks do not focus on medical tasks and rarely compare self- and semi- methods together on an equal footing.

Image Classification Medical Image Classification +1

Paper
Code

A principled approach to model validation in domain generalization

1 code implementation • 2 Apr 2023 • Boyang Lyu, Thuan Nguyen, Matthias Scheutz, Prakash Ishwar, Shuchin Aeron

Domain generalization aims to learn a model with good generalization ability, that is, the learned model should not only perform well on several seen domains but also on unseen domains with different data distributions.

Classification Domain Generalization +1

Paper
Code

On Rank Energy Statistics via Optimal Transport: Continuity, Convergence, and Change Point Detection

no code implementations • 15 Feb 2023 • Matthew Werenski, Shoaib Bin Masud, James M. Murphy, Shuchin Aeron

This paper considers the use of recently proposed optimal transport-based multivariate test statistics, namely rank energy and its variant the soft rank energy derived from entropically regularized optimal transport, for the unsupervised nonparametric change point detection (CPD) problem.

Change Point Detection

Paper
Add Code

Alternating minimization algorithm with initialization analysis for r-local and k-sparse unlabeled sensing

no code implementations • 14 Nov 2022 • Ahmed Abbasi, Abiy Tasissa, Shuchin Aeron

The unlabeled sensing problem is to recover an unknown signal from permuted linear measurements.

Paper
Add Code

Trade-off between reconstruction loss and feature alignment for domain generalization

1 code implementation • 26 Oct 2022 • Thuan Nguyen, Boyang Lyu, Prakash Ishwar, Matthias Scheutz, Shuchin Aeron

To deal with challenging settings in DG where both data and label of the unseen domain are not available at training time, the most common approach is to design the classifiers based on the domain-invariant representation features, i. e., the latent representations that are unchanged and transferable between domains.

Domain Generalization Transfer Learning

Paper
Code

Geometric Sparse Coding in Wasserstein Space

no code implementations • 21 Oct 2022 • Marshall Mueller, Shuchin Aeron, James M. Murphy, Abiy Tasissa

We show this approach leads to sparse representations in Wasserstein space and addresses the problem of non-uniqueness of barycentric representation.

Dictionary Learning

Paper
Add Code

Nonparametric and Regularized Dynamical Wasserstein Barycenters for Sequential Observations

no code implementations • 4 Oct 2022 • Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller

We consider probabilistic models for sequential observations which exhibit gradual transitions among a finite number of states.

Time Series Time Series Analysis

Paper
Add Code

Supervised Contrastive Learning with Hard Negative Samples

1 code implementation • 31 Aug 2022 • Ruijie Jiang, Thuan Nguyen, Prakash Ishwar, Shuchin Aeron

In this paper, motivated by the effectiveness of hard-negative sampling strategies in H-UCL and the usefulness of label information in SCL, we propose a contrastive learning framework called hard-negative supervised contrastive learning (H-SCL).

Contrastive Learning Self-Supervised Learning

Paper
Code

Joint covariate-alignment and concept-alignment: a framework for domain generalization

1 code implementation • 1 Aug 2022 • Thuan Nguyen, Boyang Lyu, Prakash Ishwar, Matthias Scheutz, Shuchin Aeron

Particularly, our framework proposes to jointly minimize both the covariate-shift as well as the concept-shift between the seen domains for a better performance on the unseen domain.

Concept Alignment Domain Generalization

Paper
Code

Easy Variational Inference for Categorical Models via an Independent Binary Approximation

1 code implementation • 31 May 2022 • Michael T. Wojnowicz, Shuchin Aeron, Eric L. Miller, Michael C. Hughes

This approximation makes inference straightforward and fast; using well-known auxiliary variables for probit or logistic regression, the product of binary models admits conjugate closed-form variational inference that is embarrassingly parallel across categories and invariant to category ordering.

Variational Inference

Paper
Code

Measure Estimation in the Barycentric Coding Model

1 code implementation • 28 Jan 2022 • Matthew Werenski, Ruijie Jiang, Abiy Tasissa, Shuchin Aeron, James M. Murphy

Our first main result leverages the Riemannian geometry of Wasserstein-2 space to provide a procedure for recovering the barycentric coordinates as the solution to a quadratic optimization problem assuming access to the true reference measures.

Paper
Code

Conditional entropy minimization principle for learning domain invariant representation features

2 code implementations • 25 Jan 2022 • Thuan Nguyen, Boyang Lyu, Prakash Ishwar, Matthias Scheutz, Shuchin Aeron

Invariance-principle-based methods such as Invariant Risk Minimization (IRM), have recently emerged as promising approaches for Domain Generalization (DG).

Domain Generalization

Paper
Code

Hard Negative Sampling via Regularized Optimal Transport for Contrastive Representation Learning

2 code implementations • 4 Nov 2021 • Ruijie Jiang, Prakash Ishwar, Shuchin Aeron

We study the problem of designing hard negative sampling distributions for unsupervised contrastive representation learning.

Contrastive Learning Representation Learning

Paper
Code

Interpretable contrastive word mover's embedding

1 code implementation • 1 Nov 2021 • Ruijie Jiang, Julia Gouvea, Eric Miller, David Hammer, Shuchin Aeron

This paper shows that a popular approach to the supervised embedding of documents for classification, namely, contrastive Word Mover's Embedding, can be significantly enhanced by adding interpretability.

Paper
Code

Multivariate rank via entropic optimal transport: sample efficiency and generative modeling

1 code implementation • 29 Oct 2021 • Shoaib Bin Masud, Matthew Werenski, James M. Murphy, Shuchin Aeron

We leverage this result to demonstrate fast convergence of sample sRE and sRMMD to their population version making them useful for high-dimensional GoF testing.

feature selection Image Generation +1

Paper
Code

r-local sensing: Improved algorithm and applications

2 code implementations • 26 Oct 2021 • Ahmed Ali Abbasi, Abiy Tasissa, Shuchin Aeron

The unlabeled sensing problem is to solve a noisy linear system of equations under unknown permutation of the measurements.

Paper
Code

Dynamical Wasserstein Barycenters for Time-series Modeling

1 code implementation • NeurIPS 2021 • Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller

We propose a dynamical Wasserstein barycentric (DWB) model that estimates the system state over time as well as the data-generating distributions of pure states in an unsupervised manner.

Time Series Time Series Analysis

Paper
Code

Barycentric-alignment and reconstruction loss minimization for domain generalization

1 code implementation • 4 Sep 2021 • Boyang Lyu, Thuan Nguyen, Prakash Ishwar, Matthias Scheutz, Shuchin Aeron

To bridge this gap between theory and practice, we introduce a new upper bound that is free of terms having such dual dependence, resulting in a fully optimizable risk upper bound for the unseen domain.

Domain Generalization Representation Learning

Paper
Code

Soft and subspace robust multivariate rank tests based on entropy regularized optimal transport

1 code implementation • 16 Mar 2021 • Shoaib Bin Masud, Boyang Lyu, Shuchin Aeron

In this paper, we extend the recently proposed multivariate rank energy distance, based on the theory of optimal transport, for statistical testing of distributional similarity, to soft rank energy distance.

Change Point Detection Time Series +1

Paper
Code

Multiview Sensing With Unknown Permutations: An Optimal Transport Approach

no code implementations • 12 Mar 2021 • Yanting Ma, Petros T. Boufounos, Hassan Mansour, Shuchin Aeron

In several applications, including imaging of deformable objects while in motion, simultaneous localization and mapping, and unlabeled sensing, we encounter the problem of recovering a signal that is measured subject to unknown permutations.

Simultaneous Localization and Mapping

Paper
Add Code

Automatic coding of students' writing via Contrastive Representation Learning in the Wasserstein space

no code implementations • 26 Nov 2020 • Ruijie Jiang, Julia Gouvea, David Hammer, Eric Miller, Shuchin Aeron

This work is a step towards building a statistical machine learning (ML) method for achieving an automated support for qualitative analyses of students' writing, here specifically in score laboratory reports in introductory biology for sophistication of argumentation and reasoning.

BIG-bench Machine Learning Contrastive Learning +4

Paper
Add Code

Robust Machine Learning via Privacy/Rate-Distortion Theory

no code implementations • 22 Jul 2020 • Ye Wang, Shuchin Aeron, Adnan Siraj Rakin, Toshiaki Koike-Akino, Pierre Moulin

Robust machine learning formulations have emerged to address the prevalent vulnerability of deep neural networks to adversarial examples.

BIG-bench Machine Learning

Paper
Add Code

Representation Learning via Adversarially-Contrastive Optimal Transport

no code implementations • ICML 2020 • Anoop Cherian, Shuchin Aeron

To maximize extraction of such informative cues from the data, we set the problem within the context of contrastive representation learning and to that end propose a novel objective via optimal transport.

Action Recognition Contrastive Learning +3

Paper
Add Code

Domain Adaptation for Robust Workload Level Alignment Between Sessions and Subjects using fNIRS

no code implementations • 2 Jul 2020 • Boyang Lyu, Thao Pham, Giles Blaney, Zachary Haga, Angelo Sassaroli, Sergio Fantini, Shuchin Aeron

Results: In a sample of six subjects, G-W resulted in an alignment accuracy of 68 $\pm$ 4 % (weighted mean $\pm$ standard error) for session-by-session alignment, FG-W resulted in an alignment accuracy of 55 $\pm$ 2 % for subject-by-subject alignment.

Domain Adaptation

Paper
Add Code

On Matched Filtering for Statistical Change Point Detection

no code implementations • 9 Jun 2020 • Kevin C. Cheng, Eric L. Miller, Michael C. Hughes, Shuchin Aeron

Non-parametric and distribution-free two-sample tests have been the foundation of many change point detection algorithms.

Activity Recognition Change Point Detection

Paper
Add Code

R-local unlabeled sensing: A novel graph matching approach for multiview unlabeled sensing under local permutations

1 code implementation • 14 Nov 2019 • Ahmed Abbasi, Abiy Tasissa, Shuchin Aeron

Unlabeled sensing is a linear inverse problem where the measurements are scrambled under an unknown permutation leading to loss of correspondence between the measurements and the rows of the sensing matrix.

Graph Matching

Paper
Code

Optimal Transport Based Change Point Detection and Time Series Segment Clustering

no code implementations • 4 Nov 2019 • Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Erika Hussey, Eric L. Miller

Two common problems in time series analysis are the decomposition of the data stream into disjoint segments that are each in some sense "homogeneous" - a problem known as Change Point Detection (CPD) - and the grouping of similar nonadjacent segments, a problem that we call Time Series Segment Clustering (TSSC).

Change Point Detection Clustering +2

Paper
Add Code

On the modes of convergence of Stochastic Optimistic Mirror Descent (OMD) for saddle point problems

no code implementations • 2 Aug 2019 • Yanting Ma, Shuchin Aeron, Hassan Mansour

In this article, we study the convergence of Mirror Descent (MD) and Optimistic Mirror Descent (OMD) for saddle point problems satisfying the notion of coherence as proposed in Mertikopoulos et al. We prove convergence of OMD with exact gradients for coherent saddle point problems, and show that monotone convergence only occurs after some sufficiently large number of iterations.

Paper
Add Code

Multi-View Graph Embedding Using Randomized Shortest Paths

no code implementations • 20 Aug 2018 • Anuththari Gamage, Brian Rappaport, Shuchin Aeron, Xiaozhe Hu

This data is well represented by multi-view graphs, which consist of several distinct sets of edges over the same nodes.

Clustering Graph Embedding

Paper
Add Code

Principal Component Analysis with Tensor Train Subspace

no code implementations • 13 Mar 2018 • Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Tensor train is a hierarchical tensor network structure that helps alleviate the curse of dimensionality by parameterizing large-scale multidimensional data via a set of network of low-rank tensors.

Paper
Add Code

LEARNING SEMANTIC WORD RESPRESENTATIONS VIA TENSOR FACTORIZATION

no code implementations • ICLR 2018 • Eric Bailey, Charles Meyer, Shuchin Aeron

We present two new word embedding techniques based on tensor factorization and show that they outperform common methods on several semantic NLP tasks when given the same data.

Outlier Detection

Paper
Add Code

Tensor Train Neighborhood Preserving Embedding

no code implementations • 3 Dec 2017 • Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

In this paper, we propose a Tensor Train Neighborhood Preserving Embedding (TTNPE) to embed multi-dimensional tensor data into low dimensional tensor subspace.

Classification Dimensionality Reduction +1

Paper
Add Code

Faster Clustering via Non-Backtracking Random Walks

no code implementations • 26 Aug 2017 • Brian Rappaport, Anuththari Gamage, Shuchin Aeron

VEC employs a novel application of the state-of-the-art word2vec model to embed a graph in Euclidean space via random walks on the nodes of the graph.

Clustering Graph Clustering

Paper
Add Code

Efficient Low Rank Tensor Ring Completion

no code implementations • ICCV 2017 • Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of the recently proposed tensor ring decompositions, in this paper we propose a tensor completion algorithm, which is an alternating minimization algorithm that alternates over the factors in the MPS representation.

Matrix Completion

Paper
Add Code

Sample, computation vs storage tradeoffs for classification using tensor subspace models

no code implementations • 18 Jun 2017 • Mohammadhossein Chaghazardi, Shuchin Aeron

Our main tool is the use of tensor subspaces, i. e. subspaces with a Kronecker structure, for embedding the data into lower dimensions.

General Classification

Paper
Add Code

Word Embeddings via Tensor Factorization

1 code implementation • 10 Apr 2017 • Eric Bailey, Shuchin Aeron

We show that embeddings based on tensor factorization can be used to discern the various meanings of polysemous words without being explicitly trained to do so, and motivate the intuition behind why this works in a way that doesn't with existing methods.

Outlier Detection Word Embeddings

Paper
Code

Unsupervised clustering under the Union of Polyhedral Cones (UOPC) model

no code implementations • 15 Oct 2016 • Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Similar to the Union of Subspaces (UOS) model where each data from each subspace is generated from a (unknown) basis, in the UOPC model each data from each cone is assumed to be generated from a finite number of (unknown) \emph{extreme rays}. To cluster data under this model, we consider several algorithms - (a) Sparse Subspace Clustering by Non-negative constraints Lasso (NCL), (b) Least squares approximation (LSA), and (c) K-nearest neighbor (KNN) algorithm to arrive at affinity between data points.

Clustering

Paper
Add Code

Low-tubal-rank Tensor Completion using Alternating Minimization

no code implementations • 5 Oct 2016 • Xiao-Yang Liu, Shuchin Aeron, Vaneet Aggarwal, Xiaodong Wang

The low-tubal-rank tensor model has been recently proposed for real-world multidimensional data.

2k Low-Rank Matrix Completion

Paper
Add Code

Algorithms for item categorization based on ordinal ranking data

no code implementations • 29 Sep 2016 • Josh Girson, Shuchin Aeron

In this context we modify an existing algorithm - namely the label propagation algorithm to a variant that uses the distance between the nodes for weighting the label propagation - to identify the categories.

Community Detection Stochastic Block Model +1

Paper
Add Code

Tensor Completion by Alternating Minimization under the Tensor Train (TT) Model

no code implementations • 19 Sep 2016 • Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of tensor train decompositions, in this paper we propose a tensor completion algorithm which alternates over the matrices (tensors) in the MPS representation.

Matrix Completion

Paper
Add Code

On Deterministic Conditions for Subspace Clustering under Missing Data

no code implementations • 11 Jul 2016 • Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

In this paper we present deterministic conditions for success of sparse subspace clustering (SSC) under missing data, when data is assumed to come from a Union of Subspaces (UoS) model.

Clustering

Paper
Add Code

On deterministic conditions for subspace clustering under missing data

no code implementations • 15 Apr 2016 • Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

We provide extensive set of simulation results for clustering as well as completion of data under missing entries, under the UoS model.

Clustering

Paper
Add Code

Denoising and Completion of 3D Data via Multidimensional Dictionary Learning

no code implementations • 31 Dec 2015 • Zemin Zhang, Shuchin Aeron

In this paper a new dictionary learning algorithm for multidimensional data is proposed.

Dictionary Learning Image Denoising

Paper
Add Code

Multilinear Subspace Clustering

no code implementations • 21 Dec 2015 • Eric Kernfeld, Nathan Majumder, Shuchin Aeron, Misha Kilmer

In this paper we present a new model and an algorithm for unsupervised clustering of 2-D data such as images.

Clustering

Paper
Add Code

Group-Invariant Subspace Clustering

no code implementations • 15 Oct 2015 • Shuchin Aeron, Eric Kernfeld

In this paper we consider the problem of group invariant subspace clustering where the data is assumed to come from a union of group-invariant subspaces of a vector space, i. e. subspaces which are invariant with respect to action of a given group.

Clustering

Paper
Add Code

Information-theoretic Bounds on Matrix Completion under Union of Subspaces Model

no code implementations • 14 Aug 2015 • Vaneet Aggarwal, Shuchin Aeron

In this short note we extend some of the recent results on matrix completion under the assumption that the columns of the matrix can be grouped (clustered) into subspaces (not necessarily disjoint or independent).

Clustering Matrix Completion

Paper
Add Code

Adaptive Sampling of RF Fingerprints for Fine-grained Indoor Localization

no code implementations • 10 Aug 2015 • Xiao-Yang Liu, Shuchin Aeron, Vaneet Aggarwal, Xiaodong Wang, Min-You Wu

In contrast to several existing work that rely on random sampling, this paper shows that adaptivity in sampling can lead to significant improvements in localization accuracy.

Indoor Localization

Paper
Add Code

An algorithm for online tensor prediction

no code implementations • 28 Jul 2015 • John Pothier, Josh Girson, Shuchin Aeron

Then following a similar construction as in [3], we exploit this algorithm to propose an online algorithm for learning and prediction of tensors with provable regret guarantees.

Paper
Add Code

Exact tensor completion using t-SVD

no code implementations • 16 Feb 2015 • Zemin Zhang, Shuchin Aeron

Using this factorization one can derive notion of tensor rank, referred to as the tensor tubal rank, which has optimality properties similar to that of matrix rank derived from SVD.

Matrix Completion

Paper
Add Code

Clustering multi-way data: a novel algebraic approach

no code implementations • 22 Dec 2014 • Eric Kernfeld, Shuchin Aeron, Misha Kilmer

In this paper, we develop a method for unsupervised clustering of two-way (matrix) data by combining two recent innovations from different fields: the Sparse Subspace Clustering (SSC) algorithm [10], which groups points coming from a union of subspaces into their respective subspaces, and the t-product [18], which was introduced to provide a matrix-like multiplication for third order tensors.

Clustering Image Clustering

Paper
Add Code

Novel methods for multilinear data completion and de-noising based on tensor-SVD

2 code implementations • CVPR 2014 • Zemin Zhang, Gregory Ely, Shuchin Aeron, Ning Hao, Misha Kilmer

Based on t-SVD, the notion of multilinear rank and a related tensor nuclear norm was proposed in [11] to characterize informational and structural complexity of multilinear data.

Paper
Code

First Order Methods for Robust Non-negative Matrix Factorization for Large Scale Noisy Data

no code implementations • 24 Mar 2014 • Jason Gejie Liu, Shuchin Aeron

Nonnegative matrix factorization (NMF) has been shown to be identifiable under the separability assumption, under which all the columns(or rows) of the input data matrix belong to the convex cone generated by only a few of these columns(or rows) [1].

Paper
Add Code

Robust Large Scale Non-negative Matrix Factorization using Proximal Point Algorithm

no code implementations • 8 Jan 2014 • Jason Gejie Liu, Shuchin Aeron

A robust algorithm for non-negative matrix factorization (NMF) is presented in this paper with the purpose of dealing with large-scale data, where the separability assumption is satisfied.

Paper
Add Code

Novel Factorization Strategies for Higher Order Tensors: Implications for Compression and Recovery of Multi-linear Data

no code implementations • 2 Jul 2013 • Zemin Zhang, Gregory Ely, Shuchin Aeron, Ning Hao, Misha Kilmer

In this paper we propose novel methods for compression and recovery of multilinear data under limited sampling.

Data Compression Tensor Decomposition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.