Search Results for author: Artur Dubrawski

Found 71 papers, 23 papers with code

Multimodal Structure Preservation Learning

no code implementations29 Oct 2024 Chang Liu, Jieshi Chen, Lee H. Harrison, Artur Dubrawski

When selecting data to build machine learning models in practical applications, factors such as availability, acquisition cost, and discriminatory power are crucial considerations.

Epidemiology

TimeSeriesExam: A time series understanding exam

no code implementations18 Oct 2024 Yifu Cai, Arjun Choudhry, Mononito Goswami, Artur Dubrawski

We believe that the ability to programatically generate questions is fundamental to assessing and improving LLM's ability to understand and reason about time series data.

Anomaly Detection Multiple-choice +1

Exploring Representations and Interventions in Time Series Foundation Models

no code implementations19 Sep 2024 Michał Wiliński, Mononito Goswami, Nina Żukowska, Willa Potosnak, Artur Dubrawski

These findings underscore the value of representational analysis for optimizing models and demonstrate how conceptual steering offers new possibilities for more controlled and efficient time series analysis with TSFMs.

Time Series Time Series Analysis

Implicit Reasoning in Deep Time Series Forecasting

no code implementations17 Sep 2024 Willa Potosnak, Cristian Challu, Mononito Goswami, Michał Wiliński, Nina Żukowska, Artur Dubrawski

Recently, time series foundation models have shown promising zero-shot forecasting performance on time series from a wide range of domains.

Memorization Time Series +1

Bifurcation Identification for Ultrasound-driven Robotic Cannulation

no code implementations10 Sep 2024 Cecilia G. Morales, Dhruv Srikanth, Jack H. Good, Keith A. Dufendach, Artur Dubrawski

This is primarily due to the limited availability of ground truth data, in particular, data from live subjects, needed for training and validating reliable models.

A SAT-based approach to rigorous verification of Bayesian networks

1 code implementation2 Aug 2024 Ignacy Stępka, Nicholas Gisolfi, Artur Dubrawski

Recent advancements in machine learning have accelerated its widespread adoption across various real-world applications.

Enhanced Uncertainty Estimation in Ultrasound Image Segmentation with MSU-Net

no code implementations31 Jul 2024 Rohini Banerjee, Cecilia G. Morales, Artur Dubrawski

Autonomous robotic ultrasound systems can aid in needle insertion for medication delivery and support non-experts in such tasks.

Image Segmentation Segmentation +1

A Rate-Distortion View of Uncertainty Quantification

1 code implementation16 Jun 2024 Ifigeneia Apostolopoulou, Benjamin Eysenbach, Frank Nielsen, Artur Dubrawski

In this paper, we introduce Distance Aware Bottleneck (DAB), i. e., a new method for enriching deep neural networks with this property.

Gaussian Processes Out of Distribution (OOD) Detection +1

Exploring Loss Design Techniques For Decision Tree Robustness To Label Noise

no code implementations27 May 2024 Lukasz Sztukiewicz, Jack Henry Good, Artur Dubrawski

In the real world, data is often noisy, affecting not only the quality of features but also the accuracy of labels.

Deep Learning

MOMENT: A Family of Open Time-series Foundation Models

2 code implementations6 Feb 2024 Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski

We introduce MOMENT, a family of open-source foundation models for general-purpose time series analysis.

Time Series Time Series Analysis

Signal Quality Auditing for Time-series Data

no code implementations1 Feb 2024 Chufan Gao, Nicholas Gisolfi, Artur Dubrawski

Signal quality assessment (SQA) is required for monitoring the reliability of data acquisition systems, especially in AI-driven Predictive Maintenance (PMx) application contexts.

Denoising Time Series

Motion Informed Needle Segmentation in Ultrasound Images

no code implementations2 Dec 2023 Raghavv Goel, Cecilia Morales, Manpreet Singh, Artur Dubrawski, John Galeotti, Howie Choset

Third, to our knowledge we are the first to implement a learnable filter to incorporate non-linear needle motion for improving needle segmentation.

Decoder Segmentation

AQuA: A Benchmarking Tool for Label Quality Assessment

1 code implementation NeurIPS 2023 Mononito Goswami, Vedant Sanil, Arjun Choudhry, Arvind Srinivasan, Chalisa Udompanyawit, Artur Dubrawski

We hope that our proposed design space and benchmark enable practitioners to choose the right tools to improve their label quality and that our benchmark enables objective and rigorous evaluation of machine learning tools facing mislabeled data.

Benchmarking Label Error Detection +1

Hierarchically Coherent Multivariate Mixture Networks

1 code implementation11 May 2023 Kin G. Olivares, David Luo, Cristian Challu, Stefania La Vattiata, Max Mergenthaler, Artur Dubrawski

Large collections of time series data are often organized into hierarchies with different levels of aggregation; examples include product and geographical groupings.

Computational Efficiency Time Series

Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes

no code implementations24 Feb 2023 Chirag Nagpal, Vedant Sanil, Artur Dubrawski

In this paper we propose a statistical approach to recovering sparse phenogroups (or subtypes) that demonstrate differential treatment effects as compared to the study population.

Reslicing Ultrasound Images for Data Augmentation and Vessel Reconstruction

no code implementations18 Jan 2023 Cecilia Morales, Jason Yao, Tejas Rane, Robert Edman, Howie Choset, Artur Dubrawski

Robot-guided catheter insertion has the potential to deliver urgent medical care in situations where medical personnel are unavailable.

Data Augmentation Segmentation +1

HierarchicalForecast: A Reference Framework for Hierarchical Forecasting in Python

1 code implementation7 Jul 2022 Kin G. Olivares, Azul Garza, David Luo, Cristian Challú, Max Mergenthaler, Souhaib Ben Taieb, Shanika L. Wickramasuriya, Artur Dubrawski

Large collections of time series data are commonly organized into structures with different levels of aggregation; examples include product and geographical groupings.

BIG-bench Machine Learning Decision Making +2

Classifying Unstructured Clinical Notes via Automatic Weak Supervision

1 code implementation24 Jun 2022 Chufan Gao, Mononito Goswami, Jieshi Chen, Artur Dubrawski

Healthcare providers usually record detailed notes of the clinical care delivered to each patient for clinical, research, and billing purposes.

Text Classification

The Digital Twin Landscape at the Crossroads of Predictive Maintenance, Machine Learning and Physics Based Modeling

no code implementations21 Jun 2022 Brian Kunzer, Mario Berges, Artur Dubrawski

The application of a digital twin framework is highlighted in the field of predictive maintenance, and its extensions utilizing machine learning and physics based modeling.

Management

Weakly Supervised Classification of Vital Sign Alerts as Real or Artifact

no code implementations18 Jun 2022 Arnab Dey, Mononito Goswami, Joo Heung Yoon, Gilles Clermont, Michael Pinsky, Marilyn Hravnak, Artur Dubrawski

Our weakly supervised models perform competitively with traditional supervised techniques and require less involvement from domain experts, demonstrating their use as efficient and practical alternatives to supervised learning in HC applications of ML.

Weakly Supervised Classification

Doubting AI Predictions: Influence-Driven Second Opinion Recommendation

no code implementations29 Apr 2022 Maria De-Arteaga, Alexandra Chouldechova, Artur Dubrawski

Effective human-AI collaboration requires a system design that provides humans with meaningful ways to make sense of and critically evaluate algorithmic recommendations.

auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data

3 code implementations15 Apr 2022 Chirag Nagpal, Willa Potosnak, Artur Dubrawski

Applications of machine learning in healthcare often require working with time-to-event prediction tasks including prognostication of an adverse event, re-hospitalization or death.

BIG-bench Machine Learning counterfactual +1

Constrained Clustering and Multiple Kernel Learning without Pairwise Constraint Relaxation

1 code implementation23 Mar 2022 Benedikt Boecking, Vincent Jeanselme, Artur Dubrawski

However, the common practice of relaxing discrete constraints to a continuous domain to ease optimization when learning kernels or metrics can harm generalization, as information which only encodes linkage is transformed to informing distances.

Constrained Clustering

Generative Modeling Helps Weak Supervision (and Vice Versa)

1 code implementation22 Mar 2022 Benedikt Boecking, Nicholas Roberts, Willie Neiswanger, Stefano Ermon, Frederic Sala, Artur Dubrawski

The model outperforms baseline weak supervision label models on a number of multiclass image classification datasets, improves the quality of generated images, and further improves end-model performance through data augmentation with synthetic samples.

Data Augmentation Image Classification

Counterfactual Phenotyping with Censored Time-to-Events

3 code implementations22 Feb 2022 Chirag Nagpal, Mononito Goswami, Keith Dufendach, Artur Dubrawski

Estimation of treatment efficacy of real-world clinical interventions involves working with continuous outcomes such as time-to-death, re-hospitalization, or a composite event that may be subject to censoring.

counterfactual Counterfactual Reasoning

Weak Supervision for Affordable Modeling of Electrocardiogram Data

no code implementations9 Jan 2022 Mononito Goswami, Benedikt Boecking, Artur Dubrawski

We explore the use of multiple weak supervision sources to learn diagnostic models of abnormal heartbeats via human designed heuristics, without using ground truth labels on individual data points.

Time Series Time Series Analysis

Discovery of Crime Event Sequences with Constricted Spatio-Temporal Sequential Patterns

no code implementations3 Dec 2021 Piotr S. Maciąg, Robert Bembenik, Artur Dubrawski

We demonstrate that the set of CSTS patterns is a concise representation of all spatio-temporal sequential patterns that can be discovered in a given dataset.

STS

Provably Robust Model-Centric Explanations for Critical Decision-Making

no code implementations26 Oct 2021 Cecilia G. Morales, Nicholas Gisolfi, Robert Edman, James K. Miller, Artur Dubrawski

We recommend using a model-centric, Boolean Satisfiability (SAT) formalism to obtain useful explanations of trained model behavior, different and complementary to what can be gleaned from LIME and SHAP, popular data-centric explanation tools in Artificial Intelligence (AI).

Decision Making

Kernel Density Decision Trees

no code implementations29 Sep 2021 Jack Henry Good, Kyle Miller, Artur Dubrawski

FDTs address the sensitivity and tendency to overfitting of decision trees by representing uncertainty through fuzzy partitions.

Density Estimation

ACTIVE REFINEMENT OF WEAKLY SUPERVISED MODELS

no code implementations29 Sep 2021 Mononito Goswami, Chufan Gao, Benedikt Boecking, Saswati Ray, Artur Dubrawski

In domains such as clinical research, where data collection and its careful characterization is particularly expensive and tedious, this reliance on pointillisticaly labeled data is one of the biggest roadblocks to the adoption of modern data-hungry ML algorithms.

Active Learning

Deep Attentive Variational Inference

no code implementations ICLR 2022 Ifigeneia Apostolopoulou, Ian Char, Elan Rosenfeld, Artur Dubrawski

Moreover, the architecture for this class of models favors local interactions among the latent variables between neighboring layers when designing the conditioning factors of the involved distributions.

Variational Inference

End-to-End Weak Supervision

1 code implementation NeurIPS 2021 Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski

Aggregating multiple sources of weak supervision (WS) can ease the data-labeling bottleneck prevalent in many machine learning applications, by replacing the tedious manual collection of ground truth labels.

Classification

Dependency Structure Misspecification in Multi-Source Weak Supervision Models

no code implementations18 Jun 2021 Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski

Data programming (DP) has proven to be an attractive alternative to costly hand-labeling of data.

DMIDAS: Deep Mixed Data Sampling Regression for Long Multi-Horizon Time Series Forecasting

no code implementations7 Jun 2021 Cristian Challu, Kin G. Olivares, Gus Welter, Artur Dubrawski

We validate our proposed method, DMIDAS, on high-frequency healthcare and electricity price data with long forecasting horizons (~1000 timestamps) where we improve the prediction accuracy by 5% over state-of-the-art models, reducing the number of parameters of NBEATS by nearly 70%.

regression Time Series +1

Leveraging Expert Consistency to Improve Algorithmic Decision Support

1 code implementation24 Jan 2021 Maria De-Arteaga, Vincent Jeanselme, Artur Dubrawski, Alexandra Chouldechova

However, there is frequently a construct gap: a gap between the construct of interest to the decision-making task and what is captured in proxies used as labels to train ML models.

BIG-bench Machine Learning

Robust Multi-view Representation Learning

no code implementations1 Jan 2021 Sibi Venkatesan, Kyle Miller, Artur Dubrawski

Our synthetic and real-world experiments show promising results for the application of these models to robust representation learning.

Representation Learning Self-Driving Cars

Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling

1 code implementation ICLR 2021 Benedikt Boecking, Willie Neiswanger, Eric Xing, Artur Dubrawski

Our experiments demonstrate that only a small number of feedback iterations are needed to train models that achieve highly competitive test set performance without access to ground truth training labels.

Weakly Supervised Classification

Self-Reflective Variational Autoencoder

no code implementations10 Jul 2020 Ifigeneia Apostolopoulou, Elan Rosenfeld, Artur Dubrawski

The Variational Autoencoder (VAE) is a powerful framework for learning probabilistic latent variable generative models.

Variational Inference

Preference-based Reinforcement Learning with Finite-Time Guarantees

no code implementations NeurIPS 2020 Yichong Xu, Ruosong Wang, Lin F. Yang, Aarti Singh, Artur Dubrawski

If preferences are stochastic, and the preference probability relates to the hidden reward values, we present algorithms for PbRL, both with and without a simulator, that are able to identify the best policy up to accuracy $\varepsilon$ with high probability.

reinforcement-learning Reinforcement Learning +1

System-Level Predictive Maintenance: Review of Research Literature and Gap Analysis

no code implementations11 May 2020 Kyle Miller, Artur Dubrawski

This paper reviews current literature in the field of predictive maintenance from the system point of view.

Pairwise Feedback for Data Programming

no code implementations16 Dec 2019 Benedikt Boecking, Artur Dubrawski

We propose to improve modeling of latent class variables in the programmatic creation of labeled datasets by incorporating pairwise feedback into the process.

Mutually Regressive Point Processes

1 code implementation NeurIPS 2019 Ifigeneia Apostolopoulou, Scott Linderman, Kyle Miller, Artur Dubrawski

Despite many potential applications, existing point process models are limited in their ability to capture complex patterns of interaction.

Bayesian Inference Point Processes

Detecting Patterns of Physiological Response to Hemodynamic Stress via Unsupervised Deep Learning

no code implementations12 Nov 2019 Chufan Gao, Fabian Falck, Mononito Goswami, Anthony Wertz, Michael R. Pinsky, Artur Dubrawski

By analyzing the clusters of latent embeddings and visualizing them over time, we hypothesize that the clusters correspond to the physiological response patterns that match physicians' intuition.

BIG-bench Machine Learning Survival Prediction +2

Zeroth Order Non-convex optimization with Dueling-Choice Bandits

no code implementations3 Nov 2019 Yichong Xu, Aparna Joshi, Aarti Singh, Artur Dubrawski

We consider a novel setting of zeroth order non-convex optimization, where in addition to querying the function value at a given point, we can also duel two points and get the point with the larger function value.

Active Learning for Graph Neural Networks via Node Feature Propagation

no code implementations16 Oct 2019 Yuexin Wu, Yichong Xu, Aarti Singh, Yiming Yang, Artur Dubrawski

Graph Neural Networks (GNNs) for prediction tasks like node classification or edge prediction have received increasing attention in recent machine learning from graphically structured data.

Active Learning Clustering +3

Thresholding Bandit Problem with Both Duels and Pulls

no code implementations14 Oct 2019 Yichong Xu, Xi Chen, Aarti Singh, Artur Dubrawski

The Thresholding Bandit Problem (TBP) aims to find the set of arms with mean rewards greater than a given threshold.

Active Learning Graph Neural Networks via Node Feature Propagation

no code implementations25 Sep 2019 Yuexin Wu, Yichong Xu, Aarti Singh, Artur Dubrawski, Yiming Yang

Graph Neural Networks (GNNs) for prediction tasks like node classification or edge prediction have received increasing attention in recent machine learning from graphically structured data.

Active Learning Node Classification +1

DASGrad: Double Adaptive Stochastic Gradient

no code implementations25 Sep 2019 Kin Gutierrez, Cristian Challu, Jin Li, Artur Dubrawski

Adaptive moment methods have been remarkably successful for optimization under the presence of high dimensional or sparse gradients, in parallel to this, adaptive sampling probabilities for SGD have allowed optimizers to improve convergence rates by prioritizing examples to learn efficiently.

Transfer Learning

Nonlinear Semi-Parametric Models for Survival Analysis

1 code implementation14 May 2019 Chirag Nagpal, Rohan Sangave, Amit Chahar, Parth Shah, Artur Dubrawski, Bhiksha Raj

Semi-parametric survival analysis methods like the Cox Proportional Hazards (CPH) regression (Cox, 1972) are a popular approach for survival analysis.

regression Survival Analysis

Double Adaptive Stochastic Gradient Optimization

no code implementations6 Nov 2018 Kin Gutierrez, Jin Li, Cristian Challu, Artur Dubrawski

We observe that the benefits of~\textsc{DASGrad} increase with the model complexity and variability of the gradients, and we explore the resulting utility in extensions of distribution-matching multitask learning.

Deep Learning

On the Interaction Effects Between Prediction and Clustering

1 code implementation18 Jul 2018 Matt Barnes, Artur Dubrawski

Machine learning systems increasingly depend on pipelines of multiple algorithms to provide high quality and well structured predictions.

Clustering

Learning under selective labels in the presence of expert consistency

no code implementations2 Jul 2018 Maria De-Arteaga, Artur Dubrawski, Alexandra Chouldechova

We explore the problem of learning under selective labels in the context of algorithm-assisted decision making.

Data Augmentation Decision Making +1

Nonparametric Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information

no code implementations ICML 2018 Yichong Xu, Hariank Muthakana, Sivaraman Balakrishnan, Aarti Singh, Artur Dubrawski

Finally, we present experiments that show the efficacy of RR and investigate its robustness to various sources of noise and model-misspecification.

regression

Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information

no code implementations ICML 2018 Yichong Xu, Sivaraman Balakrishnan, Aarti Singh, Artur Dubrawski

In supervised learning, we typically leverage a fully labeled dataset to design methods for function estimation or prediction.

regression

Noise-Tolerant Interactive Learning Using Pairwise Comparisons

no code implementations NeurIPS 2017 Yichong Xu, Hongyang Zhang, Kyle Miller, Aarti Singh, Artur Dubrawski

We study the problem of interactively learning a binary classifier using noisy labeling and pairwise comparison oracles, where the comparison oracle answers which one in the given two instances is more likely to be positive.

Characterization of Hemodynamic Signal by Learning Multi-View Relationships

no code implementations17 Sep 2017 Eric Lei, Kyle Miller, Michael R. Pinsky, Artur Dubrawski

We aim to investigate the usefulness of nonlinear multi-view relations to characterize multi-view data in an explainable manner.

Clustering

Scaling Active Search using Linear Similarity Functions

1 code implementation30 Apr 2017 Sibi Venkatesan, James K. Miller, Jeff Schneider, Artur Dubrawski

In this paper, we consider the problem of Active Search where we are given a similarity function between data points.

Information Retrieval Retrieval

Noise-Tolerant Interactive Learning from Pairwise Comparisons

no code implementations19 Apr 2017 Yichong Xu, Hongyang Zhang, Aarti Singh, Kyle Miller, Artur Dubrawski

We study the problem of interactively learning a binary classifier using noisy labeling and pairwise comparison oracles, where the comparison oracle answers which one in the given two instances is more likely to be positive.

Clustering on the Edge: Learning Structure in Graphs

no code implementations5 May 2016 Matt Barnes, Artur Dubrawski

With the recent popularity of graphical clustering methods, there has been an increased focus on the information between samples.

Clustering Entity Resolution +2

Batched Lazy Decision Trees

no code implementations8 Mar 2016 Mathieu Guillame-Bert, Artur Dubrawski

We introduce a batched lazy algorithm for supervised classification using decision trees.

General Classification

Canonical Autocorrelation Analysis

no code implementations19 Nov 2015 Maria De-Arteaga, Artur Dubrawski, Peter Huggins

We present an extension of sparse Canonical Correlation Analysis (CCA) designed for finding multiple-to-multiple linear correlations within a single set of variables.

Anomaly Detection

Lass-0: sparse non-convex regression by local search

no code implementations13 Nov 2015 William Herlands, Maria De-Arteaga, Daniel Neill, Artur Dubrawski

We compute approximate solutions to L0 regularized linear regression using L1 regularization, also known as the Lasso, as an initialization step.

regression

Performance Bounds for Pairwise Entity Resolution

no code implementations10 Sep 2015 Matt Barnes, Kyle Miller, Artur Dubrawski

One significant challenge to scaling entity resolution algorithms to massive datasets is understanding how performance changes after moving beyond the realm of small, manually labeled reference datasets.

BIG-bench Machine Learning Entity Resolution

Real-Time Visual Analysis of Microvascular Blood Flow for Critical Care

no code implementations CVPR 2015 Chao Liu, Hernando Gomez, Srinivasa Narasimhan, Artur Dubrawski, Michael R. Pinsky, Brian Zuckerbraun

Our method is able to extract microcirculatory measurements that are consistent with clinical intuition and it has a potential to become a useful tool in critical care medicine.

Video Stabilization

Learning the Semantic Correlation: An Alternative Way to Gain from Unlabeled Text

no code implementations NeurIPS 2008 Yi Zhang, Artur Dubrawski, Jeff G. Schneider

In an empirical study, we construct 190 different text classification tasks from a real-world benchmark, and the unlabeled documents are a mixture from all these tasks.

General Classification text-classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.