no code implementations • 29 Oct 2024 • Chang Liu, Jieshi Chen, Lee H. Harrison, Artur Dubrawski
When selecting data to build machine learning models in practical applications, factors such as availability, acquisition cost, and discriminatory power are crucial considerations.
no code implementations • 18 Oct 2024 • Yifu Cai, Arjun Choudhry, Mononito Goswami, Artur Dubrawski
We believe that the ability to programatically generate questions is fundamental to assessing and improving LLM's ability to understand and reason about time series data.
no code implementations • 19 Sep 2024 • Michał Wiliński, Mononito Goswami, Nina Żukowska, Willa Potosnak, Artur Dubrawski
These findings underscore the value of representational analysis for optimizing models and demonstrate how conceptual steering offers new possibilities for more controlled and efficient time series analysis with TSFMs.
no code implementations • 17 Sep 2024 • Willa Potosnak, Cristian Challu, Mononito Goswami, Michał Wiliński, Nina Żukowska, Artur Dubrawski
Recently, time series foundation models have shown promising zero-shot forecasting performance on time series from a wide range of domains.
no code implementations • 10 Sep 2024 • Cecilia G. Morales, Dhruv Srikanth, Jack H. Good, Keith A. Dufendach, Artur Dubrawski
This is primarily due to the limited availability of ground truth data, in particular, data from live subjects, needed for training and validating reliable models.
1 code implementation • 2 Aug 2024 • Ignacy Stępka, Nicholas Gisolfi, Artur Dubrawski
Recent advancements in machine learning have accelerated its widespread adoption across various real-world applications.
no code implementations • 31 Jul 2024 • Rohini Banerjee, Cecilia G. Morales, Artur Dubrawski
Autonomous robotic ultrasound systems can aid in needle insertion for medication delivery and support non-experts in such tasks.
1 code implementation • 16 Jun 2024 • Ifigeneia Apostolopoulou, Benjamin Eysenbach, Frank Nielsen, Artur Dubrawski
In this paper, we introduce Distance Aware Bottleneck (DAB), i. e., a new method for enriching deep neural networks with this property.
no code implementations • 27 May 2024 • Lukasz Sztukiewicz, Jack Henry Good, Artur Dubrawski
In the real world, data is often noisy, affecting not only the quality of features but also the accuracy of labels.
2 code implementations • 6 Feb 2024 • Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski
We introduce MOMENT, a family of open-source foundation models for general-purpose time series analysis.
no code implementations • 1 Feb 2024 • Chufan Gao, Nicholas Gisolfi, Artur Dubrawski
Signal quality assessment (SQA) is required for monitoring the reliability of data acquisition systems, especially in AI-driven Predictive Maintenance (PMx) application contexts.
no code implementations • 2 Dec 2023 • Raghavv Goel, Cecilia Morales, Manpreet Singh, Artur Dubrawski, John Galeotti, Howie Choset
Third, to our knowledge we are the first to implement a learnable filter to incorporate non-linear needle motion for improving needle segmentation.
1 code implementation • 22 Sep 2023 • Willa Potosnak, Cristian Challu, Kin Gutierrez Olivares, Keith Dufendach, Artur Dubrawski
Our hybrid global-local architecture improves over patient-specific models by 15. 8% on average.
1 code implementation • NeurIPS 2023 • Mononito Goswami, Vedant Sanil, Arjun Choudhry, Arvind Srinivasan, Chalisa Udompanyawit, Artur Dubrawski
We hope that our proposed design space and benchmark enable practitioners to choose the right tools to improve their label quality and that our benchmark enables objective and rigorous evaluation of machine learning tools facing mislabeled data.
1 code implementation • 11 May 2023 • Kin G. Olivares, David Luo, Cristian Challu, Stefania La Vattiata, Max Mergenthaler, Artur Dubrawski
Large collections of time series data are often organized into hierarchies with different levels of aggregation; examples include product and geographical groupings.
no code implementations • 24 Feb 2023 • Chirag Nagpal, Vedant Sanil, Artur Dubrawski
In this paper we propose a statistical approach to recovering sparse phenogroups (or subtypes) that demonstrate differential treatment effects as compared to the study population.
no code implementations • 18 Jan 2023 • Cecilia Morales, Jason Yao, Tejas Rane, Robert Edman, Howie Choset, Artur Dubrawski
Robot-guided catheter insertion has the potential to deliver urgent medical care in situations where medical personnel are unavailable.
1 code implementation • 7 Jul 2022 • Kin G. Olivares, Azul Garza, David Luo, Cristian Challú, Max Mergenthaler, Souhaib Ben Taieb, Shanika L. Wickramasuriya, Artur Dubrawski
Large collections of time series data are commonly organized into structures with different levels of aggregation; examples include product and geographical groupings.
1 code implementation • 24 Jun 2022 • Chufan Gao, Mononito Goswami, Jieshi Chen, Artur Dubrawski
Healthcare providers usually record detailed notes of the clinical care delivered to each patient for clinical, research, and billing purposes.
no code implementations • 21 Jun 2022 • Brian Kunzer, Mario Berges, Artur Dubrawski
The application of a digital twin framework is highlighted in the field of predictive maintenance, and its extensions utilizing machine learning and physics based modeling.
no code implementations • 18 Jun 2022 • Arnab Dey, Mononito Goswami, Joo Heung Yoon, Gilles Clermont, Michael Pinsky, Marilyn Hravnak, Artur Dubrawski
Our weakly supervised models perform competitively with traditional supervised techniques and require less involvement from domain experts, demonstrating their use as efficient and practical alternatives to supervised learning in HC applications of ML.
no code implementations • 29 Apr 2022 • Maria De-Arteaga, Alexandra Chouldechova, Artur Dubrawski
Effective human-AI collaboration requires a system design that provides humans with meaningful ways to make sense of and critically evaluate algorithmic recommendations.
3 code implementations • 15 Apr 2022 • Chirag Nagpal, Willa Potosnak, Artur Dubrawski
Applications of machine learning in healthcare often require working with time-to-event prediction tasks including prognostication of an adverse event, re-hospitalization or death.
1 code implementation • 23 Mar 2022 • Benedikt Boecking, Vincent Jeanselme, Artur Dubrawski
However, the common practice of relaxing discrete constraints to a continuous domain to ease optimization when learning kernels or metrics can harm generalization, as information which only encodes linkage is transformed to informing distances.
1 code implementation • 22 Mar 2022 • Benedikt Boecking, Nicholas Roberts, Willie Neiswanger, Stefano Ermon, Frederic Sala, Artur Dubrawski
The model outperforms baseline weak supervision label models on a number of multiclass image classification datasets, improves the quality of generated images, and further improves end-model performance through data augmentation with synthetic samples.
3 code implementations • 22 Feb 2022 • Chirag Nagpal, Mononito Goswami, Keith Dufendach, Artur Dubrawski
Estimation of treatment efficacy of real-world clinical interventions involves working with continuous outcomes such as time-to-death, re-hospitalization, or a composite event that may be subject to censoring.
4 code implementations • 30 Jan 2022 • Cristian Challu, Kin G. Olivares, Boris N. Oreshkin, Federico Garza, Max Mergenthaler-Canseco, Artur Dubrawski
Recent progress in neural forecasting accelerated improvements in the performance of large-scale forecasting systems.
no code implementations • 9 Jan 2022 • Mononito Goswami, Benedikt Boecking, Artur Dubrawski
We explore the use of multiple weak supervision sources to learn diagnostic models of abnormal heartbeats via human designed heuristics, without using ground truth labels on individual data points.
no code implementations • 3 Dec 2021 • Piotr S. Maciąg, Robert Bembenik, Artur Dubrawski
We demonstrate that the set of CSTS patterns is a concise representation of all spatio-temporal sequential patterns that can be discovered in a given dataset.
no code implementations • 26 Oct 2021 • Cecilia G. Morales, Nicholas Gisolfi, Robert Edman, James K. Miller, Artur Dubrawski
We recommend using a model-centric, Boolean Satisfiability (SAT) formalism to obtain useful explanations of trained model behavior, different and complementary to what can be gleaned from LIME and SHAP, popular data-centric explanation tools in Artificial Intelligence (AI).
no code implementations • 29 Sep 2021 • Jack Henry Good, Kyle Miller, Artur Dubrawski
FDTs address the sensitivity and tendency to overfitting of decision trees by representing uncertainty through fuzzy partitions.
no code implementations • 29 Sep 2021 • Mononito Goswami, Chufan Gao, Benedikt Boecking, Saswati Ray, Artur Dubrawski
In domains such as clinical research, where data collection and its careful characterization is particularly expensive and tedious, this reliance on pointillisticaly labeled data is one of the biggest roadblocks to the adoption of modern data-hungry ML algorithms.
no code implementations • ICLR 2022 • Ifigeneia Apostolopoulou, Ian Char, Elan Rosenfeld, Artur Dubrawski
Moreover, the architecture for this class of models favors local interactions among the latent variables between neighboring layers when designing the conditioning factors of the involved distributions.
1 code implementation • NeurIPS 2021 • Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski
Aggregating multiple sources of weak supervision (WS) can ease the data-labeling bottleneck prevalent in many machine learning applications, by replacing the tedious manual collection of ground truth labels.
Ranked #1 on Classification on BiasBios
no code implementations • 18 Jun 2021 • Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski
Data programming (DP) has proven to be an attractive alternative to costly hand-labeling of data.
no code implementations • 7 Jun 2021 • Cristian Challu, Kin G. Olivares, Gus Welter, Artur Dubrawski
We validate our proposed method, DMIDAS, on high-frequency healthcare and electricity price data with long forecasting horizons (~1000 timestamps) where we improve the prediction accuracy by 5% over state-of-the-art models, reducing the number of parameters of NBEATS by nearly 70%.
2 code implementations • 12 Apr 2021 • Kin G. Olivares, Cristian Challu, Grzegorz Marcjasz, Rafał Weron, Artur Dubrawski
We extend the neural basis expansion analysis (NBEATS) to incorporate exogenous factors.
1 code implementation • 24 Jan 2021 • Maria De-Arteaga, Vincent Jeanselme, Artur Dubrawski, Alexandra Chouldechova
However, there is frequently a construct gap: a gap between the construct of interest to the decision-making task and what is captured in proxies used as labels to train ML models.
no code implementations • 1 Jan 2021 • Sibi Venkatesan, Kyle Miller, Artur Dubrawski
Our synthetic and real-world experiments show promising results for the application of these models to robust representation learning.
1 code implementation • ICLR 2021 • Benedikt Boecking, Willie Neiswanger, Eric Xing, Artur Dubrawski
Our experiments demonstrate that only a small number of feedback iterations are needed to train models that achieve highly competitive test set performance without access to ground truth training labels.
no code implementations • 10 Jul 2020 • Ifigeneia Apostolopoulou, Elan Rosenfeld, Artur Dubrawski
The Variational Autoencoder (VAE) is a powerful framework for learning probabilistic latent variable generative models.
no code implementations • NeurIPS 2020 • Yichong Xu, Ruosong Wang, Lin F. Yang, Aarti Singh, Artur Dubrawski
If preferences are stochastic, and the preference probability relates to the hidden reward values, we present algorithms for PbRL, both with and without a simulator, that are able to identify the best policy up to accuracy $\varepsilon$ with high probability.
no code implementations • 11 May 2020 • Kyle Miller, Artur Dubrawski
This paper reviews current literature in the field of predictive maintenance from the system point of view.
4 code implementations • 2 Mar 2020 • Chirag Nagpal, Xinyu Rachel Li, Artur Dubrawski
We describe a new approach to estimating relative risks in time-to-event prediction problems with censored data in a fully parametric manner.
no code implementations • 16 Dec 2019 • Benedikt Boecking, Artur Dubrawski
We propose to improve modeling of latent class variables in the programmatic creation of labeled datasets by incorporating pairwise feedback into the process.
1 code implementation • NeurIPS 2019 • Ifigeneia Apostolopoulou, Scott Linderman, Kyle Miller, Artur Dubrawski
Despite many potential applications, existing point process models are limited in their ability to capture complex patterns of interaction.
no code implementations • 12 Nov 2019 • Chufan Gao, Fabian Falck, Mononito Goswami, Anthony Wertz, Michael R. Pinsky, Artur Dubrawski
By analyzing the clusters of latent embeddings and visualizing them over time, we hypothesize that the clusters correspond to the physiological response patterns that match physicians' intuition.
no code implementations • 3 Nov 2019 • Yichong Xu, Aparna Joshi, Aarti Singh, Artur Dubrawski
We consider a novel setting of zeroth order non-convex optimization, where in addition to querying the function value at a given point, we can also duel two points and get the point with the larger function value.
no code implementations • 16 Oct 2019 • Yuexin Wu, Yichong Xu, Aarti Singh, Yiming Yang, Artur Dubrawski
Graph Neural Networks (GNNs) for prediction tasks like node classification or edge prediction have received increasing attention in recent machine learning from graphically structured data.
no code implementations • 14 Oct 2019 • Yichong Xu, Xi Chen, Aarti Singh, Artur Dubrawski
The Thresholding Bandit Problem (TBP) aims to find the set of arms with mean rewards greater than a given threshold.
no code implementations • 25 Sep 2019 • Yuexin Wu, Yichong Xu, Aarti Singh, Artur Dubrawski, Yiming Yang
Graph Neural Networks (GNNs) for prediction tasks like node classification or edge prediction have received increasing attention in recent machine learning from graphically structured data.
no code implementations • 25 Sep 2019 • Kin Gutierrez, Cristian Challu, Jin Li, Artur Dubrawski
Adaptive moment methods have been remarkably successful for optimization under the presence of high dimensional or sparse gradients, in parallel to this, adaptive sampling probabilities for SGD have allowed optimizers to improve convergence rates by prioritizing examples to learn efficiently.
1 code implementation • 14 May 2019 • Chirag Nagpal, Rohan Sangave, Amit Chahar, Parth Shah, Artur Dubrawski, Bhiksha Raj
Semi-parametric survival analysis methods like the Cox Proportional Hazards (CPH) regression (Cox, 1972) are a popular approach for survival analysis.
no code implementations • 6 Nov 2018 • Kin Gutierrez, Jin Li, Cristian Challu, Artur Dubrawski
We observe that the benefits of~\textsc{DASGrad} increase with the model complexity and variability of the gradients, and we explore the resulting utility in extensions of distribution-matching multitask learning.
1 code implementation • 18 Jul 2018 • Matt Barnes, Artur Dubrawski
Machine learning systems increasingly depend on pipelines of multiple algorithms to provide high quality and well structured predictions.
no code implementations • 2 Jul 2018 • Maria De-Arteaga, Artur Dubrawski, Alexandra Chouldechova
We explore the problem of learning under selective labels in the context of algorithm-assisted decision making.
no code implementations • ICML 2018 • Yichong Xu, Hariank Muthakana, Sivaraman Balakrishnan, Aarti Singh, Artur Dubrawski
Finally, we present experiments that show the efficacy of RR and investigate its robustness to various sources of noise and model-misspecification.
no code implementations • ICML 2018 • Yichong Xu, Sivaraman Balakrishnan, Aarti Singh, Artur Dubrawski
In supervised learning, we typically leverage a fully labeled dataset to design methods for function estimation or prediction.
1 code implementation • 28 Apr 2018 • Igor Gitman, Jieshi Chen, Eric Lei, Artur Dubrawski
In this paper we propose two novel approaches on how to solve this problem.
no code implementations • NeurIPS 2017 • Yichong Xu, Hongyang Zhang, Kyle Miller, Aarti Singh, Artur Dubrawski
We study the problem of interactively learning a binary classifier using noisy labeling and pairwise comparison oracles, where the comparison oracle answers which one in the given two instances is more likely to be positive.
no code implementations • 17 Sep 2017 • Eric Lei, Kyle Miller, Michael R. Pinsky, Artur Dubrawski
We aim to investigate the usefulness of nonlinear multi-view relations to characterize multi-view data in an explainable manner.
1 code implementation • 30 Apr 2017 • Sibi Venkatesan, James K. Miller, Jeff Schneider, Artur Dubrawski
In this paper, we consider the problem of Active Search where we are given a similarity function between data points.
no code implementations • 19 Apr 2017 • Yichong Xu, Hongyang Zhang, Aarti Singh, Kyle Miller, Artur Dubrawski
We study the problem of interactively learning a binary classifier using noisy labeling and pairwise comparison oracles, where the comparison oracle answers which one in the given two instances is more likely to be positive.
no code implementations • 5 May 2016 • Matt Barnes, Artur Dubrawski
With the recent popularity of graphical clustering methods, there has been an increased focus on the information between samples.
no code implementations • 8 Mar 2016 • Mathieu Guillame-Bert, Artur Dubrawski
We introduce a batched lazy algorithm for supervised classification using decision trees.
no code implementations • 19 Nov 2015 • Maria De-Arteaga, Artur Dubrawski, Peter Huggins
We present an extension of sparse Canonical Correlation Analysis (CCA) designed for finding multiple-to-multiple linear correlations within a single set of variables.
no code implementations • 13 Nov 2015 • William Herlands, Maria De-Arteaga, Daniel Neill, Artur Dubrawski
We compute approximate solutions to L0 regularized linear regression using L1 regularization, also known as the Lasso, as an initialization step.
no code implementations • 10 Sep 2015 • Matt Barnes, Kyle Miller, Artur Dubrawski
One significant challenge to scaling entity resolution algorithms to massive datasets is understanding how performance changes after moving beyond the realm of small, manually labeled reference datasets.
no code implementations • CVPR 2015 • Chao Liu, Hernando Gomez, Srinivasa Narasimhan, Artur Dubrawski, Michael R. Pinsky, Brian Zuckerbraun
Our method is able to extract microcirculatory measurements that are consistent with clinical intuition and it has a potential to become a useful tool in critical care medicine.
no code implementations • NeurIPS 2012 • Madalina Fiterau, Artur Dubrawski
In many applications classification systems often require in the loop human intervention.
no code implementations • NeurIPS 2008 • Yi Zhang, Artur Dubrawski, Jeff G. Schneider
In an empirical study, we construct 190 different text classification tasks from a real-world benchmark, and the unlabeled documents are a mixture from all these tasks.