no code implementations • 28 Oct 2024 • Dimitris Bertsimas, Lisa Everest, Jiayi Gu, Matthew Peroni, Vasiliki Stoumpou
Overall, our approach of Generalized Soft Trees provides a tractable method that is high-performing on (un)structured datasets and preserves interpretability more than traditional deep learning methods.
no code implementations • 24 Oct 2024 • Dimitris Bertsimas, Vasiliki Stoumpou
In binary classification problems, the proposed extensions and the corresponding results suggest the equivalence of bagging and boosting methods in performance, and the edge of bagging in interpretability by leveraging a few learners of the ensemble, which is not an option in the less explainable boosting methods.
1 code implementation • 18 Jul 2024 • Dimitris Bertsimas, Nicholas A. G. Johnson
The runtime of our algorithm is competitive with and often superior to that of the benchmark methods.
no code implementations • 30 May 2024 • Dimitris Bertsimas, Matthew Peroni
Should a black-box or interpretable model be used?
no code implementations • 11 May 2024 • Dimitris Bertsimas, Cynthia Zeng
The escalating frequency and severity of natural disasters, exacerbated by climate change, underscore the critical role of insurance in facilitating recovery and promoting investments in risk reduction.
no code implementations • 29 Apr 2024 • Dimitris Bertsimas, Yu Ma
Developing an integrated many-to-many framework leveraging multimodal data for multiple tasks is crucial to unifying healthcare applications ranging from diagnoses to operations.
no code implementations • 28 Mar 2024 • Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis
We consider the task of retraining machine learning (ML) models when new batches of data become available.
no code implementations • 2 Feb 2024 • Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet
When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions.
1 code implementation • 22 Jan 2024 • Wes Gurnee, Theo Horsley, Zifan Carl Guo, Tara Rezaei Kheirkhah, Qinyi Sun, Will Hathaway, Neel Nanda, Dimitris Bertsimas
In other words, are neural mechanisms universal across different models?
no code implementations • 12 Nov 2023 • Dimitris Bertsimas, Yu Ma
We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least squares regression problem.
no code implementations • 3 Nov 2023 • Dimitris Bertsimas, Angelos G. Koulouras, Georgios Antonios Margonis
We applied our framework to observational data of patients with gastrointestinal stromal tumors (GIST) and validated the OPTs in an external cohort using the sensitivity and specificity metrics.
no code implementations • 3 Nov 2023 • Dimitris Bertsimas, Georgios Margaritis
Many approaches for addressing Global Optimization problems typically rely on relaxations of nonlinear constraints over specific mathematical primitives.
no code implementations • 23 Jul 2023 • Dimitris Bertsimas, Cheol Woo Kim
We propose an approach based on machine learning to solve two-stage linear adaptive robust optimization (ARO) problems with binary here-and-now variables and polyhedral uncertainty sets.
no code implementations • 23 Jul 2023 • Dimitris Bertsimas, Cheol Woo Kim
We use numerical solutions of MFQNET control problems as a training set and apply OCT-H to learn explicit control policies.
1 code implementation • 5 Jun 2023 • Dimitris Bertsimas, Nicholas A. G. Johnson
On real world ECG data, for a given $\ell_2$ reconstruction error our approach produces solutions that are on average $9. 95\%$ more sparse than benchmark methods ($3. 88\%$ more sparse if only compared against the best performing benchmark), while for a given sparsity level our approach produces solutions that have on average $10. 77\%$ lower reconstruction error than benchmark methods ($1. 42\%$ lower error if only compared against the best performing benchmark).
no code implementations • 26 May 2023 • Dimitris Bertsimas, Vassilis Digalakis Jr
Owing to their inherently interpretable structure, decision trees are commonly used in applications where interpretability is essential.
no code implementations • 25 May 2023 • Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas
Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals.
2 code implementations • 20 May 2023 • Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet
Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible.
2 code implementations • 2 May 2023 • Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas
Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood.
1 code implementation • 9 Apr 2023 • Dimitris Bertsimas, Leonard Boussioux
Accurate time series forecasting is critical for a wide range of problems with temporal data.
no code implementations • 22 Mar 2023 • Dimitris Bertsimas, Leonard Boussioux, Cynthia Zeng
The predictive component of our framework employs various machine learning models, such as gradient-boosted tree-based models and ensemble methods, for time series forecasting.
no code implementations • 11 Mar 2023 • Dimitris Bertsimas, Kimberly Villalobos Carballo
We prove that the proposed approach is asymptotically optimal for multistage stochastic optimization with side information.
no code implementations • 29 Jan 2023 • Cynthia Zeng, Dimitris Bertsimas
This work presents a novel multimodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset.
no code implementations • 15 Oct 2022 • Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li
We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders.
no code implementations • 21 Jun 2022 • Kimberly Villalobos Carballo, Liangyuan Na, Yu Ma, Léonard Boussioux, Cynthia Zeng, Luis R. Soenksen, Dimitris Bertsimas
We show that 1) applying our TabText framework enables the generation of high-performing and simple machine learning baseline models with minimal data pre-processing, and 2) augmenting pre-processed tabular data with TabText representations improves the average and worst-case AUC performance of standard machine learning models by as much as 6%.
1 code implementation • 1 Jun 2022 • Dimitris Bertsimas, Wes Gurnee
Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning.
1 code implementation • 25 Feb 2022 • Luis R. Soenksen, Yu Ma, Cynthia Zeng, Leonard D. J. Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly M. Wiberg, Michael L. Li, Ignacio Fuentes, Dimitris Bertsimas
The generalizable properties and flexibility of our Holistic AI in Medicine (HAIM) framework could offer a promising pathway for future multimodal predictive systems in clinical and operational healthcare settings.
1 code implementation • 17 Dec 2021 • Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick den Hertog
We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss that is based on a holistic expansion of the network instead of separate bounds for each layer.
1 code implementation • 4 Nov 2021 • Donato Maragno, Holly Wiberg, Dimitris Bertsimas, S. Ilker Birbil, Dick den Hertog, Adejuyigbe Fajemisin
The case studies illustrate the framework's ability to generate high-quality prescriptions as well as the value added by the trust region, the use of ensembles to control model robustness, the consideration of multiple machine learning methods, and the inclusion of multiple learned constraints.
1 code implementation • 29 Oct 2021 • Dimitris Bertsimas, Kimberly Villalobos Carballo, Léonard Boussioux, Michael Lingzhi Li, Alex Paskov, Ivan Paskov
This paper presents a novel holistic deep learning framework that simultaneously addresses the challenges of vulnerability to input perturbations, overparametrization, and performance instability from different train-validation splits.
1 code implementation • 26 Sep 2021 • Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson
We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth.
no code implementations • 3 Jul 2021 • Hari Bandi, Dimitris Bertsimas
Systemic bias with respect to gender, race and ethnicity, often unconscious, is prevalent in datasets involving choices among individuals.
no code implementations • 1 Jun 2021 • Dimitris Bertsimas, Agni Orfanoudaki
Specifically, we present an optimization formulation to estimate the risk exposure of a binary classification model given a pre-defined range of premiums.
1 code implementation • 12 May 2021 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet
We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hull of epigraphs of simple matrix convex functions under low-rank constraints.
no code implementations • 7 Apr 2021 • Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet
Missing data is a common issue in real-world datasets.
no code implementations • 3 Mar 2021 • Dimitris Bertsimas, Michael Lingzhi Li
We introduce a stochastic version of the cutting-plane method for a large class of data-driven Mixed-Integer Nonlinear Optimization (MINLO) problems.
1 code implementation • 22 Feb 2021 • Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami
The algorithm is highly scalable, allowing us to train models with thousands of parameters.
no code implementations • 15 Feb 2021 • Dimitris Bertsimas, Vassilis Digalakis Jr., Alexander Jacquillat, Michael Lingzhi Li, Alessandro Previero
As compared to benchmarks based on demographic and epidemiological information, the proposed optimization approach increases the effectiveness of the vaccination campaign by an estimated $20\%$, saving an extra $4000$ extra lives in the United States over a three-month period.
no code implementations • 8 Dec 2020 • Dimitris Bertsimas, Jack Dunn, Emma Gibson, Agni Orfanoudaki
Tree-based models are increasingly popular due to their ability to identify complex relationships that are beyond the scope of parametric models.
3 code implementations • 11 Nov 2020 • Léonard Boussioux, Cynthia Zeng, Théo Guénais, Dimitris Bertsimas
In summary, our work demonstrates that utilizing machine learning techniques to combine different data sources can lead to new opportunities in tropical cyclone forecasting.
1 code implementation • 22 Sep 2020 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet
We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality.
no code implementations • 17 Jul 2020 • Dimitris Bertsimas, Vassilis Digalakis Jr
We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning.
no code implementations • 30 Jun 2020 • Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng
Specifically, we propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19, predict its mortality, forecast its evolution, and ultimately alleviate its impact.
no code implementations • 11 Jun 2020 • Dimitris Bertsimas, Vassilis Digalakis Jr
We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems.
1 code implementation • 11 May 2020 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet
Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features.
no code implementations • 21 Oct 2019 • Dimitris Bertsimas, Michael Lingzhi Li
We formulate the problem of matrix completion with and without side information as a non-convex optimization problem.
no code implementations • 18 Oct 2019 • Dimitris Bertsimas, Agni Orfanoudaki, Rory B. Weiner
We are able to estimate with average R squared = 0. 801 the time from diagnosis to a potential adverse event (TAE) and gain accurate approximations of the counterfactual treatment effects.
no code implementations • 8 Oct 2019 • Dimitris Bertsimas, Ryan Cory-Wright
We study a cutting-plane method for semidefinite optimization problems (SDOs), and supply a proof of the method's convergence, under a boundedness assumption.
no code implementations • 17 Jul 2019 • Dimitris Bertsimas, Christopher McCord, Bradley Sturt
Through a novel measure concentration result for a class of machine learning methods, we prove that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information.
no code implementations • 8 Jul 2019 • Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin
When quantitative models are used to support decision-making on complex and important topics, understanding a model's ``reasoning'' can increase trust in its predictions, expose hidden biases, or reduce vulnerability to adversarial attacks.
no code implementations • 8 Jul 2019 • Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin
We propose a general optimization framework to create explanations for linear models.
1 code implementation • 4 Jul 2019 • Dimitris Bertsimas, Bartolomeo Stellato
Compared to state-of-the-art MIO routines, the online running time of our method is very predictable and can be lower than a single matrix factorization time.
no code implementations • 3 Jul 2019 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet
We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems.
no code implementations • 25 Jun 2019 • Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet
We consider the maximum likelihood estimation of sparse inverse covariance matrices.
no code implementations • 26 Apr 2019 • Dimitris Bertsimas, Christopher McCord
In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data.
1 code implementation • 18 Feb 2019 • Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys
A cogent feature selection method is expected to exhibit a two-fold convergence, namely the accuracy and false detection rate should converge to $1$ and $0$ respectively, as the sample size increases.
Methodology
no code implementations • 8 Feb 2019 • Dimitris Bertsimas, Michael Lingzhi Li
We propose a new scalable algorithm for holistic linear regression building on Bertsimas & King (2016).
1 code implementation • 24 Dec 2018 • Dimitris Bertsimas, Bartolomeo Stellato
We introduce the idea that using optimal classification trees (OCTs) and optimal classification trees with-hyperplanes (OCT-Hs), interpretable machine learning algorithms developed by Bertsimas and Dunn [2017, 2018], we are able to obtain insight on the strategy behind the optimal solution in continuous and mixed-integer convex optimization problem as a function of key parameters that affect the problem.
Optimization and Control
no code implementations • 17 Dec 2018 • Dimitris Bertsimas, Michael Lingzhi Li
We consider the problem of matrix completion on an $n \times m$ matrix.
no code implementations • 3 Dec 2018 • Dimitris Bertsimas, Agni Orfanoudaki, Holly Wiberg
State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability.
no code implementations • 2 Dec 2018 • Dimitris Bertsimas, Agni Orfanoudaki, Colin Pawlowski
Missing data is a common problem in real-world settings and particularly relevant in healthcare applications where researchers use Electronic Health Records (EHR) and results of observational studies to apply analytics methods.
1 code implementation • 31 Oct 2018 • Dimitris Bertsimas, Ryan Cory-Wright
In numerical experiments, we establish that the outer-approximation procedure gives rise to dramatic speedups for sparse portfolio selection problems.
Optimization and Control
no code implementations • NeurIPS 2018 • Dimitris Bertsimas, Christopher McCord
We consider the optimization of an uncertain objective over continuous and multi-dimensional decision spaces in problems in which we are only provided with observational data.
1 code implementation • 27 Nov 2017 • Dimitris Bertsimas, Bart Van Parys
The associated robust prescriptive methods furthermore reduce to convenient tractable convex optimization problems in the context of local learning methods such as nearest neighbors and Nadaraya-Watson learning.
1 code implementation • 3 Oct 2017 • Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys
In this paper, we formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly.
Optimization and Control
no code implementations • 28 Sep 2017 • Dimitris Bertsimas, Bart Van Parys
We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective.
no code implementations • 28 Sep 2017 • Dimitris Bertsimas, Bart Van Parys
The ability of our method to identify all $k$ relevant inputs and all $\ell$ monomial terms is shown empirically to experience a phase transition.
1 code implementation • 15 Aug 2017 • Dimitris Bertsimas, Martin S. Copenhaver, Rahul Mazumder
Nonconvex penalty methods for sparse modeling in linear regression have been a topic of fervent interest in recent years.
no code implementations • 11 Jul 2015 • Dimitris Bertsimas, Angela King, Rahul Mazumder
In the last twenty-five years (1990-2014), algorithmic advances in integer optimization combined with hardware improvements have resulted in an astonishing 200 billion factor speedup in solving Mixed Integer Optimization (MIO) problems.
no code implementations • 22 Nov 2014 • Dimitris Bertsimas, Martin S. Copenhaver
The notion of developing statistical methods in machine learning which are robust to adversarial perturbations in the underlying data has been the subject of increasing interest in recent years.
no code implementations • 21 May 2014 • Dimitris Bertsimas, J. Daniel Griffith, Vishal Gupta, Mykel J. Kochenderfer, Velibor V. Mišić, Robert Moss
In this paper, we adapt both MCTS and MO to a problem inspired by tactical wildfire and management and undertake an extensive computational study comparing the two methods on large scale instances in terms of both the state and the action spaces.
1 code implementation • 22 Feb 2014 • Dimitris Bertsimas, Nathan Kallus
To demonstrate the power of our approach in a real-world setting we study an inventory management problem faced by the distribution arm of an international media conglomerate, which ships an average of 1bil units per year.