Search Results for author: Dimitris Bertsimas

Found 72 papers, 25 papers with code

Deep Trees for (Un)structured Data: Tractability, Performance, and Interpretability

no code implementations28 Oct 2024 Dimitris Bertsimas, Lisa Everest, Jiayi Gu, Matthew Peroni, Vasiliki Stoumpou

Overall, our approach of Generalized Soft Trees provides a tractable method that is high-performing on (un)structured datasets and preserves interpretability more than traditional deep learning methods.

STS

Binary Classification: Is Boosting stronger than Bagging?

no code implementations24 Oct 2024 Dimitris Bertsimas, Vasiliki Stoumpou

In binary classification problems, the proposed extensions and the corresponding results suggest the equivalence of bagging and boosting methods in performance, and the edge of bagging in interpretability by leveraging a few learners of the ensemble, which is not an option in the less explainable boosting methods.

Binary Classification Classification

Catastrophe Insurance: An Adaptive Robust Optimization Approach

no code implementations11 May 2024 Dimitris Bertsimas, Cynthia Zeng

The escalating frequency and severity of natural disasters, exacerbated by climate change, underscore the critical role of insurance in facilitating recovery and promoting investments in risk reduction.

M3H: Multimodal Multitask Machine Learning for Healthcare

no code implementations29 Apr 2024 Dimitris Bertsimas, Yu Ma

Developing an integrated many-to-many framework leveraging multimodal data for multiple tasks is crucial to unifying healthcare applications ranging from diagnoses to operations.

Binary Classification Patient Phenotyping +1

Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

no code implementations28 Mar 2024 Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis

We consider the task of retraining machine learning (ML) models when new batches of data become available.

Feature Importance

Adaptive Optimization for Prediction with Missing Data

no code implementations2 Feb 2024 Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions.

Imputation regression

Robust Regression over Averaged Uncertainty

no code implementations12 Nov 2023 Dimitris Bertsimas, Yu Ma

We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least squares regression problem.

regression

The R.O.A.D. to precision medicine

no code implementations3 Nov 2023 Dimitris Bertsimas, Angelos G. Koulouras, Georgios Antonios Margonis

We applied our framework to observational data of patients with gastrointestinal stromal tumors (GIST) and validated the OPTs in an external cohort using the sensitivity and specificity metrics.

Specificity

Global Optimization: A Machine Learning Approach

no code implementations3 Nov 2023 Dimitris Bertsimas, Georgios Margaritis

Many approaches for addressing Global Optimization problems typically rely on relaxations of nonlinear constraints over specific mathematical primitives.

A Machine Learning Approach to Two-Stage Adaptive Robust Optimization

no code implementations23 Jul 2023 Dimitris Bertsimas, Cheol Woo Kim

We propose an approach based on machine learning to solve two-stage linear adaptive robust optimization (ARO) problems with binary here-and-now variables and polyhedral uncertainty sets.

Optimal Control of Multiclass Fluid Queueing Networks: A Machine Learning Approach

no code implementations23 Jul 2023 Dimitris Bertsimas, Cheol Woo Kim

We use numerical solutions of MFQNET control problems as a training set and apply OCT-H to learn explicit control policies.

Compressed Sensing: A Discrete Optimization Approach

1 code implementation5 Jun 2023 Dimitris Bertsimas, Nicholas A. G. Johnson

On real world ECG data, for a given $\ell_2$ reconstruction error our approach produces solutions that are on average $9. 95\%$ more sparse than benchmark methods ($3. 88\%$ more sparse if only compared against the best performing benchmark), while for a given sparsity level our approach produces solutions that have on average $10. 77\%$ lower reconstruction error than benchmark methods ($1. 42\%$ lower error if only compared against the best performing benchmark).

Data Compression Denoising +2

Improving Stability in Decision Tree Models

no code implementations26 May 2023 Dimitris Bertsimas, Vassilis Digalakis Jr

Owing to their inherently interpretable structure, decision trees are commonly used in applications where interpretability is essential.

Patient Outcome Predictions Improve Operations at a Large Hospital Network

no code implementations25 May 2023 Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals.

Decision Making

Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

2 code implementations20 May 2023 Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible.

Low-Rank Matrix Completion Product Recommendation

Finding Neurons in a Haystack: Case Studies with Sparse Probing

2 code implementations2 May 2023 Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas

Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood.

Reducing Air Pollution through Machine Learning

no code implementations22 Mar 2023 Dimitris Bertsimas, Leonard Boussioux, Cynthia Zeng

The predictive component of our framework employs various machine learning models, such as gradient-boosted tree-based models and ensemble methods, for time series forecasting.

Time Series Forecasting

Multistage Stochastic Optimization via Kernels

no code implementations11 Mar 2023 Dimitris Bertsimas, Kimberly Villalobos Carballo

We prove that the proposed approach is asymptotically optimal for multistage stochastic optimization with side information.

Management Stochastic Optimization

Global Flood Prediction: a Multimodal Machine Learning Approach

no code implementations29 Jan 2023 Cynthia Zeng, Dimitris Bertsimas

This work presents a novel multimodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset.

Management Time Series +2

Distributionally Robust Causal Inference with Observational Data

no code implementations15 Oct 2022 Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li

We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders.

Causal Inference

TabText: A Flexible and Contextual Approach to Tabular Data Representation

no code implementations21 Jun 2022 Kimberly Villalobos Carballo, Liangyuan Na, Yu Ma, Léonard Boussioux, Cynthia Zeng, Luis R. Soenksen, Dimitris Bertsimas

We show that 1) applying our TabText framework enables the generation of high-performing and simple machine learning baseline models with minimal data pre-processing, and 2) augmenting pre-processed tabular data with TabText representations improves the average and worst-case AUC performance of standard machine learning models by as much as 6%.

Learning Sparse Nonlinear Dynamics via Mixed-Integer Optimization

1 code implementation1 Jun 2022 Dimitris Bertsimas, Wes Gurnee

Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning.

Model Discovery regression

Integrated multimodal artificial intelligence framework for healthcare applications

1 code implementation25 Feb 2022 Luis R. Soenksen, Yu Ma, Cynthia Zeng, Leonard D. J. Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly M. Wiberg, Michael L. Li, Ignacio Fuentes, Dimitris Bertsimas

The generalizable properties and flexibility of our Holistic AI in Medicine (HAIM) framework could offer a promising pathway for future multimodal predictive systems in clinical and operational healthcare settings.

Time Series Analysis

Robust Upper Bounds for Adversarial Training

1 code implementation17 Dec 2021 Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick den Hertog

We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss that is based on a holistic expansion of the network instead of separate bounds for each layer.

Mixed-Integer Optimization with Constraint Learning

1 code implementation4 Nov 2021 Donato Maragno, Holly Wiberg, Dimitris Bertsimas, S. Ilker Birbil, Dick den Hertog, Adejuyigbe Fajemisin

The case studies illustrate the framework's ability to generate high-quality prescriptions as well as the value added by the trust region, the use of ensembles to control model robustness, the consideration of multiple machine learning methods, and the inclusion of multiple learned constraints.

BIG-bench Machine Learning Decision Making +1

Holistic Deep Learning

1 code implementation29 Oct 2021 Dimitris Bertsimas, Kimberly Villalobos Carballo, Léonard Boussioux, Michael Lingzhi Li, Alex Paskov, Ivan Paskov

This paper presents a novel holistic deep learning framework that simultaneously addresses the challenges of vulnerability to input perturbations, overparametrization, and performance instability from different train-validation splits.

Adversarial Robustness Deep Learning

Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

1 code implementation26 Sep 2021 Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson

We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth.

Collaborative Filtering Data Compression

The Price of Diversity

no code implementations3 Jul 2021 Hari Bandi, Dimitris Bertsimas

Systemic bias with respect to gender, race and ethnicity, often unconscious, is prevalent in datasets involving choices among individuals.

Diversity

Algorithmic Insurance

no code implementations1 Jun 2021 Dimitris Bertsimas, Agni Orfanoudaki

Specifically, we present an optimization formulation to estimate the risk exposure of a binary classification model given a pre-defined range of premiums.

Binary Classification Breast Cancer Detection +1

A new perspective on low-rank optimization

1 code implementation12 May 2021 Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hull of epigraphs of simple matrix convex functions under low-rank constraints.

Stochastic Cutting Planes for Data-Driven Optimization

no code implementations3 Mar 2021 Dimitris Bertsimas, Michael Lingzhi Li

We introduce a stochastic version of the cutting-plane method for a large class of data-driven Mixed-Integer Nonlinear Optimization (MINLO) problems.

Slowly Varying Regression under Sparsity

1 code implementation22 Feb 2021 Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami

The algorithm is highly scalable, allowing us to train models with thousands of parameters.

regression

Where to locate COVID-19 mass vaccination facilities?

no code implementations15 Feb 2021 Dimitris Bertsimas, Vassilis Digalakis Jr., Alexander Jacquillat, Michael Lingzhi Li, Alessandro Previero

As compared to benchmarks based on demographic and epidemiological information, the proposed optimization approach increases the effectiveness of the vaccination campaign by an estimated $20\%$, saving an extra $4000$ extra lives in the United States over a three-month period.

Fairness

Optimal Survival Trees

no code implementations8 Dec 2020 Dimitris Bertsimas, Jack Dunn, Emma Gibson, Agni Orfanoudaki

Tree-based models are increasingly popular due to their ability to identify complex relationships that are beyond the scope of parametric models.

Hurricane Forecasting: A Novel Multimodal Machine Learning Framework

3 code implementations11 Nov 2020 Léonard Boussioux, Cynthia Zeng, Théo Guénais, Dimitris Bertsimas

In summary, our work demonstrates that utilizing machine learning techniques to combine different data sources can lead to new opportunities in tropical cyclone forecasting.

BIG-bench Machine Learning Decoder +3

Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

1 code implementation22 Sep 2020 Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality.

Frequency Estimation in Data Streams: Learning the Optimal Hashing Scheme

no code implementations17 Jul 2020 Dimitris Bertsimas, Vassilis Digalakis Jr

We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning.

BIG-bench Machine Learning

The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

no code implementations11 Jun 2020 Dimitris Bertsimas, Vassilis Digalakis Jr

We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems.

BIG-bench Machine Learning regression +1

Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

1 code implementation11 May 2020 Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features.

Dimensionality Reduction

Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion

no code implementations21 Oct 2019 Dimitris Bertsimas, Michael Lingzhi Li

We formulate the problem of matrix completion with and without side information as a non-convex optimization problem.

Matrix Completion

Personalized Treatment for Coronary Artery Disease Patients: A Machine Learning Approach

no code implementations18 Oct 2019 Dimitris Bertsimas, Agni Orfanoudaki, Rory B. Weiner

We are able to estimate with average R squared = 0. 801 the time from diagnosis to a potential adverse event (TAE) and gain accurate approximations of the counterfactual treatment effects.

BIG-bench Machine Learning counterfactual +1

On Polyhedral and Second-Order Cone Decompositions of Semidefinite Optimization Problems

no code implementations8 Oct 2019 Dimitris Bertsimas, Ryan Cory-Wright

We study a cutting-plane method for semidefinite optimization problems (SDOs), and supply a proof of the method's convergence, under a boundedness assumption.

Dynamic optimization with side information

no code implementations17 Jul 2019 Dimitris Bertsimas, Christopher McCord, Bradley Sturt

Through a novel measure concentration result for a class of machine learning methods, we prove that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information.

BIG-bench Machine Learning Management

The Price of Interpretability

no code implementations8 Jul 2019 Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

When quantitative models are used to support decision-making on complex and important topics, understanding a model's ``reasoning'' can increase trust in its predictions, expose hidden biases, or reduce vulnerability to adversarial attacks.

Decision Making

Optimal Explanations of Linear Models

no code implementations8 Jul 2019 Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

We propose a general optimization framework to create explanations for linear models.

Online Mixed-Integer Optimization in Milliseconds

1 code implementation4 Jul 2019 Dimitris Bertsimas, Bartolomeo Stellato

Compared to state-of-the-art MIO routines, the online running time of our method is very predictable and can be lower than a single matrix factorization time.

energy management Management +2

A unified approach to mixed-integer optimization problems with logical constraints

no code implementations3 Jul 2019 Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems.

Sparse Learning

Certifiably Optimal Sparse Inverse Covariance Estimation

no code implementations25 Jun 2019 Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet

We consider the maximum likelihood estimation of sparse inverse covariance matrices.

From Predictions to Prescriptions in Multistage Optimization Problems

no code implementations26 Apr 2019 Dimitris Bertsimas, Christopher McCord

In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data.

regression

Sparse Regression: Scalable algorithms and empirical performance

1 code implementation18 Feb 2019 Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

A cogent feature selection method is expected to exhibit a two-fold convergence, namely the accuracy and false detection rate should converge to $1$ and $0$ respectively, as the sample size increases.

Methodology

Scalable Holistic Linear Regression

no code implementations8 Feb 2019 Dimitris Bertsimas, Michael Lingzhi Li

We propose a new scalable algorithm for holistic linear regression building on Bertsimas & King (2016).

regression

The Voice of Optimization

1 code implementation24 Dec 2018 Dimitris Bertsimas, Bartolomeo Stellato

We introduce the idea that using optimal classification trees (OCTs) and optimal classification trees with-hyperplanes (OCT-Hs), interpretable machine learning algorithms developed by Bertsimas and Dunn [2017, 2018], we are able to obtain insight on the strategy behind the optimal solution in continuous and mixed-integer convex optimization problem as a function of key parameters that affect the problem.

Optimization and Control

Interpretable Clustering via Optimal Trees

no code implementations3 Dec 2018 Dimitris Bertsimas, Agni Orfanoudaki, Holly Wiberg

State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability.

Clustering

Imputation of Clinical Covariates in Time Series

no code implementations2 Dec 2018 Dimitris Bertsimas, Agni Orfanoudaki, Colin Pawlowski

Missing data is a common problem in real-world settings and particularly relevant in healthcare applications where researchers use Electronic Health Records (EHR) and results of observational studies to apply analytics methods.

Imputation Time Series +1

A Scalable Algorithm For Sparse Portfolio Selection

1 code implementation31 Oct 2018 Dimitris Bertsimas, Ryan Cory-Wright

In numerical experiments, we establish that the outer-approximation procedure gives rise to dramatic speedups for sparse portfolio selection problems.

Optimization and Control

Optimization over Continuous and Multi-dimensional Decisions with Observational Data

no code implementations NeurIPS 2018 Dimitris Bertsimas, Christopher McCord

We consider the optimization of an uncertain objective over continuous and multi-dimensional decision spaces in problems in which we are only provided with observational data.

BIG-bench Machine Learning

Bootstrap Robust Prescriptive Analytics

1 code implementation27 Nov 2017 Dimitris Bertsimas, Bart Van Parys

The associated robust prescriptive methods furthermore reduce to convenient tractable convex optimization problems in the context of local learning methods such as nearest neighbors and Nadaraya-Watson learning.

Decision Making

Sparse Classification and Phase Transitions: A Discrete Optimization Perspective

1 code implementation3 Oct 2017 Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

In this paper, we formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly.

Optimization and Control

Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

no code implementations28 Sep 2017 Dimitris Bertsimas, Bart Van Parys

We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective.

regression Vocal Bursts Intensity Prediction

Sparse Hierarchical Regression with Polynomials

no code implementations28 Sep 2017 Dimitris Bertsimas, Bart Van Parys

The ability of our method to identify all $k$ relevant inputs and all $\ell$ monomial terms is shown empirically to experience a phase transition.

regression

The Trimmed Lasso: Sparsity and Robustness

1 code implementation15 Aug 2017 Dimitris Bertsimas, Martin S. Copenhaver, Rahul Mazumder

Nonconvex penalty methods for sparse modeling in linear regression have been a topic of fervent interest in recent years.

Best Subset Selection via a Modern Optimization Lens

no code implementations11 Jul 2015 Dimitris Bertsimas, Angela King, Rahul Mazumder

In the last twenty-five years (1990-2014), algorithmic advances in integer optimization combined with hardware improvements have resulted in an astonishing 200 billion factor speedup in solving Mixed Integer Optimization (MIO) problems.

regression Sparse Learning

Characterization of the equivalence of robustification and regularization in linear and matrix regression

no code implementations22 Nov 2014 Dimitris Bertsimas, Martin S. Copenhaver

The notion of developing statistical methods in machine learning which are robust to adversarial perturbations in the underlying data has been the subject of increasing interest in recent years.

Matrix Completion regression

A Comparison of Monte Carlo Tree Search and Mathematical Optimization for Large Scale Dynamic Resource Allocation

no code implementations21 May 2014 Dimitris Bertsimas, J. Daniel Griffith, Vishal Gupta, Mykel J. Kochenderfer, Velibor V. Mišić, Robert Moss

In this paper, we adapt both MCTS and MO to a problem inspired by tactical wildfire and management and undertake an extensive computational study comparing the two methods on large scale instances in terms of both the state and the action spaces.

Management Stochastic Optimization

From Predictive to Prescriptive Analytics

1 code implementation22 Feb 2014 Dimitris Bertsimas, Nathan Kallus

To demonstrate the power of our approach in a real-world setting we study an inventory management problem faced by the distribution arm of an international media conglomerate, which ships an average of 1bil units per year.

Management Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.