Search Results for author: Dimitris Bertsimas

Found 66 papers, 24 papers with code

Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

no code implementations • 28 Mar 2024 • Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis

In this study, we develop a mixed integer optimization algorithm that holistically considers the problem of retraining machine learning models across different data batch updates.

Feature Importance

Paper
Add Code

Adaptive Optimization for Prediction with Missing Data

no code implementations • 2 Feb 2024 • Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions.

Imputation regression

Paper
Add Code

Universal Neurons in GPT2 Language Models

1 code implementation • 22 Jan 2024 • Wes Gurnee, Theo Horsley, Zifan Carl Guo, Tara Rezaei Kheirkhah, Qinyi Sun, Will Hathaway, Neel Nanda, Dimitris Bertsimas

In other words, are neural mechanisms universal across different models?

Paper
Code

Robust Regression over Averaged Uncertainty

no code implementations • 12 Nov 2023 • Dimitris Bertsimas, Yu Ma

We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least-squared regression problem.

regression

Paper
Add Code

Global Optimization: A Machine Learning Approach

no code implementations • 3 Nov 2023 • Dimitris Bertsimas, Georgios Margaritis

Many approaches for addressing Global Optimization problems typically rely on relaxations of nonlinear constraints over specific mathematical primitives.

Paper
Add Code

The R.O.A.D. to precision medicine

no code implementations • 3 Nov 2023 • Dimitris Bertsimas, Angelos G. Koulouras, Georgios Antonios Margonis

We applied our framework to observational data of patients with gastrointestinal stromal tumors (GIST) and validated the OPTs in an external cohort using the sensitivity and specificity metrics.

Specificity

Paper
Add Code

Optimal Control of Multiclass Fluid Queueing Networks: A Machine Learning Approach

no code implementations • 23 Jul 2023 • Dimitris Bertsimas, Cheol Woo Kim

We use numerical solutions of MFQNET control problems as a training set and apply OCT-H to learn explicit control policies.

Paper
Add Code

A Machine Learning Approach to Two-Stage Adaptive Robust Optimization

no code implementations • 23 Jul 2023 • Dimitris Bertsimas, Cheol Woo Kim

We propose an approach based on machine learning to solve two-stage linear adaptive robust optimization (ARO) problems with binary here-and-now variables and polyhedral uncertainty sets.

Paper
Add Code

Compressed Sensing: A Discrete Optimization Approach

1 code implementation • 5 Jun 2023 • Dimitris Bertsimas, Nicholas A. G. Johnson

We derive a second order cone relaxation of this problem and show that under mild conditions on the regularization parameter, the resulting relaxation is equivalent to the well studied basis pursuit denoising problem.

Data Compression Denoising +1

Paper
Code

Improving Stability in Decision Tree Models

no code implementations • 26 May 2023 • Dimitris Bertsimas, Vassilis Digalakis Jr

Owing to their inherently interpretable structure, decision trees are commonly used in applications where interpretability is essential.

Paper
Add Code

Patient Outcome Predictions Improve Operations at a Large Hospital Network

no code implementations • 25 May 2023 • Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein, Dimitris Bertsimas

Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals.

Decision Making

Paper
Add Code

Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

2 code implementations • 20 May 2023 • Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible.

Low-Rank Matrix Completion Product Recommendation

Paper
Code

Finding Neurons in a Haystack: Case Studies with Sparse Probing

2 code implementations • 2 May 2023 • Wes Gurnee, Neel Nanda, Matthew Pauly, Katherine Harvey, Dmitrii Troitskii, Dimitris Bertsimas

Despite rapid adoption and deployment of large language models (LLMs), the internal computations of these models remain opaque and poorly understood.

2,029

Paper
Code

Ensemble Modeling for Time Series Forecasting: an Adaptive Robust Optimization Approach

1 code implementation • 9 Apr 2023 • Dimitris Bertsimas, Leonard Boussioux

Accurate time series forecasting is critical for a wide range of problems with temporal data.

Management Time Series +2

Paper
Code

Reducing Air Pollution through Machine Learning

no code implementations • 22 Mar 2023 • Dimitris Bertsimas, Leonard Boussioux, Cynthia Zeng

The predictive component of our framework employs various machine learning models, such as gradient-boosted tree-based models and ensemble methods, for time series forecasting.

Time Series Forecasting

Paper
Add Code

Multistage Stochastic Optimization via Kernels

no code implementations • 11 Mar 2023 • Dimitris Bertsimas, Kimberly Villalobos Carballo

We prove that the proposed approach is asymptotically optimal for multistage stochastic optimization with side information.

Management Stochastic Optimization

Paper
Add Code

Global Flood Prediction: a Multimodal Machine Learning Approach

no code implementations • 29 Jan 2023 • Cynthia Zeng, Dimitris Bertsimas

This work presents a novel multimodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset.

Management Time Series +2

Paper
Add Code

Distributionally Robust Causal Inference with Observational Data

no code implementations • 15 Oct 2022 • Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li

We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders.

Causal Inference

Paper
Add Code

TabText: A Flexible and Contextual Approach to Tabular Data Representation

no code implementations • 21 Jun 2022 • Kimberly Villalobos Carballo, Liangyuan Na, Yu Ma, Léonard Boussioux, Cynthia Zeng, Luis R. Soenksen, Dimitris Bertsimas

We show that 1) applying our TabText framework enables the generation of high-performing and simple machine learning baseline models with minimal data pre-processing, and 2) augmenting pre-processed tabular data with TabText representations improves the average and worst-case AUC performance of standard machine learning models by as much as 6%.

Paper
Add Code

Learning Sparse Nonlinear Dynamics via Mixed-Integer Optimization

1 code implementation • 1 Jun 2022 • Dimitris Bertsimas, Wes Gurnee

Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning.

Model Discovery regression

Paper
Code

Integrated multimodal artificial intelligence framework for healthcare applications

1 code implementation • 25 Feb 2022 • Luis R. Soenksen, Yu Ma, Cynthia Zeng, Leonard D. J. Boussioux, Kimberly Villalobos Carballo, Liangyuan Na, Holly M. Wiberg, Michael L. Li, Ignacio Fuentes, Dimitris Bertsimas

The generalizable properties and flexibility of our Holistic AI in Medicine (HAIM) framework could offer a promising pathway for future multimodal predictive systems in clinical and operational healthcare settings.

Time Series Analysis

Paper
Code

Robust Upper Bounds for Adversarial Training

1 code implementation • 17 Dec 2021 • Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick den Hertog

We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss that is based on a holistic expansion of the network instead of separate bounds for each layer.

Paper
Code

Mixed-Integer Optimization with Constraint Learning

1 code implementation • 4 Nov 2021 • Donato Maragno, Holly Wiberg, Dimitris Bertsimas, S. Ilker Birbil, Dick den Hertog, Adejuyigbe Fajemisin

The case studies illustrate the framework's ability to generate high-quality prescriptions as well as the value added by the trust region, the use of ensembles to control model robustness, the consideration of multiple machine learning methods, and the inclusion of multiple learned constraints.

BIG-bench Machine Learning Decision Making +1

100

Paper
Code

Holistic Deep Learning

1 code implementation • 29 Oct 2021 • Dimitris Bertsimas, Kimberly Villalobos Carballo, Léonard Boussioux, Michael Lingzhi Li, Alex Paskov, Ivan Paskov

This paper presents a novel holistic deep learning framework that simultaneously addresses the challenges of vulnerability to input perturbations, overparametrization, and performance instability from different train-validation splits.

Adversarial Robustness

Paper
Code

Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

1 code implementation • 26 Sep 2021 • Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson

We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth.

Collaborative Filtering Data Compression

Paper
Code

The Price of Diversity

no code implementations • 3 Jul 2021 • Hari Bandi, Dimitris Bertsimas

Systemic bias with respect to gender, race and ethnicity, often unconscious, is prevalent in datasets involving choices among individuals.

Paper
Add Code

Algorithmic Insurance

no code implementations • 1 Jun 2021 • Dimitris Bertsimas, Agni Orfanoudaki

Specifically, we present an optimization formulation to estimate the risk exposure of a binary classification model given a pre-defined range of premiums.

Binary Classification Breast Cancer Detection +1

Paper
Add Code

A new perspective on low-rank optimization

1 code implementation • 12 May 2021 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hull of epigraphs of simple matrix convex functions under low-rank constraints.

Paper
Code

Simple Imputation Rules for Prediction with Missing Data: Contrasting Theoretical Guarantees with Empirical Performance

no code implementations • 7 Apr 2021 • Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

Missing data is a common issue in real-world datasets.

Common Sense Reasoning Imputation +2

Paper
Add Code

Stochastic Cutting Planes for Data-Driven Optimization

no code implementations • 3 Mar 2021 • Dimitris Bertsimas, Michael Lingzhi Li

We introduce a stochastic version of the cutting-plane method for a large class of data-driven Mixed-Integer Nonlinear Optimization (MINLO) problems.

Paper
Add Code

Slowly Varying Regression under Sparsity

1 code implementation • 22 Feb 2021 • Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami

The algorithm is highly scalable, allowing us to train models with thousands of parameters.

regression

Paper
Code

Where to locate COVID-19 mass vaccination facilities?

no code implementations • 15 Feb 2021 • Dimitris Bertsimas, Vassilis Digalakis Jr., Alexander Jacquillat, Michael Lingzhi Li, Alessandro Previero

As compared to benchmarks based on demographic and epidemiological information, the proposed optimization approach increases the effectiveness of the vaccination campaign by an estimated $20\%$, saving an extra $4000$ extra lives in the United States over a three-month period.

Fairness

Paper
Add Code

Optimal Survival Trees

no code implementations • 8 Dec 2020 • Dimitris Bertsimas, Jack Dunn, Emma Gibson, Agni Orfanoudaki

Tree-based models are increasingly popular due to their ability to identify complex relationships that are beyond the scope of parametric models.

Paper
Add Code

Hurricane Forecasting: A Novel Multimodal Machine Learning Framework

3 code implementations • 11 Nov 2020 • Léonard Boussioux, Cynthia Zeng, Théo Guénais, Dimitris Bertsimas

In summary, our work demonstrates that utilizing machine learning techniques to combine different data sources can lead to new opportunities in tropical cyclone forecasting.

BIG-bench Machine Learning Hurricane Forecasting +2

Paper
Code

Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

1 code implementation • 22 Sep 2020 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality.

Paper
Code

Frequency Estimation in Data Streams: Learning the Optimal Hashing Scheme

no code implementations • 17 Jul 2020 • Dimitris Bertsimas, Vassilis Digalakis Jr

We present a novel approach for the problem of frequency estimation in data streams that is based on optimization and machine learning.

BIG-bench Machine Learning

Paper
Add Code

From predictions to prescriptions: A data-driven response to COVID-19

no code implementations • 30 Jun 2020 • Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng

Specifically, we propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19, predict its mortality, forecast its evolution, and ultimately alleviate its impact.

Descriptive Management

Paper
Add Code

The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

no code implementations • 11 Jun 2020 • Dimitris Bertsimas, Vassilis Digalakis Jr

We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems.

BIG-bench Machine Learning regression +1

Paper
Add Code

Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

1 code implementation • 11 May 2020 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features.

Dimensionality Reduction

Paper
Code

Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion

no code implementations • 21 Oct 2019 • Dimitris Bertsimas, Michael Lingzhi Li

We formulate the problem of matrix completion with and without side information as a non-convex optimization problem.

Matrix Completion

Paper
Add Code

Personalized Treatment for Coronary Artery Disease Patients: A Machine Learning Approach

no code implementations • 18 Oct 2019 • Dimitris Bertsimas, Agni Orfanoudaki, Rory B. Weiner

We are able to estimate with average R squared = 0. 801 the time from diagnosis to a potential adverse event (TAE) and gain accurate approximations of the counterfactual treatment effects.

BIG-bench Machine Learning counterfactual +1

Paper
Add Code

On Polyhedral and Second-Order Cone Decompositions of Semidefinite Optimization Problems

no code implementations • 8 Oct 2019 • Dimitris Bertsimas, Ryan Cory-Wright

We study a cutting-plane method for semidefinite optimization problems (SDOs), and supply a proof of the method's convergence, under a boundedness assumption.

Paper
Add Code

Dynamic optimization with side information

no code implementations • 17 Jul 2019 • Dimitris Bertsimas, Christopher McCord, Bradley Sturt

Through a novel measure concentration result for a class of machine learning methods, we prove that the proposed approach is asymptotically optimal for multi-period stochastic programming with side information.

BIG-bench Machine Learning Management

Paper
Add Code

The Price of Interpretability

no code implementations • 8 Jul 2019 • Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

When quantitative models are used to support decision-making on complex and important topics, understanding a model's ``reasoning'' can increase trust in its predictions, expose hidden biases, or reduce vulnerability to adversarial attacks.

Decision Making

Paper
Add Code

Optimal Explanations of Linear Models

no code implementations • 8 Jul 2019 • Dimitris Bertsimas, Arthur Delarue, Patrick Jaillet, Sebastien Martin

We propose a general optimization framework to create explanations for linear models.

Paper
Add Code

Online Mixed-Integer Optimization in Milliseconds

1 code implementation • 4 Jul 2019 • Dimitris Bertsimas, Bartolomeo Stellato

Compared to state-of-the-art MIO routines, the online running time of our method is very predictable and can be lower than a single matrix factorization time.

energy management Management +2

Paper
Code

A unified approach to mixed-integer optimization problems with logical constraints

no code implementations • 3 Jul 2019 • Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems.

Sparse Learning

Paper
Add Code

Certifiably Optimal Sparse Inverse Covariance Estimation

no code implementations • 25 Jun 2019 • Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet

We consider the maximum likelihood estimation of sparse inverse covariance matrices.

Paper
Add Code

From Predictions to Prescriptions in Multistage Optimization Problems

no code implementations • 26 Apr 2019 • Dimitris Bertsimas, Christopher McCord

In this paper, we introduce a framework for solving finite-horizon multistage optimization problems under uncertainty in the presence of auxiliary data.

regression

Paper
Add Code

Sparse Regression: Scalable algorithms and empirical performance

1 code implementation • 18 Feb 2019 • Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

A cogent feature selection method is expected to exhibit a two-fold convergence, namely the accuracy and false detection rate should converge to $1$ and $0$ respectively, as the sample size increases.

Methodology

Paper
Code

Scalable Holistic Linear Regression

no code implementations • 8 Feb 2019 • Dimitris Bertsimas, Michael Lingzhi Li

We propose a new scalable algorithm for holistic linear regression building on Bertsimas & King (2016).

regression

Paper
Add Code

The Voice of Optimization

1 code implementation • 24 Dec 2018 • Dimitris Bertsimas, Bartolomeo Stellato

We introduce the idea that using optimal classification trees (OCTs) and optimal classification trees with-hyperplanes (OCT-Hs), interpretable machine learning algorithms developed by Bertsimas and Dunn [2017, 2018], we are able to obtain insight on the strategy behind the optimal solution in continuous and mixed-integer convex optimization problem as a function of key parameters that affect the problem.

Optimization and Control

Paper
Code

Interpretable Matrix Completion: A Discrete Optimization Approach

no code implementations • 17 Dec 2018 • Dimitris Bertsimas, Michael Lingzhi Li

We consider the problem of matrix completion on an $n \times m$ matrix.

Matrix Completion

Paper
Add Code

Interpretable Clustering via Optimal Trees

no code implementations • 3 Dec 2018 • Dimitris Bertsimas, Agni Orfanoudaki, Holly Wiberg

State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability.

Clustering

Paper
Add Code

Imputation of Clinical Covariates in Time Series

no code implementations • 2 Dec 2018 • Dimitris Bertsimas, Agni Orfanoudaki, Colin Pawlowski

Missing data is a common problem in real-world settings and particularly relevant in healthcare applications where researchers use Electronic Health Records (EHR) and results of observational studies to apply analytics methods.

Imputation Time Series +1

Paper
Add Code

A Scalable Algorithm For Sparse Portfolio Selection

1 code implementation • 31 Oct 2018 • Dimitris Bertsimas, Ryan Cory-Wright

In numerical experiments, we establish that the outer-approximation procedure gives rise to dramatic speedups for sparse portfolio selection problems.

Optimization and Control

Paper
Code

Optimization over Continuous and Multi-dimensional Decisions with Observational Data

no code implementations • NeurIPS 2018 • Dimitris Bertsimas, Christopher McCord

We consider the optimization of an uncertain objective over continuous and multi-dimensional decision spaces in problems in which we are only provided with observational data.

BIG-bench Machine Learning

Paper
Add Code

Bootstrap Robust Prescriptive Analytics

1 code implementation • 27 Nov 2017 • Dimitris Bertsimas, Bart Van Parys

The associated robust prescriptive methods furthermore reduce to convenient tractable convex optimization problems in the context of local learning methods such as nearest neighbors and Nadaraya-Watson learning.

Decision Making

Paper
Code

Sparse Classification and Phase Transitions: A Discrete Optimization Perspective

1 code implementation • 3 Oct 2017 • Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

In this paper, we formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly.

Optimization and Control

Paper
Code

Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

no code implementations • 28 Sep 2017 • Dimitris Bertsimas, Bart Van Parys

We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective.

regression Vocal Bursts Intensity Prediction

Paper
Add Code

Sparse Hierarchical Regression with Polynomials

no code implementations • 28 Sep 2017 • Dimitris Bertsimas, Bart Van Parys

The ability of our method to identify all $k$ relevant inputs and all $\ell$ monomial terms is shown empirically to experience a phase transition.

regression

Paper
Add Code

The Trimmed Lasso: Sparsity and Robustness

1 code implementation • 15 Aug 2017 • Dimitris Bertsimas, Martin S. Copenhaver, Rahul Mazumder

Nonconvex penalty methods for sparse modeling in linear regression have been a topic of fervent interest in recent years.

Paper
Code

Best Subset Selection via a Modern Optimization Lens

no code implementations • 11 Jul 2015 • Dimitris Bertsimas, Angela King, Rahul Mazumder

In the last twenty-five years (1990-2014), algorithmic advances in integer optimization combined with hardware improvements have resulted in an astonishing 200 billion factor speedup in solving Mixed Integer Optimization (MIO) problems.

regression Sparse Learning

Paper
Add Code

Characterization of the equivalence of robustification and regularization in linear and matrix regression

no code implementations • 22 Nov 2014 • Dimitris Bertsimas, Martin S. Copenhaver

The notion of developing statistical methods in machine learning which are robust to adversarial perturbations in the underlying data has been the subject of increasing interest in recent years.

Matrix Completion regression

Paper
Add Code

A Comparison of Monte Carlo Tree Search and Mathematical Optimization for Large Scale Dynamic Resource Allocation

no code implementations • 21 May 2014 • Dimitris Bertsimas, J. Daniel Griffith, Vishal Gupta, Mykel J. Kochenderfer, Velibor V. Mišić, Robert Moss

In this paper, we adapt both MCTS and MO to a problem inspired by tactical wildfire and management and undertake an extensive computational study comparing the two methods on large scale instances in terms of both the state and the action spaces.

Management Stochastic Optimization

Paper
Add Code

From Predictive to Prescriptive Analytics

1 code implementation • 22 Feb 2014 • Dimitris Bertsimas, Nathan Kallus

To demonstrate the power of our approach in a real-world setting we study an inventory management problem faced by the distribution arm of an international media conglomerate, which ships an average of 1bil units per year.

Management Stochastic Optimization

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.