no code implementations • 28 Mar 2024 • Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis
Using SHAP feature importance, we show that analytical insights are consistent across retraining iterations.
no code implementations • 18 Jan 2024 • Phevos Paschalidis, Runyu Zhang, Na Li
The reward of the system is modeled as a weighted sum of the rewards the agents observe, where the weights capture some transformation of the reward associated with multiple agents sampling the same node at the same time.
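As a rough illustration of the reward model described above, the sketch below computes a system reward as a weighted sum of per-agent rewards, with each weight depending on how many agents sampled the same node in that round. The function names (`system_reward`, `split_evenly`) and the particular weight rule (dividing by the number of co-located agents) are assumptions made for illustration only; the abstract does not specify the transformation the paper uses.

```python
from collections import Counter
from typing import Callable, Hashable, Sequence


def system_reward(
    nodes: Sequence[Hashable],
    rewards: Sequence[float],
    weight: Callable[[int], float],
) -> float:
    """Weighted sum of the rewards the agents observe in one round.

    nodes[i]   -- node sampled by agent i
    rewards[i] -- reward observed by agent i
    weight(k)  -- weight applied to an agent whose node was sampled
                  by k agents in total (hypothetical interface)
    """
    if len(nodes) != len(rewards):
        raise ValueError("nodes and rewards must have the same length")
    counts = Counter(nodes)  # number of agents at each node
    return sum(weight(counts[n]) * r for n, r in zip(nodes, rewards))


def split_evenly(k: int) -> float:
    """Assumed example weight: co-located agents share credit equally."""
    return 1.0 / k


# Agents 0 and 1 collide on node "a"; agent 2 is alone on node "b".
nodes = ["a", "a", "b"]
rewards = [1.0, 0.5, 2.0]
total = system_reward(nodes, rewards, split_evenly)
```

Passing a different `weight` function (for example, one that returns zero whenever more than one agent samples a node) changes how collisions are penalized without touching the summation itself.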