Search Results for author: Dan Ley

Found 8 papers, 3 papers with code

Are Large Language Models Post Hoc Explainers?

1 code implementation • 9 Oct 2023 • Nicholas Kroeger, Dan Ley, Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

To this end, several approaches have been proposed in recent literature to explain the behavior of complex predictive models in a post hoc fashion.

Explainable artificial intelligence • Explainable Artificial Intelligence (XAI) • +1
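A minimal sketch of the general idea, using a toy linear model as the black box; the prompt format and the `query_llm` client are hypothetical stand-ins, not the paper's actual prompting scheme:

```python
# Sketch: use an LLM as a post hoc explainer by serializing local
# perturbations of an input and the black-box model's outputs into a
# prompt, then asking which features drove the prediction.
import numpy as np

def build_explanation_prompt(model, x, feature_names, n_perturb=8, sigma=0.1, seed=0):
    rng = np.random.default_rng(seed)
    lines = ["Each line gives feature values and the model's prediction."]
    for _ in range(n_perturb):
        x_p = x + rng.normal(0.0, sigma, size=x.shape)  # local Gaussian perturbation
        pred = model(x_p)
        desc = ", ".join(f"{n}={v:.2f}" for n, v in zip(feature_names, x_p))
        lines.append(f"{desc} -> prediction={pred:.2f}")
    lines.append("Based on these input-output pairs, list the most important "
                 "features for the prediction, most important first.")
    return "\n".join(lines)

# Toy linear "black box" for illustration:
model = lambda x: float(3 * x[0] - 0.5 * x[1])
x0 = np.array([1.0, 2.0])
prompt = build_explanation_prompt(model, x0, ["age", "income"])
# response = query_llm(prompt)  # hypothetical LLM client call
print(prompt)
```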

On Minimizing the Impact of Dataset Shifts on Actionable Explanations

no code implementations • 11 Jun 2023 • Anna P. Meyer, Dan Ley, Suraj Srinivas, Himabindu Lakkaraju

To this end, we conduct a rigorous theoretical analysis demonstrating that model curvature, the weight decay used during training, and the magnitude of the dataset shift are key factors determining the extent of explanation (in)stability.
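A small empirical sketch of one of these factors (not the paper's theoretical analysis): fit the same logistic regression before and after a covariate shift at two weight-decay strengths and compare top-k attribution agreement. The shift magnitude and `C` values here are illustrative:

```python
# For logistic regression, the input gradient of the logit is the weight
# vector, so the fitted weights serve as the feature attribution here.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

def topk_agreement(e1, e2, k=3):
    """Fraction of shared features among the top-k by |attribution|."""
    t1 = set(np.argsort(-np.abs(e1))[:k])
    t2 = set(np.argsort(-np.abs(e2))[:k])
    return len(t1 & t2) / k

X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_shift = X + np.random.default_rng(0).normal(0.0, 0.5, size=X.shape)  # covariate shift

# Larger C = weaker weight decay; the paper links weight decay to stability.
for C in (100.0, 0.01):
    w_orig = LogisticRegression(C=C, max_iter=1000).fit(X, y).coef_[0]
    w_shift = LogisticRegression(C=C, max_iter=1000).fit(X_shift, y).coef_[0]
    print(f"C={C}: top-3 agreement = {topk_agreement(w_orig, w_shift):.2f}")
```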

Consistent Explanations in the Face of Model Indeterminacy via Ensembling

no code implementations • 9 Jun 2023 • Dan Ley, Leonard Tang, Matthew Nazari, Hongjin Lin, Suraj Srinivas, Himabindu Lakkaraju

This work addresses the challenge of providing consistent explanations for predictive models in the presence of model indeterminacy, which arises due to the existence of multiple (nearly) equally well-performing models for a given dataset and task.
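A minimal sketch of the ensembling idea under assumed choices (sklearn MLPs retrained with different seeds as the near-equivalent models, finite-difference gradient attributions as the explanation):

```python
# Models retrained with different seeds can disagree in their explanations;
# averaging attributions over the ensemble yields a more consistent one.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=500, n_features=8, random_state=0)
x0 = X[0:1]

def finite_diff_attribution(model, x, eps=1e-3):
    """Finite-difference gradient of P(class 1) w.r.t. each input feature."""
    base = model.predict_proba(x)[0, 1]
    grads = np.zeros(x.shape[1])
    for j in range(x.shape[1]):
        x_eps = x.copy()
        x_eps[0, j] += eps
        grads[j] = (model.predict_proba(x_eps)[0, 1] - base) / eps
    return grads

# Near-equivalent models: same data, different random initializations.
attributions = []
for seed in range(5):
    m = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500,
                      random_state=seed).fit(X, y)
    attributions.append(finite_diff_attribution(m, x0))

ensemble_attr = np.mean(attributions, axis=0)  # aggregated, more stable explanation
print("per-model std of attributions:", np.std(attributions, axis=0).round(3))
print("ensemble attribution:", ensemble_attr.round(3))
```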

GLOBE-CE: A Translation-Based Approach for Global Counterfactual Explanations

1 code implementation • 26 May 2023 • Dan Ley, Saumitra Mishra, Daniele Magazzeni

Counterfactual explanations have been widely studied in explainability, with a range of application-dependent methods prominent in fairness, recourse, and model understanding.

counterfactual • Fairness • +1
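A simplified sketch of the translation idea (one fixed global direction plus per-input scalings; the class-mean direction below is an illustrative choice, not the GLOBE-CE algorithm):

```python
# One global direction `delta` plus the smallest per-input scaling k such
# that x + k * delta flips the prediction acts as a global counterfactual
# explanation for the negatively classified inputs it covers.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1000, n_features=5, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X, y)

# Simple global direction: normalized mean difference between the classes.
delta = X[y == 1].mean(axis=0) - X[y == 0].mean(axis=0)
delta /= np.linalg.norm(delta)

def min_scaling(x, delta, predict, k_grid=np.linspace(0, 5, 101)):
    """Smallest k in the grid with predict(x + k*delta) == 1, else None."""
    for k in k_grid:
        if predict((x + k * delta).reshape(1, -1))[0] == 1:
            return k
    return None

negatives = X[clf.predict(X) == 0]
ks = [min_scaling(x, delta, clf.predict) for x in negatives]
coverage = np.mean([k is not None for k in ks])
print(f"one direction flips {coverage:.0%} of negative predictions")
```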

OpenXAI: Towards a Transparent Evaluation of Model Explanations

2 code implementations • 22 Jun 2022 • Chirag Agarwal, Dan Ley, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik, Himabindu Lakkaraju

OpenXAI comprises the following key components: (i) a flexible synthetic data generator and a collection of diverse real-world datasets, pre-trained models, and state-of-the-art feature attribution methods, and (ii) open-source implementations of eleven quantitative metrics for evaluating the faithfulness, stability (robustness), and fairness of explanation methods, enabling comparisons of explanation methods across a wide variety of metrics, models, and datasets.

Benchmarking • Explainable Artificial Intelligence (XAI) • +1
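An independent sketch in the spirit of OpenXAI's prediction-gap faithfulness metrics (not the library's own implementation): perturb the features an explanation ranks as most important and measure how much the model's output moves. Faithful explanations should point at features whose perturbation changes the output the most:

```python
import numpy as np

def prediction_gap(model, x, attribution, k=2, sigma=0.5, n_samples=50, seed=0):
    """Mean |change in model output| when the top-k attributed features are perturbed."""
    rng = np.random.default_rng(seed)
    top_k = np.argsort(-np.abs(attribution))[:k]
    base = model(x)
    gaps = []
    for _ in range(n_samples):
        x_p = x.copy()
        x_p[top_k] += rng.normal(0.0, sigma, size=k)
        gaps.append(abs(model(x_p) - base))
    return float(np.mean(gaps))

# Toy check: for a linear model, attributions equal to the true weights
# should score higher than attributions pointing at irrelevant features.
w = np.array([2.0, -1.0, 0.0, 0.0])
model = lambda x: float(w @ x)
x0 = np.ones(4)
print(prediction_gap(model, x0, attribution=w))                        # faithful
print(prediction_gap(model, x0, attribution=np.array([0, 0, 1, 1.0])))  # unfaithful
```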

Global Counterfactual Explanations: Investigations, Implementations and Improvements

no code implementations • 14 Apr 2022 • Dan Ley, Saumitra Mishra, Daniele Magazzeni

Counterfactual explanations have been widely studied in explainability, with a range of application-dependent methods emerging in fairness, recourse, and model understanding.

counterfactual • Counterfactual Explanation • +1

Diverse, Global and Amortised Counterfactual Explanations for Uncertainty Estimates

no code implementations • 5 Dec 2021 • Dan Ley, Umang Bhatt, Adrian Weller

To interpret uncertainty estimates from differentiable probabilistic models, recent work has proposed generating a single Counterfactual Latent Uncertainty Explanation (CLUE) for a data point on which the model is uncertain: an on-manifold change to the input such that the model becomes more certain in its prediction.

counterfactual
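A minimal sketch of the CLUE objective with toy stand-ins for the trained decoder and probabilistic classifier: search the latent space for a code whose decoding minimizes predictive entropy plus a proximity penalty that keeps the counterfactual close to the original input:

```python
import numpy as np
from scipy.optimize import minimize

def decoder(z):
    """Stand-in for a VAE decoder mapping latent z to input space."""
    return np.tanh(z)

def predict_proba(x):
    """Stand-in probabilistic classifier: sigmoid over a linear score."""
    p = 1.0 / (1.0 + np.exp(-(2.0 * x[0] - x[1])))
    return np.array([1 - p, p])

def entropy(p):
    return -np.sum(p * np.log(p + 1e-12))

x_orig = np.array([0.05, 0.0])   # point where the model is uncertain
lam = 0.5                        # weight of the proximity penalty

def clue_objective(z):
    x_cf = decoder(z)
    return entropy(predict_proba(x_cf)) + lam * np.linalg.norm(x_cf - x_orig)

z0 = np.arctanh(np.clip(x_orig, -0.999, 0.999))  # encode: inverse of the toy decoder
res = minimize(clue_objective, z0, method="Nelder-Mead")
x_clue = decoder(res.x)
print("original uncertainty:", entropy(predict_proba(x_orig)).round(3))
print("CLUE uncertainty:    ", entropy(predict_proba(x_clue)).round(3))
print("CLUE input change:   ", (x_clue - x_orig).round(3))
```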

δ-CLUE: Diverse Sets of Explanations for Uncertainty Estimates

no code implementations • 13 Apr 2021 • Dan Ley, Umang Bhatt, Adrian Weller

To interpret uncertainty estimates from differentiable probabilistic models, recent work has proposed generating Counterfactual Latent Uncertainty Explanations (CLUEs).

counterfactual
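Extending the sketch above to the δ-CLUE setting: collect several counterfactuals from different initializations, constrain each latent code to a δ-ball around the original, and add a repulsion term for diversity (toy decoder and classifier as before; δ, λ, γ are illustrative values):

```python
import numpy as np
from scipy.optimize import minimize

decoder = np.tanh                                  # toy VAE decoder
entropy = lambda p: -np.sum(p * np.log(p + 1e-12))
def predict_proba(x):
    p = 1.0 / (1.0 + np.exp(-(2.0 * x[0] - x[1])))
    return np.array([1 - p, p])

x_orig = np.array([0.05, 0.0])
z_orig = np.arctanh(np.clip(x_orig, -0.999, 0.999))
delta, lam, gamma = 1.5, 0.5, 1.0                  # ball radius, proximity, diversity

def project(z):
    """Keep z inside the δ-ball around the original latent code."""
    d = np.linalg.norm(z - z_orig)
    return z if d <= delta else z_orig + delta * (z - z_orig) / d

found = []
rng = np.random.default_rng(0)
for _ in range(3):
    def objective(z):
        z = project(z)
        x_cf = decoder(z)
        div = sum(np.exp(-np.linalg.norm(z - zf) ** 2) for zf in found)
        return (entropy(predict_proba(x_cf))
                + lam * np.linalg.norm(x_cf - x_orig)
                + gamma * div)                     # repel from earlier CLUEs
    z0 = project(z_orig + rng.normal(0, 0.5, size=2))  # random start inside the ball
    found.append(project(minimize(objective, z0, method="Nelder-Mead").x))

for z in found:
    print("CLUE:", decoder(z).round(3),
          "uncertainty:", entropy(predict_proba(decoder(z))).round(3))
```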
