On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

Wei, Dennis; Nair, Rahul; Dhurandhar, Amit; Varshney, Kush R.; Daly, Elizabeth M.; Singh, Moninder

Computer Science > Machine Learning

arXiv:2211.01498 (cs)

[Submitted on 2 Nov 2022]

Title:On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

Authors:Dennis Wei, Rahul Nair, Amit Dhurandhar, Kush R. Varshney, Elizabeth M. Daly, Moninder Singh

View PDF

Abstract:Interpretable and explainable machine learning has seen a recent surge of interest. We focus on safety as a key motivation behind the surge and make the relationship between interpretability and safety more quantitative. Toward assessing safety, we introduce the concept of maximum deviation via an optimization problem to find the largest deviation of a supervised learning model from a reference model regarded as safe. We then show how interpretability facilitates this safety assessment. For models including decision trees, generalized linear and additive models, the maximum deviation can be computed exactly and efficiently. For tree ensembles, which are not regarded as interpretable, discrete optimization techniques can still provide informative bounds. For a broader class of piecewise Lipschitz functions, we leverage the multi-armed bandit literature to show that interpretability produces tighter (regret) bounds on the maximum deviation. We present case studies, including one on mortgage approval, to illustrate our methods and the insights about models that may be obtained from deviation maximization.

Comments:	Published at NeurIPS 2022
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2211.01498 [cs.LG]
	(or arXiv:2211.01498v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.01498

Submission history

From: Dennis Wei [view email]
[v1] Wed, 2 Nov 2022 21:57:24 UTC (1,322 KB)

Computer Science > Machine Learning

Title:On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators