Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings

Name: Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings
Start: 2022-07-20T10:30:00-04:00
End: 2022-07-20T10:35:00-04:00
Location: Baltimore Convention Center, Baltimore, USA

Spotlight & Poster

Jan Macdonald, Mathieu Besançon, Sebastian Pokutta

Abstract

We study the effects of constrained optimization formulations and Frank-Wolfe algorithms for obtaining interpretable neural network predictions. Reformulating the Rate-Distortion Explanations (RDE) method for relevance attribution as a constrained optimization problem provides precise control over the sparsity of relevance maps. This enables a novel multi-rate as well as a relevance-ordering variant of RDE that both empirically outperform standard RDE and other baseline methods in a well-established comparison test. We showcase several deterministic and stochastic variants of the Frank-Wolfe algorithm and their effectiveness for RDE.

Date

Jul 20, 2022 10:30 AM — 10:35 AM

Event

39th International Conference on Machine Learning

Location

Baltimore Convention Center, Baltimore, USA

Deep Neural Networks Explainable Neural Networks