Treeview and Disentangled Representations for Explaining Deep Neural Networks Decisions

Prasanna Sattigeri; Karthikeyan Natesan Ramamurthy; Jayaraman J. Thiagarajan; Bhavya Kailkhura

doi:10.1109/IEEECONF51394.2020.9443487

ACSSC 2020

Conference paper

01 Nov 2020

Treeview and Disentangled Representations for Explaining Deep Neural Networks Decisions

View publication

Abstract

With the advent of highly predictive but opaque deep learning models, it has become more important than ever to understand and explain the predictions of such models. Many popular approaches define interpretability as the inverse of complexity and achieve interpretability at the cost of accuracy. This introduces a risk of producing interpretable but misleading explanations. As humans, we are prone to engage in this kind of behavior [11]. In this paper, we take the view that the complexity of the explanations should correlate with complexity of the decision. We propose to build a Treeview representation of the complex model using disentangled representations, which reveals the iterative rejection of unlikely class labels until the correct association is predicted.

Conference paper