Auditing Deep Neural Networks to Understand Recidivism Predictions
Date
2016
Department
Haverford College. Department of Computer Science
Type
Thesis
Language
eng
Access Restrictions
Open Access
Abstract
In recent years, deep neural network models have proven highly accurate on many classification benchmarks. Because of this accuracy, many non-technical fields are interested in using these models to assist in decision-making processes. However, this interest is generally tempered by the realization that it is difficult to understand which features of the data contribute to a prediction. We present a method for evaluating the effect of each feature in a data set on the predictions of a model, which we refer to as gradient feature auditing (GFA). To test this method, we trained four models (a deep neural network, SVM, SLIM, and decision tree) on recidivism data and then applied GFA to each model. The experiments confirmed that GFA can produce a ranked ordering of features. We then attempted to validate the procedure using methods from interpretable learning. Overall, GFA allows domain experts to use the most effective model of their data in the decision-making process while retaining the ability to explain how those decisions are made.
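The abstract does not spell out the GFA procedure, so the following is only a rough illustration of the general idea of auditing a black-box model for per-feature influence. It is not the thesis's method: it swaps in a simple permutation-based audit, which shuffles one feature at a time and records how much accuracy the model loses. The function `audit_features`, the toy `black_box` model, and the synthetic data are all hypothetical names and values chosen for this sketch.

```python
import random


def audit_features(predict, rows, labels, n_repeats=10, seed=0):
    """Rank features by the mean accuracy lost when each one is shuffled.

    Assumption: this is a permutation-based stand-in for a feature audit,
    not the GFA procedure described in the thesis.
    """
    rng = random.Random(seed)

    def accuracy(data):
        return sum(predict(r) == t for r, t in zip(data, labels)) / len(labels)

    baseline = accuracy(rows)
    drops = []
    for j in range(len(rows[0])):
        total = 0.0
        for _ in range(n_repeats):
            # Shuffle column j so it no longer carries information about the label.
            column = [r[j] for r in rows]
            rng.shuffle(column)
            obscured = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
            total += baseline - accuracy(obscured)
        drops.append(total / n_repeats)

    # Features with the largest accuracy drop come first.
    ranking = sorted(range(len(drops)), key=lambda j: drops[j], reverse=True)
    return ranking, drops


# Synthetic data (hypothetical): only feature 0 determines the label.
data_rng = random.Random(1)
rows = [[data_rng.gauss(0, 1) for _ in range(3)] for _ in range(500)]
labels = [int(r[0] > 0) for r in rows]


def black_box(row):
    """Toy model that is treated as opaque by the audit."""
    return int(row[0] > 0)


ranking, drops = audit_features(black_box, rows, labels)
print("feature ranking:", ranking)
print("mean accuracy drop per feature:", drops)
```

Because the audit only calls `predict`, the same loop could in principle be applied to any of the four model types named in the abstract. Note that shuffling a single column ignores correlations between features, which is one reason a more careful auditing procedure may be preferable in practice.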