Abstract

Deep neural networks are widely used for classification. These deep models often suffer from a lack of interpretability---they are particularly difficult to understand because of their non-linear nature. As a result, neural networks are often treated as "black box" models, and in the past, have been trained purely to optimize the accuracy of predictions. In this work, we create a novel network architecture for deep learning that naturally explains its own reasoning for each prediction. This architecture contains an autoencoder and a special prototype layer, where each unit of that layer stores a weight vector that resembles an encoded training input. The encoder of the autoencoder allows us to do comparisons within the latent space, while the decoder allows us to visualize the learned prototypes. The training objective has four terms: an accuracy term, a term that encourages every prototype to be similar to at least one encoded input, a term that encourages every encoded input to be close to at least one prototype, and a term that encourages faithful reconstruction by the autoencoder. The distances computed in the prototype layer are used as part of the classification process. Since the prototypes are learned during training, the learned network naturally comes with explanations for each prediction, and the explanations are loyal to what the network actually computes.
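The four-term objective described in the abstract can be sketched numerically. The following is a minimal NumPy illustration, not the authors' implementation: the encoder/decoder are stand-in linear maps, the prototype count and regularization weights are arbitrary, and training is omitted. It only shows how the four loss terms (accuracy, prototype-to-input closeness, input-to-prototype closeness, reconstruction) are computed from the distances in the prototype layer.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes (illustrative): samples, input dim, latent dim, prototypes, classes
n, d, k, m, c = 8, 10, 4, 3, 2

# Stand-in parameters: linear encoder/decoder, prototype vectors, classifier
We = rng.normal(size=(d, k))          # encoder weights
Wd = rng.normal(size=(k, d))          # decoder weights
prototypes = rng.normal(size=(m, k))  # prototype layer: m vectors in latent space
Wc = rng.normal(size=(m, c))          # classifier on prototype distances

X = rng.normal(size=(n, d))
y = rng.integers(0, c, size=n)

Z = X @ We        # encoded inputs (latent space)
X_rec = Z @ Wd    # autoencoder reconstructions

# Squared distances from each encoded input to each prototype: shape (n, m)
D = ((Z[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=-1)

# Classification uses the prototype distances as features (softmax)
logits = D @ Wc
logits -= logits.max(axis=1, keepdims=True)       # numerical stability
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)

ce = -np.log(probs[np.arange(n), y]).mean()  # 1) accuracy (cross-entropy)
r1 = D.min(axis=0).mean()   # 2) every prototype close to some encoded input
r2 = D.min(axis=1).mean()   # 3) every encoded input close to some prototype
rec = ((X - X_rec) ** 2).mean()              # 4) faithful reconstruction

lam1 = lam2 = lam_rec = 0.05  # illustrative weights, not from the paper
loss = ce + lam1 * r1 + lam2 * r2 + lam_rec * rec
```

Because the prototypes live in the same latent space as the encoded inputs, the decoder `Wd` can map a learned prototype back to input space for visualization, which is what makes the explanations faithful to the computation.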

Keywords

Autoencoder, Interpretability, Artificial intelligence, Computer science, Deep learning, Artificial neural network, Machine learning, Term (time), Black box, Encoder, Network architecture, Process (computing), Layer (electronics), Architecture, Deep belief network, Recurrent neural network


Publication Info

Year: 2018
Type: article
Volume: 32
Issue: 1
Citations: 390
Access: Closed


Citation Metrics

390 citations (OpenAlex)

Cite This

Oscar Li, Hao Liu, Chaofan Chen et al. (2018). Deep Learning for Case-Based Reasoning Through Prototypes: A Neural Network That Explains Its Predictions. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11771

Identifiers

DOI
10.1609/aaai.v32i1.11771