Content Tags

There are no tags.

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections.

RSS Source

Authors

Xin Zhang, Armando Solar-Lezama, Rishabh Singh

The paper describes a new algorithm to generate minimal, stable, and symboliccorrections to an input that will cause a neural network with ReLU neurons tochange its output. We argue that such a correction is a useful way to providefeedback to a user when the neural network produces an output that is differentfrom a desired output. Our algorithm generates such a correction by solving aseries of linear constraint satisfaction problems. The technique is evaluatedon a neural network that has been trained to predict whether an applicant willpay a mortgage.

Download PDF

More news

Continue reading and listening

Stay in the loop.

Subscribe to our newsletter for a weekly update on the latest podcast, news, events, and jobs postings.