11.2 Method: SpRay
An international cooperation investigated recent ML successes in order to find whether or not the models deliver reliably for the problems they are trained for (Lapuschkin et al. 2019)
Basics of SpRay:
- Identifies and quantifies decision-making behaviors
- Finds undesirable decisions in vast data sets
- Can be used to improve
- model or
- dataset
Lets examine a model which classifies horses
SpRay identifies 4 different prediction strategies for classifying images as “horse”
Detected prediction strategies:
- b \(\implies\) detect a horse (and rider)
- c \(\implies\) detect a source tag in portrait oriented images
- d \(\implies\) detect wooden hurdles and other contextual elements of horseback riding
- e \(\implies\) detect a source tag in landscape-oriented images
References
Lapuschkin, Sebastian, Stephan Wäldchen, Alexander Binder, Grégoire Montavon, Wojciech Samek, and Klaus-Robert Müller. 2019. “Unmasking Clever Hans Predictors and Assessing What Machines Really Learn.” Nature Communications 10 (1): 1–8.